In recent years, the rapid development of big data, cloud computing and artificial intelligence (AI) technology has brought about both opportunities and challenges to all walks of life.
In the vertical field of patent translation, the use of AI technology is gradually freeing translators from the more laborious work, and enabling them to dedicate their time to the more crucial aspects.
So, how does AI make contributions to patent translation?
It will be discussed in the following three aspects:
1. Difficulties in machine translation of patents
2. Three basic steps of machine translation
3. Translation model building
01 Difficulties in machine translation of patents
A patent or patent application basically consists of: an abstract, claims, and a description, each having different expression rules with terminology that can be difficult to understand.
For example, each claim shall always be organized into one sentence, no matter how complicated the sentence structure or how long the sentence is.
Such a sentence, though meeting the requirements in grammar and syntax, can be hard to read and sometimes incoherent.
Therefore, adding enumeration commas, commas or semicolons into the sentence in the proper places to segment the sentence appropriately as well as ending the whole sentence with a full stop could improve the readability of the claim and avoid ambiguity or misinterpretation.
However, this may bring tremendous challenges to machine translation technology which is still in development. In particular, machine translation still requires sentences having complete structures and meanings for training, and there are limits to the sentence length training with longer sentences may result in poorer translation quality and mistranslations.
Whats more, a patent document requires a high consistency of terms throughout the whole text, but machine translation is completed sentence by sentence, without understanding the context like a human, and therefore the consistency of terminology is also a challenge to the industry. This is of course a threshold for vertical industries. The one who is able to better respond to these challenges will be in a dominant position in the market.
02 Three basic steps of machine translation
The implementation process of machine translation basically includes three steps: data pre-processing, machine translation, and post-processing.
Data pre-processing mainly includes performing coding unification and text normalization on the aligned bilingual sentences, so as to meet the requirements for adaptation to translation models, for instance, amending numbers, symbols, date formats and non-standard expressions into the standard form and style.
The pre-processing stage is important for improving the quality of machine translation, and has a significant impact on the translation result. The less data noises, the better the translation quality.
Furthermore, attention should also be paid to the characteristics of different translation models, so as to perform targeted adjustment of the data pre-processing method.
Machine translation is the process of translating inputted text data into the target language. Here, the most important part of machine translation is called translation model. A translation model is a model formed by deep learning based on mass aligned bilingual sentences through AI algorithms.
Therefore, it would be better to prepare as much bilingual data having high quality and complicated structures as possible, so as to enable the model to have a higher generalization ability and better comprehensive performance.
Algorithm optimization and model training should be performed alternatively to form a spirally rising iterative process, optimizing the algorithm and parameters by iterative trainings. Transformer is an excellent open-source neural network model, which can be implemented using TensorFlow and PyTorch.
A relatively mature tool - TensorFlow serving - is used for deployment of the translation model, and PythonAPI is used for invocation. Once successfully published, the translation model is able to provide services.
Post-processing is to convert and re-arrange the translation result, splice the modeling units and process special symbols, so as to make the translation result readable. Moreover, post-processing may also include word segmentation checking, BLEU scoring, word count calculating, etc.
All these processings serve to guarantee a better upgrade of the translation model in the future. Post-processing plays the role of assisting machine translation and can improve translation normalization, but cannot improve the translation quality fundamentally. So far at the current stage of development of machine translation, post-processing is still a necessary procedure.
03 Translation model building
Training a translation model generally involves three aspects, namely linguistic data (i.e. the aligned bilingual sentences) processing, algorithm writing and model training, and deployment.
Linguistic data processing
Linguistic data processing is the first step of machine learning, also called parallel corpus building. Corpus building includes aligning and storing sentences in the source language and in the target language in a one-to-one manner.
Only when the sentences are aligned exactly, can the linguistic data be used for training of the translation model. In addition, extremely long sentences in the linguistic data also need to be processed for effective segmentation of sentences. In a technical aspect, regularization and denoising are also necessary.
Furthermore, there is no consensus about the influence of Chinese word segmentation on machine translation. Many research papers online suggest that word segmentation, if anything, leads to a better translation result.
Manual processing and technological processing mutually promote each other, and during continuous upgrading of the translation model, the quality of linguistic data plays a decisive role.
Algorithm writing and model training
In the development history of machine translation, the network model structure developed and evolved from Seq2Seq, Transformer to BERT.
At the very beginning, deep learning was completed on the basis of Seq2Seq, and CNN and then RNN activated smart machine translation. The Transformer model greatly improves the quality of smart machine translation, overcomes the defect of slow training of RNN, which is often criticized, and achieves fast parallel processing using a self-attention mechanism.
In addition, Transformer enables deep learning, sufficiently explores the characteristics of a DNN model, and improves the translation accuracy of the model. The increasingly popular BERT is also constructed on the basis of Transformer.
Transformer is a network structure published by Google in 2017 to replace RNN and CNN. It is the first model built only using attention, and enables direct acquisition of global information - this is different to RNN which obtains a global information link by continuous recursion, and different to CNN which merely acquires local information; moreover, Transformer supports parallel computing. Therefore, Transformer enables a faster speed, and can also provide better translation results.
Once the network structure is determined, it is necessary to set parameters thereof, such as batch_size, learning_rate, hidden_size, max_length, dropout and num_heads. As for the implementation of the Encoder-Decoder, there are many source codes online for the optimizer, loss value calculation and gradient updating.
After processing the network coding, it is recommended to observe the curves of the visualized graph in logs to check whether the network structure is properly configured. A proper network structure configuration and hyper-parameter setting enable curve convergence within a few hours, as shown in the graph below. Setting of hyper-parameters has a great influence on the learning curves, and the graph shows that different hyper-parameter settings result in big differences in BLEU values trained on the basis of the same data.
TensorFlow board can be used to display a visualized graph, as it is easy to operate, has a good visualization effect, and provides various curves showing different learning results under different hyper-parameters.
Alternative network structure programming and model training form a spirally rising iterative process, as the influence of algorithm selection or parameter settings need to be proven through continuous practice.
Therefore, it is important to understand the model on the basis of algorithm principles and to analyze the data fed back from practice. Only in this way, can we optimize the translation model in the correct way, and the experiences accumulated from iterative debugging practice enable better and thorough understanding of the translation model.
Herein, we list some examples based on our experience: a smooth curve indicates a high quality of linguistic data; a fluctuating curve indicates excessive noises in linguistic data; the more layers the network has, the slower the learning is, but it also means the curve could rise higher later on; the number of layers of the model requires a corresponding amount of data; GPU supports a training speed dozens of times faster than CPU; more data results in a slower decreasing of loss; a better time to adjust dropout is in the middle-to-late period, etc.
Deployment
With regard to the deployment of the translation model, Googles TensorFlow Serving can be used as an application framework. TensorFlow Serving provides, up till now, the most mature and stable application services.
TensorFlow Serving provides a flexible server architecture and supports cluster deployment, aiming to deploy and serve an ML model. A trained model can be used for predication, and TensorFlow Serving is able to export the model in a servable compatible format.
TensorFlow Serving combines the core service components together to construct a GRPC/HTTP server. This server is able to serve multiple ML models (or multiple model versions trained with same data under different parameter settings), invocation of model services is realized via an API interface obtained from an official channel, and an external service interface communicates with TensorFlow Serving end by means of gRPC and RESTfull API, so as to acquire services.
In addition, an official recommendation is to deploy the model services in combination with Docker, so as to enable high speed and convenience. Once the deployment is completed, evaluation of the translation model can be performed.
Conclusion
Throughout the history of the translation industry, the working method has developed from pure hand-writing to computer-assisted translation, and then to the AI translation of today. We believe that the development of AI technology will exert more positive effects on the patent translation industry, and contribute assistance to human translation, rather than replace human translation. The combination of human translation and AI technology will enable the best balance between efficiency and quality.
Premiword Machine Translation (www.premiword.com.cn) AI neural network machine translation based on more than 50 million bilingual sentence pairs from 120 million global patents and tens of thousands of office actions accumulated over the years, supporting Chinese-English and Chinese-Japanese translation and reverse, being expert in translation of patents in most technical fields as well as in translation of patent office actions.
An example of machine translation:
Chinese:
English Translation:
The present technology relates to an information processing apparatus, an imaging control method, a program, a digital microscope system, a display control apparatus, a display control method, and a program.
Japanese Translation:
Read more:
AI's increasing contributions to patent translation - will humans be replaced? - Lexology
- Research, Evaluation and Learning at the International Rescue Committee - World - ReliefWeb [Last Updated On: August 10th, 2020] [Originally Added On: August 10th, 2020]
- Conserving Biodiversity with AI - BBN Times [Last Updated On: August 10th, 2020] [Originally Added On: August 10th, 2020]
- DevOps Fundamentals You Ever Wanted To Know - hackernoon.com [Last Updated On: August 10th, 2020] [Originally Added On: August 10th, 2020]
- Another Perspective on Evictions - Bacon's Rebellion [Last Updated On: August 10th, 2020] [Originally Added On: August 10th, 2020]
- Amitabh Bachchan on fans alternate job suggestion: My job is now insured - The Indian Express [Last Updated On: August 10th, 2020] [Originally Added On: August 10th, 2020]
- Will You Soon Download Packaging Machine Controls from the Internet? - Packaging Digest [Last Updated On: August 10th, 2020] [Originally Added On: August 10th, 2020]
- 5 free resources every data scientist should start using today - The Next Web [Last Updated On: August 10th, 2020] [Originally Added On: August 10th, 2020]
- Who's hoping to make an Epic impact on Green Bay area music scene with a new concert venue? | Streetwise - Green Bay Press Gazette [Last Updated On: August 10th, 2020] [Originally Added On: August 10th, 2020]
- Industrial robots are dominating but are they safe from cyber-attacks? - TechHQ [Last Updated On: August 10th, 2020] [Originally Added On: August 10th, 2020]
- Friday Rant - Rise of the Rogue-Bots? - Diginomica [Last Updated On: August 10th, 2020] [Originally Added On: August 10th, 2020]
- Important Reasons Why You Should Pick RoR As Your Web-Based Development Project - Customer Think [Last Updated On: August 10th, 2020] [Originally Added On: August 10th, 2020]
- Portrait of the software developer as an artist - ComputerWeekly.com [Last Updated On: August 10th, 2020] [Originally Added On: August 10th, 2020]
- Python may be your safest bet for a career in coding - Gadgets Now [Last Updated On: August 10th, 2020] [Originally Added On: August 10th, 2020]
- 1Password is coming to Linux - ZDNet [Last Updated On: August 10th, 2020] [Originally Added On: August 10th, 2020]
- IBM creates an open source tool to simplify API documentation - TechRepublic [Last Updated On: August 10th, 2020] [Originally Added On: August 10th, 2020]
- Mastercard : Accelerate Ignites Next Generation of Fintech Disruptors and Partners to Build the Future of Commerce - Marketscreener.com [Last Updated On: August 12th, 2020] [Originally Added On: August 12th, 2020]
- Expanding the Universe of Haptics | by Lofelt | Aug, 2020 - Medium [Last Updated On: August 12th, 2020] [Originally Added On: August 12th, 2020]
- UX Designer Salary: 5 Important Things to Know - Dice Insights [Last Updated On: August 12th, 2020] [Originally Added On: August 12th, 2020]
- Persistent memory reshaping advanced analytics to improve customer experiences - IT World Canada [Last Updated On: August 12th, 2020] [Originally Added On: August 12th, 2020]
- NextCorps and SecondMuse Open Application Period for Programs that Help Climate Technology Startups Accelerate Hardware Manufacturing - GlobeNewswire [Last Updated On: August 12th, 2020] [Originally Added On: August 12th, 2020]
- Buried deep in the ice is the GitHub code vault humanity's safeguard against devastation - ABC News [Last Updated On: August 12th, 2020] [Originally Added On: August 12th, 2020]
- Top 12 Most Used Tools By Developers In 2020 - Analytics India Magazine [Last Updated On: August 12th, 2020] [Originally Added On: August 12th, 2020]
- Facebook's React 17 JavaScript library: Here's why its top feature is 'no new features' - ZDNet [Last Updated On: August 12th, 2020] [Originally Added On: August 12th, 2020]
- CORRECTING and REPLACING Anyscale Hosts Inaugural Ray Summit on Scalable Python and Scalable Machine Learning - Business Wire [Last Updated On: August 12th, 2020] [Originally Added On: August 12th, 2020]
- Google: Here's how much we give to open source through our GitHub activity - ZDNet [Last Updated On: August 12th, 2020] [Originally Added On: August 12th, 2020]
- How Chriselle Lim And Joan Nguyen Created Bmo, The Coworking Space And Virtual Classroom Of The Future (With A Childcare Twist) - Forbes [Last Updated On: August 13th, 2020] [Originally Added On: August 13th, 2020]
- How Will Public Libraries Adapt To New School Year Norms? - Book Riot [Last Updated On: August 13th, 2020] [Originally Added On: August 13th, 2020]
- Google: We'll test hiding the full URL in Chrome 86 to combat phishing - ZDNet [Last Updated On: August 13th, 2020] [Originally Added On: August 13th, 2020]
- How to install Python 3 and PIP 3 on Ubuntu 20.04 LTS - Linux Shout - H2S Media [Last Updated On: August 13th, 2020] [Originally Added On: August 13th, 2020]
- What are Bitcoin Wallets: Everything You Need to Know - Programming Insider [Last Updated On: August 13th, 2020] [Originally Added On: August 13th, 2020]
- JSHint is Now Free Software after Updating License to MIT Expat - WP Tavern [Last Updated On: August 13th, 2020] [Originally Added On: August 13th, 2020]
- How to learn JavaScript: These are the best online courses - Mashable [Last Updated On: August 13th, 2020] [Originally Added On: August 13th, 2020]
- What developers need to know about inter-blockchain communication - ComputerWeekly.com [Last Updated On: August 14th, 2020] [Originally Added On: August 14th, 2020]
- Introducing the CDK construct library for the serverless LAMP stack - idk.dev [Last Updated On: August 14th, 2020] [Originally Added On: August 14th, 2020]
- IBM asked software developers to take on the wrath of Mother Nature - The Drum [Last Updated On: August 14th, 2020] [Originally Added On: August 14th, 2020]
- Aspire Technology Launches First Truly Secure Public Blockchain for Creation of Digital Assets - GlobeNewswire [Last Updated On: August 14th, 2020] [Originally Added On: August 14th, 2020]
- GM Creates And Shares New Workplace Safety Technologies - Pulse 2.0 [Last Updated On: August 14th, 2020] [Originally Added On: August 14th, 2020]
- Key Considerations and Tools for IP Protection of Computer Programs in Europe and Beyond - Lexology [Last Updated On: August 14th, 2020] [Originally Added On: August 14th, 2020]
- The state of application security: What the statistics tell us - CSO Online [Last Updated On: August 14th, 2020] [Originally Added On: August 14th, 2020]
- Open Source: What's the delay on the former high/middle school on North Mulberry? - knoxpages.com [Last Updated On: August 14th, 2020] [Originally Added On: August 14th, 2020]
- The Risks Associated with OSS and How to Mitigate Them - Security Boulevard [Last Updated On: August 14th, 2020] [Originally Added On: August 14th, 2020]
- news digest: Microsoft launches open source website, TensorFlow Recorder released, and Stackery brings serverless to the Jamstack - SD Times -... [Last Updated On: August 14th, 2020] [Originally Added On: August 14th, 2020]
- Build Your Own PaaS with Crossplane: Kubernetes, OAM, and Core Workflows - InfoQ.com [Last Updated On: August 17th, 2020] [Originally Added On: August 17th, 2020]
- ISRO Is Recruiting For Vacancies with Salary Upto Rs 54000: How to Apply - The Better India [Last Updated On: August 17th, 2020] [Originally Added On: August 17th, 2020]
- Does technology increase the problem of racism and discrimination? - TechTarget [Last Updated On: August 17th, 2020] [Originally Added On: August 17th, 2020]
- CORRECTING and REPLACING Anyscale Hosts Inaugural Ray Summit on Scalable Python and Scalable Machine Learning - Yahoo Finance [Last Updated On: August 17th, 2020] [Originally Added On: August 17th, 2020]
- In the City: Take advantage of open recreation, cultural and park amenities - Coloradoan [Last Updated On: August 17th, 2020] [Originally Added On: August 17th, 2020]
- Exploring the future of modern software development - ComputerWeekly.com [Last Updated On: August 21st, 2020] [Originally Added On: August 21st, 2020]
- Hadoop Developer Interview Questions: What to Know to Land the Job - Dice Insights [Last Updated On: August 21st, 2020] [Originally Added On: August 21st, 2020]
- SiFive Opens Business Unit to Build Chips With Arm and RISC-V Inside - Electronic Design [Last Updated On: August 21st, 2020] [Originally Added On: August 21st, 2020]
- Use Pulumi and Azure DevOps to deploy infrastructure as code - TechTarget [Last Updated On: August 21st, 2020] [Originally Added On: August 21st, 2020]
- Why ASP.NET Core Is Regarded As One Of The Best Frameworks For Building Highly Scalable And Modern Web Applications - WhaTech [Last Updated On: August 21st, 2020] [Originally Added On: August 21st, 2020]
- NITK figures 4th in Google Summer of Code ranking - BusinessLine [Last Updated On: August 21st, 2020] [Originally Added On: August 21st, 2020]
- Learn More About Dynamo for Revit: Features, Functions, and News - ArchDaily [Last Updated On: August 21st, 2020] [Originally Added On: August 21st, 2020]
- Linux Foundation showcases the greater good of open source - ComputerWeekly.com [Last Updated On: August 21st, 2020] [Originally Added On: August 21st, 2020]
- Programming language Kotlin 1.4 is out: This is how it's improved quality and performance - ZDNet [Last Updated On: August 21st, 2020] [Originally Added On: August 21st, 2020]
- Top 10 Languages That Paid Highest Salaries Worldwide In 2020 - Analytics India Magazine [Last Updated On: August 21st, 2020] [Originally Added On: August 21st, 2020]
- Programming language Rust: Mozilla job cuts have hit us badly but here's how we'll survive - ZDNet [Last Updated On: August 21st, 2020] [Originally Added On: August 21st, 2020]
- In-App Bidding Gathers Steam, But Adoption Looks Nothing Like Header Bidding On The Web - AdExchanger [Last Updated On: August 21st, 2020] [Originally Added On: August 21st, 2020]
- 13 thoughts on Fitting Snake Into A QR Code - Hackaday [Last Updated On: August 21st, 2020] [Originally Added On: August 21st, 2020]
- Newham test and trace app was designed by man who grew up in the borough - Newham Recorder [Last Updated On: August 21st, 2020] [Originally Added On: August 21st, 2020]
- 'Trapped in a code' the fight over our algorithmic future - Open Democracy [Last Updated On: August 21st, 2020] [Originally Added On: August 21st, 2020]
- Telegram launches one-on-one video calls on iOS and Android - The Verge [Last Updated On: August 21st, 2020] [Originally Added On: August 21st, 2020]
- AWS Controllers for Kubernetes Will Be A 'Boon For Developers' - CRN: Technology news for channel partners and solution providers [Last Updated On: August 21st, 2020] [Originally Added On: August 21st, 2020]
- Coding within company constraints - ComputerWeekly.com [Last Updated On: August 21st, 2020] [Originally Added On: August 21st, 2020]
- Open Source and Open Standards: The Recipe for Success Featured - The Fast Mode [Last Updated On: August 21st, 2020] [Originally Added On: August 21st, 2020]
- How Intel helped give the worlds first cyborg a voice - The Next Web [Last Updated On: August 21st, 2020] [Originally Added On: August 21st, 2020]
- Tiger Woods, Rory McIlroy near bottom of field at The Northern Trust - ESPN [Last Updated On: August 22nd, 2020] [Originally Added On: August 22nd, 2020]
- Intel Owl OSINT tool automates the intel-gathering process using a single API - The Daily Swig [Last Updated On: August 22nd, 2020] [Originally Added On: August 22nd, 2020]
- IOTA Foundation presents the current projects in the mobility industry - Crypto News Flash [Last Updated On: August 22nd, 2020] [Originally Added On: August 22nd, 2020]
- How 'Fortnite' and 'Second Life' Shaped the Future of Indian Market - Santa Fe Reporter [Last Updated On: August 22nd, 2020] [Originally Added On: August 22nd, 2020]
- Apple Enters $ 2 Trillion Club, Github's Chinese Counterpart And More In This Week's Top News - Analytics India Magazine [Last Updated On: August 22nd, 2020] [Originally Added On: August 22nd, 2020]
- As world grapples with pandemic, schools are the epicenter - ABC News [Last Updated On: August 24th, 2020] [Originally Added On: August 24th, 2020]
- Why Businesses Should Embrace Modernizing Their Legacy Applications - TechBullion [Last Updated On: August 24th, 2020] [Originally Added On: August 24th, 2020]
- Is It Time To Rename RPG? - IT Jungle [Last Updated On: August 24th, 2020] [Originally Added On: August 24th, 2020]
- Phantasy Star Online programmers on breaking new ground and their Diablo-style isometric prototype - Polygon [Last Updated On: August 24th, 2020] [Originally Added On: August 24th, 2020]
- How To Learn To Program In Python By Playing Videogames - Analytics India Magazine [Last Updated On: August 24th, 2020] [Originally Added On: August 24th, 2020]
- New Microsoft program to help develop the quantum computing workforce of the future in India - Microsoft [Last Updated On: August 24th, 2020] [Originally Added On: August 24th, 2020]
- How the Docker Revolution Will Change Your Programming, Part 1 - Walter Bradley Center for Natural and Artificial Intelligence [Last Updated On: August 24th, 2020] [Originally Added On: August 24th, 2020]
- The art of developing happy customers - ComputerWeekly.com [Last Updated On: August 24th, 2020] [Originally Added On: August 24th, 2020]