Your single source for new lessons on legal technology, e-discovery, and the people innovating behind the scenes.

The Case for Custom Bilingual Glossaries

The primary aim of e-discovery software is to process and analyze tens of thousands of documents (sometimes millions) in the quest to find and organize relevant files with incredible speed—while understanding the meaning and subplots within. When it’s done well, these platforms save tremendous amounts of time and money.

Still, e-discovery can grind to a halt—or go terribly wrong—if the context of foreign language content is misunderstood. It takes a combination of human and artificial intelligence to get that context right.

Producing correct meaning is one of the biggest challenges for e-discovery translation software. In fact, this is a challenge for any translation project. Getting the literal definitions of words is not enough; meaning can change greatly depending on how words are used in a sentence.

Fortunately, for translating foreign language documents there is a powerful tool—the custom bilingual glossary. It exists to clarify context for important yet still nonsensical or ambiguous passages.

The Contextual Conundrum

Machine translation is a helpful way for legal teams to get a quick understanding of the foreign language content in their e-discovery databases. But you have to be careful with raw machine translation output.

Why is that? Context is often misunderstood or misrepresented by machine translation workflows. Here are some of the main reasons.

  • A machine translation database is not, and realistically cannot be, 100 percent accurate. Such databases rely, in part, on bilingual translations of text, in which not all variants of a word are included. Doing so would overtax the system, particularly for complex languages like English and in other languages, such as German, where new words are invented all the time—making it impossible to keep up.
  • Words can have multiple meanings—particularly when used as part of an expression—that go beyond prescribed definitions and into the realm of connotation and cultural understanding. Even the most sophisticated AI translation engines often need guidance on which of two or more meanings is the correct one in a particular context.
  • Company names, product names, and personal names are generally not translated. But machine translation engines don’t know this. So, their attempts to translate such words usually produce unintelligible results, or they need instruction regarding when not to translate these words or expressions.
  • Industry-specific terms and acronyms may not be in the database and, therefore, won’t be recognized or understood by a machine translation engine. Supplementary manual instructions are required to help the machine translation engine produce the correct translation of these outlier expressions.

Combinations of these technical and linguistic complications mean that automatically translated data just won’t cut it when it’s at the crux of your case. So how can you bridge the gaps in understanding your foreign language data without breaking the bank?

What are Custom Bilingual Glossaries?

Custom bilingual glossaries are essentially a list of words and expressions that contain either “do not translate” guidance or precise instructions on which meaning to use in a particular context. Think of the glossary as the specialized code to translate predetermined words in a specific language pair, and for a specific client and legal case.

Custom bilingual glossaries are created by skilled translators and terminologists working in cooperation with the legal team and/or client representatives. After the first glossary iteration, the team typically reviews subsequent passes of machine translation output to identify additional words or expressions that have been mistranslated by the software.

The bilingual glossary is created to add preferential information that instructs the engine to  make correct choices in accordance with the needs of the case. Several iterations are usually necessary to fully obtain the desired results. Since the changes involve key passages of the text, the improvement in output performance is sometimes amazing. You get better readability and accuracy.

What Makes Custom Bilingual Glossaries Helpful?

Custom bilingual glossaries are a small investment, yet they deliver great value in improving accuracy and comprehension. By working with a trusted, experienced language services partner to build a custom language glossary for your next e-discovery project, you’ll be producing a higher quality output that gives your team a better chance of securing a positive case outcome.

e-Discovery is still a significant investment, and there are many checks and balances in place to ensure you strike the right note when it comes to comprehension and cost. Custom bilingual glossaries can help you find that equilibrium.

Explore the Relativity App Hub


Mark Ettinger is Vice President of Linguistic Systems, Inc. in Boston, MA. Linguistic Systems offers more than 50 years of translation services experience across multiple industries. Ai Translate by LSI is the company’s technology-based solution that has served the legal industry and e-discovery service providers for nearly a decade. The Ai Translate Plugin for Relativity is current through all supported versions of Relativity and RelativityOne.

The latest insights, trends, and spotlights — directly to your inbox.

The Relativity Blog covers the latest in legal tech and compliance, professional development topics, and spotlights on the many bright minds in our space. Subscribe today to learn something new, stay ahead of emerging tech, and up-level your career.

Interested in being one of our authors? Learn more about how to contribute to The Relativity Blog.