Basic level: First steps into migration

You are not familiar with translation management systems, your experience is a bit rusty, or you just want a short overview?

We've got your back!

In this part of our guide you will learn about some basic concepts and how to use them during your data migration process.

First, you need to know that memoQ has two kinds of resources:

Resources are information used regularly during translation or translation management. They make your work in memoQ more efficient or enhance its quality. You and your colleagues can reuse them in different projects.

Want to learn more? Read the documentation page about all types of resources.

Let's start with heavy resources

Heavy resources contain a large amount of language data, for example, monolingual or bilingual text, pairs of segments, glossary entries, or a large amount of statistical data. Here, we will talk about two the most useful when migrating data:

  • Translation memories

    Translation memories (TMs) are the key to every modern CAT tool (Computer-Aided Translation tool) or TMS (Translation Management System).

    When you confirm a segment in the translation editor, memoQ saves the source text and its corresponding target text into a translation memory. So the TM contains pairs of equivalent sentences in both source and target languages.

    When you translate or pre-translate a document, memoQ will scan every translation memory attached to the project, and it will show the segments where the source text is the same or similar enough to the segment you need to translate, so you can re-use your previous translation. This is how you can recycle your work and save time.

    Translation memories in memoQ are bilingual. This means that each TM includes one language combination only. If they are online TMs (they usually are), every user in a project can see the changes made to the TM in real time.

  • LiveDocs corpora

    A LiveDocs corpus is a collection of documents. It can contain monolingual documents, bilingual document, as well as aligned document pairs. Thanks to LiveDocs corpora you can reuse existing translations (look up phrases and segments) without going through translation memories.

    You can also use it to add background material to the project - documents and document pairs of any language. A LiveDocs corpus offers matches in all the languages of the documents in it.

  • Metadata

    Metadata in memoQ is basically all information that are stored alongside with a translation memory entry (or term in a termbase), besides the actual source segment, target segment (and the context of a segment).

    Metadata also appear on several other levels within memoQ, for example in projects, translation memory and term base properties. The translator can also see them for a given match. The main purpose of metadata is to define relations between all of these levels, but also to describe the content of a resource.

It's time for the light resources!

What are light resources?

Light resources are settings that doesn't contain actual text or too much data. They are here to help memoQ make your work more efficient.

We will focus on those that are most related to your workflows (not to specific types of content) and are rarely changed:

  • Segmentation rules

    When you add a translation document to a project, it is segmented during the import process - the text is split into segments, or translation units. Then translator must create a translation for every unit, creating a translation pair that can be stored in a translation memory.

    Segmentation rules define how this text is split. In most cases, segmentation is performed on the sentence level. This means punctuation is used to mark the end of a segment (there are special rules to handle abbreviations or number formats.)

  • QA settings

    memoQ can check many things in the translation automatically. These automatic checks are called quality assurance checks.

    A QA settings resource tells memoQ what to check and how. For example, you can choose to check terminology, consistency, and segment length, or simply the inline tags only.

  • TM settings (and LiveDocs settings)

    A TM settings or LiveDocs settings resource tells memoQ how to get matches from a translation memory or a LiveDocs corpus.

    This means two things:

    • match thresholds - set the minimum match rate of any match that memoQ shows, and tell memoQ what counts as a good match.

    • penalties - used in pre-translation. Penalties are used when a match is not reliable. If the translation is bad or inadequate, the match will be worse than the actual match rate.

Want to learn more?

Check Advanced level: the more I know...