Translation memory

7 articles

Set up translation memories

Setting up translation memories | Smartcat Help Center Translation memories (TMs) are the single most important linguistic asset to optimize your translation performance. They improve quality and consistency, reduce project turnaround times, and decrease the number of words to translate by pre-populating new translations with previously translated content that matches current content for translation. For these reasons, it's crucial that they are set up properly when you create an account on Smartcat. In addition to enhancing project workflow, correct creation of TMs also averts the possibility of missed word and phrase matches, which would mean higher translation costs. Essential information on Smartcat translation memories Here are some essential need-to-know facts about TMs in Smartcat. Limitless TM creationIn Smartcat, you can create as many TMs as you like. For instance, you may wish to have a different TM for each specific subject matter focus for your content. As an example, a corporate legal department might have one TM for product terms and conditions-related content and another for employee contracts. Limitless data storageThere are also no limits as to the amount of data that you can store in your TM databases in your Smartcat account. You can organize TMs using client and project group labels. TMs can be single-language or multilingualAs well as the standard one source and one target language form of TM, Smartcat also enables you to create multilingual translation memories. This means that you can have one source language and as many target languages as you want. This helps reduce the number of TMs that you would need to create if you could only use single-language TMs, helping with organization and workflow management. How to create translation memories in Smartcat Creating a new TM in Smartcat is simple. Step 1Scroll down and click on Linguistic Assets via your Smartcat workspace home page. Step 2Click Create TM. Clicking Create TM will generate the following dialogue box.Note: the numbers have been superimposed to provide instructions below Field 1: TMX, SDLTM OR XLSX fileImport a TM file in one of the available formats. Field 2: NameAssign a unique name. In many cases, it is recommended to choose a name based on the customer name to simplify data management. Fields 3 and 4: Source and target languagesChoose the source language (3) and all the target languages (4). There can be only one source language but you can choose as many target languages as needed for a specific client. In most scenarios, matches are found based on comparing new sentences with sentences stored as the source language but Smartcat now allows translation memories to be reversed automatically when used as reference (in read-only mode) and matches can be found in the target language in this case. Fields 5 and 6Labels can be assigned for a client (5) and project tag (6). Field 7: SubjectThere is an option to define a subject (7) for the translation memory but this option is generally redundant if the translation memory is assigned to a specific client. It could be useful when dealing with clients who work across multiple fields and separate translation memories need to be maintained. Field 8: CommentsYou can also add comments or a description (8) for each translation memory. Click Save when you are done. That's it! With this brief article, you should now know how to correctly create a TM in Smartcat, with one or more target languages.

Organize your translation resources

Organizing resources | Smartcat Help Center Smartcat offers numerous translation resources, or, linguistic assets, to help ensure quality and consistency in your translations. Translation memories Smartcat supports multilingual translation memories, which makes creating and managing translation memories easy! To create a translation memory, you need to follow these steps:Click on Linguistic Assets from your Smartcat workspace: 2. Make sure you have chosen Translation Memories from the drop-down menu: 3. Select “Create TM.” Quick note : One of the less obvious advantages of multilingual TMs is that you can create a new TM where the source language is one of the target languages of the original TM. This can be convenient if you work in mixed language pairs. 4. Fill in the fields: 5. You can upload external translation memories in the following file formats: TMX, SDLTM or XLSX file. Or, you can also add new terms one-by-one, or as you go along. Once you have created several TMs, your list might look like this: Glossaries You can now use the same approach to organize your terminology databases, which are called glossaries in Smartcat. To do this, go to Linguistic Assets → Glossaries. You can create a glossary using the same client and project group labels: Here we have replicated the same structure as the one for translation memories. Just like TMs, glossaries are multilingual in Smartcat. Unlike TMs, though, they don’t have a strictly set source language — you can use a glossary in projects where any two of its languages are present as the source or the target. How detailed should you go? With both TMs and glossaries, you can go with a high level organization structure with one resource per customer, or do something similar to what we described above. Our suggestion is to mix both on a case by case basis:For clients with a complex corporate structure, it is a good idea to match that structure with the resources.For smaller clients, a single terminology database will take less time to configure and will be easier to maintain. If you want to go hardcore, you can create a new TM for every project, while making sure to properly choose the client and project group for each. Smartcat will still fetch these “micro-resources” for subsequent projects, as described in the next section. Note: While creating a new TM with each project might help with data segregation, it might also create a lot of TM duplication that could reduce translator productivity. Putting it all to work Now, whenever you create a project and choose a client/project group, Smartcat will automatically fetch the translation memories and glossaries associated with them: This ensures that you leverage every possible resource related to said client and project group, eliminates many potential project creation errors that lead to unnecessary work, and just saves time. Takeaway Organizing your data properly from the get-go increases the productivity of your project managers, who will no longer spend time looking for matching resources. This will also make things easier for your translation team, who will benefit from optimized TM leveraging and consistent terminology. Ultimately, this will increase your translation throughput, profit, and quality alike. This article was created in partnership with Braahmam .

Manage linguistic resources in the Editor

Managing linguistic resources | Smartcat Help Center Smartcat includes a variety of tools like translation memories and glossaries that help you improve translation quality and consistency. Translation memory matches represent the segments that were previously translated (each time you set up a project you can assign a translation memory to it or it ill be created automatically) in one of your projects or taken from an imported translation memory linked to the project. Glossary matches represent words and phrases taken from a glossary linked to the project. Translation memory and glossary matches are displayed in the CAT Panel. Inserting translation memory and glossary matches Translations from the TM are sorted in the descending order of the match percentage. By default the system uses the translation with the highest match percentage. By you are free to make a different choice, go with AI translation results or correct the segment to your liking. To insert a TM match, press Ctrl+the number of the match in the list. You can also set up automated insertion of TM matches when creating a project. Glossary matches are displayed in the CAT panel along with the TM matches and are inserted in the same manner. The terms are also highlighted in yellow in the source text. Managing TM matches This option is only available for project managers or linguists assigned to the editing stage. All other users can view the TM match but not make changes to it. Clicking on the pencil icon or information icon (an “i” inside a circle) depending on the user role at the top-right corner of a TM match will open a dialog box, which allows users to either edit or remove the selected TM match. Managing glossary terms You can add terms into glossaries applied to the project either on the glossary page or directly in the Editor. Note, that if you are working on a project that a client has created in the system, you might not have the right to work with glossaries, but you can still suggest a term. In this case, the client will see the suggestion then add it to the glossary, edit, or reject it. To add or suggest a term, select it with your mouse and press Ctrl+E. Then fill in the required fields and press Add . The term will be added to the glossary and will then be available in the CAT panel. As opposed to translation memories where only one TM is selected for writing, it is possible to add (or suggest) terms to any of the glossaries associated with the project. Editing terms To edit a term, select it in the Editor panel, and click Edit in the CAT panel. Make changes you want and click Save. You can also work with terms on the glossary page. Click Go to term in the CAT panel. The glossary will open in the new browser tab.

Understand glossaries and translation memories

Translation memories, glossaries | Smartcat Help Center Critical localization projects, especially the more technical, require extra attention to terminology and phrasing to ensure consistency. That’s why we let our users create and use translation memories and glossaries. Translation memories (TMs) are databases of previously translated sentences, usually in CAT tools, and glossaries are databases of set terms. You can use them as translation references for your projects and save time and money on editing and keep translation quality standards high. Glossaries are organized collections of approved translations for specific terms. These can be technical terms, industry-specific jargon, or just set words that are used frequently in your content. While you can use TMs to search for previous translations of a given term, glossaries provide a much more structured and organized interface, making them easier to use and maintain. How do translation memories work? Each time you edit and confirm a segment in the Editor it’s saved in Smartcat’s internal database for further use. Translation memories automatically retrieve and suggest previously translated text when the technology detects identical sentences or similar fragments of text. When you get a suggestion from the translation memory, you can either accept the proposed translation, edit it, or ignore the suggestion and translate the segment from scratch. How do translation memories help you? Saving time. The more content that is already translated, the less work the translator and the editor have to do.Improving quality. Translations are more consistent in terms of terminology and style.Cutting costs. The less work involved in the translation, the less you’ll have to pay translators and editors. In addition, Smartcat offers special rates for TM matches. If we take a document containing 31 words, where 14 words are new, 12 words are fuzzy matches, and 7 words are repetitions, this means you’ll save almost 40% on translation costs. The calculation looks like this: (14 × 1) + (12 × 0,4) + (7 × 0) = 18.8. Translation memory settings while creating a project https://help.smartcat.com/adding-translation-memory-project/ Importing translation memory https://help.smartcat.com/1539635-importing-exporting-translation-memories/ Deleting translation memory https://help.smartcat.com/1539671-deleting-or-overwriting-a-translation-memory/ How do glossaries work? After you create or import a glossary you can associate it with a specific project. After, when a glossary term is detected in the source text, the system will automatically offer the stored translations for the term. You can accept or decline the word or phrase suggestion. How do glossaries help you? Diversity. You can have the same term translated differently for different clients as it's stored in different glossaries. Easier collaboration. Your translators always have enough context even if they have to switch between multiple projects. Speed and quality. Glossaries help to ensure consistency the same way as translation memories do.

Understanding word match levels in Smartcats CAT tool

Word match levels | Smartcat Help Center Learn the differences between word match levels in Smartcat's CAT tool, Smartcat Editor It can sometimes be confusing differentiating between the different level of word matches when using a computer-assisted-translation (CAT) tool like Smartcat Editor. In this article, let's look at these differences, explaining 100% matches compared to 101% and 102% matches, and also fuzzies and near matches. 100% and 101%/102%. What's the difference? 101/102% matches are called by different names, depending on the CAT tool. also referred to as context matches, perfect matches or ICE matches. When a segment is stored in a Smartcat translation memory, Smartcat stores not only the source and target text, but also the content of the previous and following source segment. For example, this is what you might see in the TM. <Previous Segment> I live in a small village. <Source Segment> I have a small house. <Translated Segment> J'ai une petite maison. <Following Segment> It is blue. The translation is stored only for the segment that is being translated, but the other two segments are used to provide context. 100% matches explained If this same segment was encountered again, and neither of the two accompanying segments matched the TM-store, there would be a 100% match because only the text matches. 101% matches explained If in the next document, one of the context sentences was present, there would be a 101% match. 102% matches explained If both were present, it would be a 102% match. Having the context sentences match what is stored in the TM helps increase the certainty that the translation is a perfect match for new segment. In practice, 101/102% matches are often locked during pre-translation by project managers when the project is started because customers don't pay for these segments in most cases. Explaining nearly exact and fuzzy matches in Smartcat Let's look at the difference between nearly exact and fuzzy matches, as well as the different tiers of fuzzy matches. Nearly exact match (95%-99%)The source text in the segment is identical to the match, albeit with minor discrepancies in numbers, tags, punctuation marks, or spacing. In pre-translation, this represents a good match by default, though it can be customized. Fuzzy match (50%-94%)The source text closely resembles the source text in the match, yet some variations already exist in the text. When it comes to the required editing, three categories of fuzzy matches can be identified. High fuzzy (85-95%): For segments of average length or longer (typically 8-10 words or more), there is usually a discrepancy of just one word.Medium fuzzy (75-84%): In segments of average length or longer (8-10 words or more), typically there is a variation of two words.Low fuzzy (50%-74%): In segments of average length or longer (8-10 words or more), the difference encompasses more than two words. In pre-translation, the term "any match" encompasses all types of partial matches together, commencing at 50% by default, though this can also be adjusted to suit preferences.

Delete or overwrite a translation memory

Managing memory deletion or overwriting | Smartcat Help Center In this brief article, let's look at how to delete and overwrite translation memories (TMs). How to delete a translation memory If you don't need a TM, select it from your TM list and click Delete. Keep in mind that you can't delete a TM if it is still in use in any project. If such a TM has to be deleted, you may add another TM for the project or delete the whole project.Also, please note that a deleted TM cannot be restored. How to overwrite a translation memory If you realize that a TM contains erroneous information, you can overwrite the content with updated, correct, data. Option 1 If you already have another approved version of the TM in an existing file, you can use that file. Option 2 You can also download the TM in a TMX format and use a text editor to make corrections as needed. When you overwrite a translation memory, the Smartcat system removes all existing translation units from the TM. Only after this step does it then import the correct TMX, SDLTM, or XLSX file. Overwrite a TM in three steps Step 1 Select it from your TM list and click Overwrite TM Step 2 Click Add and select the file from your computer Step 3 Click Import Smartcat will process the file overwrite command and display messages to inform you of its progress. That's it! With this information, you should now know how to delete and overwrite your TMs in Smartcat.

Choose an translation engine

MT engine selection | Smartcat Help Center Smartcat has eight industry-leading translation engines that are used to instantly translations, whether that be for files, websites, software, designs, videos – anything you need translated can first be pre-translated with high accuracy. The machine translation engines are: Google Statistical translation engineGoogle Neural translation engineMicrosoft Translator - This includes statistical and neural engines depending on the language pair.Yandex translation engineBaidu Translate APIDeepLAmazon TranslateModernMT Translation engine Intelligent routing analyzes your text and selects the best translation engine for the specific language pair. When you translate a document or create a translation project, you can enable intelligent routing or select translation engine (available for Unite subscribers and higher). Watch the video or read through the steps as described below the video: When translating a document: click on the Advanced settings when creating translation and select the desired Provider for machine translation in the window that opens. Press Apply now. When creating a translation project: open the Linguistic assets list from your account homepage or from the projects Overview page, click on the gear icon in the machine translation section and adjust the settings in the window that opens. Press Save&Run. 3. It is possible to select a different engine for each language. By clicking Add option, you can add languages to a specific engine. Each engine supports a specific number of languages. So some translation engines may become unavailable if there is no coverage from these engines. Backup machine translation engine We also have a feature that greatly improves the pre-translation performance! If the default machine translation engine provides an empty or broken result, meaning there is a critical error show in the Smartcat Editor, the platform will automatically fall back to its second machine translation engine option, that being Google Neural Machine Translation (NMT). As such, you won't have to worry about errors (or rather, this will decrease significantly). FAQ Which translation engine is best for my projects? There is no good answer to this question as the final choice of an translation engine might come down to many possible factor. For example, some translation engines produce better results for a specific language pair or if equally capable, results might be affected by the topic. Some companies publish some analysis of AI output and rank engines for some language pairs. One such company is inten.to, and they post some results on a regular basis. The latest study is available here.Studies like this one might give some insights into what engine is better for some language pairs, but when dealing with a large project, users should perform some testing with their translation teams and collect some feedback. The perception from translators might vary from the automated results generated for the studies. Testing a couple of thousand words with selected engines should be sufficient in most cases. I already have an account with Google/Microsoft. Can I use it with Smartcat? If you have accounts for the custom Microsoft and Google translation engines you can connect them to Smartcat.To do so you need to provide your credentials in the Workspace settings section and contact our support team.Once the accounts are connected, the corresponding AI engines will be available for use among the rest of Smartcat translation engines. The words translated using these engines will be deducted from your Smartwords balance, as per usual. What machine translation engines are integrated with Smartcat? The following engines are integrated with Smartcat.Google Statistical translation engineGoogle Neural translation engineMicrosoft Translator - This includes statistical and neural engines depending on the language pair.Yandex translation engineBaidu Translate APIDeepLAmazon TranslateModernMT translation engineTo learn more about selecting a translation engine, refer to this article.