YouTube

Top 10 YouTube channels for translation technology

YouTube is one of the most popular social medias nowadays and many actors of the translation industry – both companies/organizations and individual translators – are using it to promote their products or their views. I have listed in the table below the top 10 channels (by number of subscribers), as per December 9th, 2014. Do let me know if you feel I have forgotten a popular channel!

Channel's nameSubscribersViewsVideosCreated
1. SDL Trados3,505730,809140Mar-2009
2. CATguruEN1,822272,045117Oct-2011
3. Wordfast TM1,037107,49625Oct-2011
4. Marta Stelmaszak94129,01720Apr-2011
5. TAUS Videos41782,399339Dec-2009
6. LionbridgeTech30424,73756Nov-2009
7. memoQ30127,167181May-2009
8. Atril Déjà Vu20585,63235Jun-2011
9. Kevin Lossner18923,30356Apr-2011
10. Paul Filkin??16,52751Oct-2006

Import from Excel into DVX3 multilingual termbase

Déjà Vu X3 supports multilingual termbases. This video shows how to create such a termbase (using the Minimal template) and how to import terminology into it from an Excel sheet with four languages (English, French, Finnish, Swedish). The Excel file needs to be saved as XLS, as Déjà Vu X3 doesn’t support the newer XLSX format. For optimal results, Unicode should be selected as the encoding when importing terminology from Excel.

The same sample Excel file was also used to import terminology into a memoQ multilingual termbase in my First Steps with memoQ 2014 R2 video. The termbase portion of that video starts at 02:58.

First steps with memoQ 2014 R2

This video shows how to translate a Word document from scratch in memoQ 2014 R2. A new project (English-French) is created, the document is imported , a new translation memory is created, a new termbase (multilingual: English-French-Finnish-Swedish) is created, with sample terminology imported from an Excel sheet, the document is translated interactively and finally the translated document is exported. The video also shows the new ribbon first introduced in memoQ 2014 R2.

Create a Custom Search Engine, integrate with IntelliWebSearch

Google’s Custom Search Engine can be a convenient way to restrict searches to several specified domains, like the site:domain.com setting in Google’s Advanced Search for a single site/domain.

Anyone can create any number of custom search engines for free, as long as they have a Google account. Each custom search engine can include any number of sites/domains.

This video shows how to create a custom search engine for Finnish newspapers that includes the websites of five newspapers, as well as another custom search engine for Wikipedia in three different languages.

Once a custom search engine has been created, it can be integrated into IntelliWebSearch (Windows-only) using the following values for the Start and Finish fields:

Start -> http://www.google.com/cse?cx=PART1%3APART2&ie=UTF-8&q=
Finish -> &sa=Search&hl=en

where PART1 and PART2 are the two components of the unique identifier of the custom search engine, as displayed in the public URL of the engine.

For example, if the public URL of a custom search engine is:

https://www.google.com/cse/publicurl?cx=001578379530956773680:y3n4c5omgnq

then, PART1 is 001578379530956773680 and PART2 is y3n4c5omgnq

Get rid of unneeded tags with Translator Tools

Poorly formatted Word documents may contain lots of unneeded tags when imported in CAT tools such as memoQ, Wordfast Pro, SDL Trados Studio etc. Translator Tools is a free tool (Windows only) by fellow translator Stanislav Okhvat that attempts to remove most of these unneeded tags. This video shows how to use the Document Cleaner function of Translator Tools and the impact it had on a sample Word document in memoQ 2014 R2.

Relevant links:
Download Translator Tools
Similar post: Use Code Zapper to kill unneeded tags in Word documents

Solution to Wordfast Classic crashes on Mac

Many Wordfast Classic users experience crashes with Word 2011 for Mac. These crashes occur when using any VBA application (for instance Wordfast Classic, but also Zotero) in Word 2011 and accessing the Dock in OS X to switch to other applications. The solution is simply to auto-hide the Dock, by right-clicking the end of the Dock and selecting Turn Hiding On. It’s also possible to reach the same setting via System Preferences > Dock > Automatically hide and show the Dock.

How to use LF Aligner on Mac

LF Aligner is a freeware, open source, multiplatform (Windows, OS X, Linux) alignment tool based on Hunalign and written by Farkas András. This video shows how to use LF Aligner to align two Word documents, how to review the alignment in Excel and how to create a translation memory in the TMX format from the alignment. Heartsome TMX Editor is used to open the TMX memory and browse through its content. Any translation tool (Wordfast Pro, OmegaT, CafeTran, Swordfish etc.) should be able to open the TMX file.

The last step of LF Aligner for Mac 3.12 (saving alignment as TMX) ignores any changes made to the alignment in Excel during the reviewing process. For that reason, the TMX translation memory should be created with TMX maker, a script included in the Tools subfolder of the main LF Aligner folder.

A handy macro (MergeCells.xla) is included in LF Aligner, but only in the Windows distribution, though it works perfectly in Excel 2011 for Mac as well. It is therefore a good idea to download the Windows distribution (LF Aligner 4.05 for Windows) and extract MergeCells.xla from it. The macro features a keyboard shortcut (Ctrl+J) used to merge two consecutive cells.

This video demonstrates all of the above with two sample Word documents (DOCX) in Finnish and Swedish.

Wordfast Pro glossary from Word table or Excel sheet

Existing terminology in a Word table or in an Excel sheet can be easily imported into a Wordfast Pro glossary, as long as the table/sheet has two columns (source terms, target terms) or three columns (source terms, target terms, comments).

A Word table first needs to be converted to text (function found under the Layout tab, once the table is selected), with tabs (default) as separators. The resulting document must be saved as plain text, with Unicode as the encoding (UTF-8 also works). The saved text file can be imported into a Wordfast Pro glossary.

An Excel sheet can be saved directly as Unicode text, after which it can be imported into a Wordfast Pro glossary.

This video shows how to do it with a sample Finnish-Russian term list.

Five Studio 2014 videos by Daniel Brockmann

The latest version of SDL Trados Studio, 2014, was released on September 30th, 2013. Its Welcome screen includes a new Get Started tab with links to five videos made by SDL’s Daniel Brockmann:

Studio 2014 – get started tab

These videos can be viewed prior to installing Studio 2014 by clicking the links below:

A quick tour around SDL Trados Studio (15:04)
Creating and managing projects (12:05)
Translating single documents (25:40)
Working with project packages (06:16)
Working with GroupShare online projects (06:35)

The video files are in the WebM format and it should be possible to view them in any recent browser, including on mobile devices.

The files can also be downloaded for offline viewing with supported video players, for instance the popular VLC media player.

It should be noted that – for some reason – SDL decided to produce the videos in standard definition (480 x 852 pixels, also known as 480p). Hopefully, they will eventually make higher definition (720p, 1080p) versions available on their YouTube channel.

Terminology recognition and auto-completer in OmegaT 3.0

Terminology recognition in OmegaT is handled via tokenizers. Starting with version 3.0.0, tokenizers are included in the standard OmegaT distribution, whereas one had to download them separately in previous versions. They are also automatically selected during the project creation process, whereas one had to launch them via the command line in previous versions. Tokenizers are especially important for terminology recognition in heavily inflected languages. This video shows how the tokenizer works with Finnish as the source language.

Starting with OmegaT version 3.0.1, recognized terminology can be inserted in the target segment via a new auto-completer feature, which works entirely in the editor pane and with the keyboard (the shortcut is Ctrl+space in Windows, and Esc in OS X, so as to stay consistent with the system-wide completion engine). In previous versions, one had to right-click with the mouse in the glossary pane. This video shows how terminology can be inserted in the target segment, using a sample Finnish-English project.

Related link:
Finnish tokenizer (Lucene)

Related posts:
First steps with OmegaT
Machine translation in OmegaT for Mac
Machine translation in OmegaT for Windows