A sample text widget

Etiam pulvinar consectetur dolor sed malesuada. Ut convallis euismod dolor nec pretium. Nunc ut tristique massa.

Nam sodales mi vitae dolor ullamcorper et vulputate enim accumsan. Morbi orci magna, tincidunt vitae molestie nec, molestie at mi. Nulla nulla lorem, suscipit in posuere in, interdum non magna.

Low-tech way to get rid of unneeded tags in DOCX in memoQ

This video shows a low-tech way to get rid of unnecessary tags (usually displayed as “rpr”) in DOCX documents imported in memoQ. The trick is to go back to Word, cut the tag ridden portion, paste it back as unformatted and reimport the document in memoQ.

The document used in the video was originally received in the DOC format. It was saved as DOCX in Word 2010, because Kilgray recommends DOCX as a much better format than DOC, and say their DOCX filter are better than the DOC filter.

SDL Trados Studio 2011 imported the same tag ridden document without a single tag. Kilgray should improve their DOCX filter with regard to tag handling.

6 comments to Low-tech way to get rid of unneeded tags in DOCX in memoQ

  • Piotr Bienkowski

    The CodeZapper template with the CZL macro is not hi-tech either, it just takes installing the template and then clicking the CZL button. Helps a lot in reducing the tag soup.

    • Dominique

      Well, Code Zapper is much higher-tech than the method I described, but it didn’t help with this particular file: running it (I used CZ All) only had a marginal effect on the number of tags. Where Code Zapper did help was in Déjà Vu: when importing the same document without Code Zapper, it was literally infected with codes in the DV grid; when imported with Code Zapper (using the built-in version of DV), there were only a few codes left (other than those for bold formatting, which DV would use no matter what). In other words, it looks like memoQ has its “own build-in Code Zapper”, but it only works up to a point. Studio 2011, on the other hand, did manage to kill all tags.

  • Chun-yi Chen

    Thanks for the tip, Dominique. This will be another method for me to try when a file imported in MQ is full of tags. It is true that Studio does a better job in getting rid of tags. I tested several files in both Studio 2011 and MemoQ and got the same results as you.

    • Dominique

      Thanks, Chun-yi, for confirming there is a pattern and the problem isn’t limited to an isolated file! For many translators, DOCX is the most common format they encounter, which is why Kilgray should give special attention to the DOCX filter.

  • Sadly this still seems to be an issue in memoQ’s latest versions. And unfortunately for me, the high-tech Code Zapper probably isn’t applicable (non-MS Office user) and the low-tech solution would take an age to perform on some of these mammoth tag-infested files.

    • Dominique

      Even if you dislike MS Office, don’t plan to use it for producing documents and translate with a standalone tool like memoQ, I believe it really makes sense to have a copy at your disposal, for a number of reasons. The cheapest usable Office package (Home & Student) doesn’t cost an arm and a leg …

Leave a Reply

You can use these HTML tags

<a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <s> <strike> <strong>