This topic is for memoQ 8.7. Have an older version? Click here.

PDF (Portable Document Format) files

memoQ can import PDF files. On its own, memoQ can open them as plain text, or convert them into DOCX first, and imports the DOCX file.

To make sure all PDF documents are imported successfully, even if they have text in images: Use the TransPDF service. You can choose to use TransPDF in this Document import settings window. Before you do this, you need to register with TransPDF and save your TransPDF account in memoQ.

You need to pay for TransPDF: TransPDF is not free. After you register, you can produce 25 pages of translated PDF for free, but you need to pay for the rest. TransPDF will charge you after the number of the final, translated pages that you export. So, the PDF will be imported for free, and you pay when you export the finished work.

If you do not use TransPDF but rely on memoQ to import PDF documents, you need to live with these limitations:

  • Can't export PDF: If the source document is PDF, memoQ exports the translation in plain text or in DOCX, depending on the method of the import.
  • Can't import password-protected PDF files. You need to supply the password to TransPDF, too.
  • Can't import scanned PDF files: Without TransPDF, memoQ doesn't extract text from scanned PDF files, where the pages are saved as images and not as text. To translate these documents, run them through a page reader program such as Nuance OmniPage or ABBYY FineReader (PDF Reader). These programs save well-formed DOCX files where the text flow and the formatting is retained as much as possible. Or, use TransPDF, it is probably cheaper than these two.
  • Text may become garbled: PDF is not a text format. Normally, it doesn't try to preserve the text flow. As a result, some of the text may be missing or may appear in the wrong order when you import a PDF into memoQ. When this happens, run the documents through a page reader program such as Nuance OmniPage or ABBYY FineReader (PDF Reader). These programs save well-formed DOCX files where the text flow and the formatting is retained as much as possible. This may happen with TransPDF, too, although it is less likely.

How to get here

  1. Start importing an Portable Document Format (PDF) file.
  2. In the Document import options window, select the PDF files, and click Change filter and configuration.
  3. The Document import settings window appears. From the Filter drop-down list, choose PDF (Portable Document Format).

What can you do?

When you finish

To confirm the settings, and return to the Document import options window: Click OK.

To return the Document import options window, and not change the filter settings: Click Cancel.

If this is a cascading filter, you can change the settings of another filter in the chain: Click the name of the filter at the top of the window.

In the Document import options window: Click OK again to start importing the documents.

memoQ doesn't import PDF directly

memoQ relies on external modules that help importing PDF documents. These modules are installed with memoQ, but come from other software makers.

To convert PDF documents into Word (DOCX), memoQ uses Aspose.PDF. To learn how this is done: See the developer's web page.

To convert PDF documents into plain text memoQ uses xPDF. Xpdf copyright © 1996-2009 Glyph & Cog, LLC.