Enhanced OCR Output Format Window - Microsoft Word and Microsoft 2007 and later

Use this window to control the output format for the Microsoft Word file that is generated by the Kofax Enhanced OCR Full Text recognition engine.

Output format

Changing the output format may make other options available. The settings of disabled options are retained, so that if you return to that format, the most recent settings are still used.

You can select an output format from this list:

  • Plain Text (.txt)

  • Rich Text Format (.rtf)

  • HTML (.mht)

  • Microsoft Word (*.doc)

  • Comma-Separated Values (*.csv)

  • Microsoft Excel (*.xls)

  • Microsoft Word 2007 and later (*.docx)

  • Microsoft Excel 2007 and later (*.xlsx)

Page layout

Select the page layout characteristics for exporting to the output format:

  • Retain Original Layout: Document layout is retained in full.

  • Retain Original Layout (Optimized): The original layout of the page is retained including columns.

  • Retain Paragraphs and Fonts: Recognized text is formatted into a single column. Paragraphs, font types, font sizes, highlights, and strikethroughs are retained.

Text Settings

The text attributes (Bold, Italic, Underline) are always retained when the recognized data is saved to the output file.

Suppress line breaks

Select this check box if you want line breaks in the original document to be suppressed (discarded) when the recognized data is saved. If not, the line breaks are retained.

Use page break as page separator

Select this check box when you want page breaks in the original document to be used as page separators when the recognized data is saved. If not, the page breaks are ignored.

Retain text color

Select this check box if you want the color of the text in the original document to be retained when the recognized data is saved. If not, the original color is ignored.

Picture Settings

Use the Picture Settings to set preferences for the images.

Remove pictures

Select this check box to remove any pictures that belong to a page from the output file.

Resolution

Specify the original resolution of the images to be used. You can select from among the following output resolutions in dots per inch:

  • Original

  • 72

  • 100

  • 150

  • 200

  • 300

Resolution can only be reduced, not increased. For example, if the original image resolution of the scanned page is 200 dpi and the resolution combo box is set to 300, the image resolution on the output file is 200 dpi and not 300 dpi.