Advanced OCR Output Format Window - Comma-Separated Values

Use this window to control the output format for the comma-separated values file that is generated by the Kofax Advanced OCR Full Text recognition engine. Although it can be used for any type of text content, this format is most suited to extracting data from tables. Each cell in a table is delimited by the separator, and each row in the table corresponds to a row in the output file.

Suppress line breaks

Select this check box if you want line breaks in the original document to be suppressed (discarded) when the recognized data is saved. If not, the line breaks are retained.

Use page break as page separator

Select this check box when you want page breaks in the original document to be used as page separators when the recognized data is saved. If not, the page breaks are ignored.

Use blank line as paragraph separator

Select this check box when you want page breaks in the original document to be used as page separators when the recognized data is saved. If not, the page breaks are ignored.

Tables only

Select this check box if you want only tables to be in the comma-separated values format file. This causes text or other document elements outside tables to be ignored.

Separator

Specify a single character to use as a separator between recognized words. Any printable character is allowed. In addition to the printable characters, you can also specify that the tab character be used to delimit your file. To do this, enter the tab sequence (\t).