Enhanced OCR Output Format Window - Comma-Separated Values

Use this window to control the output format for the comma-separated values file that is generated by the Kofax Enhanced OCR Full Text recognition engine. Although it can be used for any type of text content, this format is most suited to extracting data from tables. Each cell in a table is delimited by the separator, and each row in the table corresponds to a row in the output file. The following rules apply:

  • The recognized lines are broken into separated lines in the final output.
  • The recognized paragraphs are put in series without blank lines as separator in the final output.
  • The recognized text and tables are always exported to the final output.

Use Operating System separator

Use Operating System separator is the default value for the profile. The separator is recognized according to the operating system settings.

Use User-defined separator

If you select this option, the editable field Separator is activated. Specify a single character to use as a separator between recognized words. Any printable character is allowed. In addition to the printable characters, you can also specify that the tab character can be used to delimit your file. To do this, enter the tab sequence (\t).