Recognition Profiles Window - Kofax PDF Text Under Image

Use this window to select settings for the Kofax PDF Text Under Image recognition profile.

Name

Use the list to select a recognition profile. The other settings on the window are refreshed to indicate the settings defined for the selected profile.

Note For new Kofax Capture installations, Kofax PDF Text Under Image appears on the list of recognition profiles, and Kofax PDF Image + Text is no longer available. However, you can continue to use any Kofax PDF Image + Text recognition profile (which supports Text Over Image) created in an earlier version of Kofax Capture. If you import a batch class that uses a Kofax PDF Image + Text profile created in an earlier version, it is added to the list of recognition profiles.

Engine

This is preset to PDF Image + Text.

Languages

This contains a single language, or a list of multiple languages separated by semicolons. The edit box can scroll to display all the selected languages. You cannot select languages here; this is for informative purposes only.

Select button

Click this button to select languages from the Recognition Languages window.

Mark level

Use these settings to specify the minimum level of confidence to accept for character recognition. Characters that do not meet this minimum level are identified with the mark flag.

General

Use this setting to select from three levels of confidence. The default level is Medium. The other choices are Low and High.

A setting of Low indicates a lower level of recognition confidence, which results in fewer mark flags.

A setting of Medium indicates a moderate amount of recognition confidence which results in more mark flags than with the Low setting.

A setting of High indicates that you require a greater degree of recognition confidence, which may result in more mark flags than the other levels.

Specific

Use this setting to define a precise level of confidence ranging from 0 to 100.

The level value for Low is 75.

The level value for Medium is 85.

The level value for High is 95.

Spell check and Spell check flag

Select the Spell check option to have unrecognized words compared to entries in a dictionary of known values. If they do not match, a spell check flag is inserted in front of the non-matching word. A default dictionary corresponding to the currently selected language is used. You can also specify a custom dictionary by selecting a text file in the Document Class Properties window OCR tab.

Use the Spell check flag to specify a character to use to indicate words not found in a dictionary. You can specify only a single character. Note that the following occurs when you specify a spell check flag:

  • For a single language: all words are flagged when the selected language does not have a dictionary.

  • For multiple languages: only words not found in the dictionary are flagged. If the selected language does not have a dictionary, then the word is not flagged.

If a word is flagged twice, once with the spell check flag and once with the confidence mark flag, the spell check flag is first, followed immediately by the confidence mark flag.  However, if the two flags are set to the same character (for example, ^), both flags are represented by a single character. This is the default behavior.

Non-natural language

Use the text entry field below the check box to specify the characters that are valid to include in a word.

Output button

Use this button to open the output format window, where you select preferences for displaying PDF output.

Image cleanup

Select an image cleanup profile from the list.

Edit button

To modify an existing image cleanup profile or create a new one, use the Edit button to open the Image Cleanup Profiles window, where you specify the type of image cleanup to use, along with other advanced settings.

Delete button

Use this button to delete the currently selected profile.  You cannot delete profiles that are built in to Kofax Capture.

Script button

If available, use this button to assign a recognition script to the selected profile in the Recognition Script window, where you associate a recognition script with the recognition profile.

Test button

This button is unavailable for this recognition profile.