Recognition Profiles Window - Enhanced OCR Zonal

Use this window to select settings for the Kofax Enhanced OCR Zonal recognition profile.

Name

Use the list to select a recognition profile. The other settings on the window are refreshed with the settings defined for the selected profile.

Engine

Enhanced OCR Zonal is the default setting.

Languages

This contains a single language, or a list of multiple languages separated by semicolons. The edit box can scroll to display all the selected languages. You cannot select languages here; this is for informative purposes only.

Select button

Click this button to select languages, a character set, or text direction. The Recognition Languages window appears.

Use Code page to define a specific code page for the recognition engine. The default value is Unicode. Otherwise, select Current Operating System's code page to apply the code page of the current operating system.

You can select more than one language by using the Add button. If you use Chinese, Japanese, Korean or the Arabic language, adjust the Text Direction setting.

Mark and Spell

Use these settings to specify the minimum level of confidence to accept for character recognition. Characters that do not meet this minimum level are marked with the mark flag.

General

If you select the General option, the adjacent list gives you a choice of three levels of confidence. The default level is Medium. The other choices are Low and High.

A setting of Low means that you accept a lower level of recognition confidence. There may be fewer mark flags in the results.

A setting of Medium means that you accept a moderate amount of recognition confidence. There may be more mark flags than with the Low setting.

A setting of High means that you require a greater degree of recognition confidence. There may be more mark flags than with the other settings.

Specific

If you select the Specific option, you can specify a precise level of confidence ranging from 0 to 100.

The level value for Low is 75.

The level value for Medium is 85.

The level value for High is 95.

Spell check and Spell check flag

Select the Spell check option to have unrecognized words compared to entries in a dictionary of known values. If they do not match, a spell check flag is inserted in front of the non-matching word. A default dictionary is used, based on the currently selected language. You can also specify a custom dictionary by selecting a text file in the Document Class Properties window OCR tab.

Use the Spell check flag to specify a character to indicate words that are not found in a dictionary. You can specify only a single character. Note that the following occurs when you have enabled and specified a spell check flag:

  • For a single language: All words are flagged when the selected language does not have a dictionary.

  • For multiple languages: Only words not found in the dictionary are flagged. If the selected language does not have a dictionary, then the word is not flagged.

If a word is flagged twice, once with the spell check flag and once with the confidence mark flag, the spell check flag is first, followed immediately by the confidence mark flag. However, if the two flags are set to the same character (for example ^), both flags are represented by a single character. This is the default behavior.

Character set

Use these settings to define the digit or character data filter.

A setting of Any means that the recognized data can contain any characters.

A setting of Numbers only means that the recognized data contains only digits.

A setting of Letters only means that the recognized data can contain only letters.

A setting of Custom means that you can add a set of characters to include to the recognition result. For example, if the recognition zone includes dates in the format DD/MM/YYYY, enter digits and the "/" symbol in the custom filter input field. In this case, the engine recognizes the date correctly.

Image Cleanup

Select an image cleanup profile from the list.

Edit button

To modify an existing image cleanup profile or create a new one, click the Edit button. The Image Cleanup Profiles window appears, and you can specify the type of image cleanup and other advanced settings.

Delete button

Click this button to delete the currently selected profile. It is not possible to delete profiles that are built in to Kofax Capture.

Script button

If enabled, use this button to assign a recognition script to the selected profile. The Recognition Script window appears, and you can associate a recognition script with the recognition profile.

Test button

Click this button to test your zone settings. Your recognition and cleanup settings are applied to the zone with the results displayed in the Zone Test window.