Analysis Profiles

These settings define formats for generating alternatives for the Custom 1 to Custom 5 extraction fields. Each set of formats is referred to as an analysis profile.

It is possible to add multiple entries for these settings. As a result, the settings are displayed in a table.

The table has a composite key that is formed by the combination of the Analysis Profile ID and the Index ID. This is because alternatives may be generated using multiple format strings.

Analysis profiles are assigned to extraction profiles via the Analysis Profile ID setting in the Solution Configuration Manager > Global Settings > Custom Extraction Profiles > Extraction Profiles.

The following column settings are available.

Analysis Profile ID

The unique ID given to the analysis profile.

Index ID

The index ID given to each format string that is used to generate alternatives for the field.

The Index ID is unique per Analysis Profile ID.

Compare Type

The compare type used to generate alternatives bas on the format sting. The following options are available for this setting.

  • SIMPLE. Indicates that the format is a simple expression.

  • REGULAR. Indicates that the format is a regular expression.

  • TRIGRAM. Indicates that the trigram method is used to find alternative based upon the format string.

  • LEVEN. Indicates that the Levenshtein method is used to find alternatives based upon the format string.

  • STRINGCOMPARE. Indicates that the String Compare method is used to find alternative based upon the format string.

For more information on these comparison types, refer to the Transformation Designer Help.

Format

The format string used to generate alternatives for the field.

Ignore Characters

A list of characters that are allowed to appear in an alternative at any position. The system should be tolerant of these characters when generating alternatives.

For example, hyphens or periods. This list does not need to be comma-separated.