Classification and the order of processing

During classification, the classifiers run in a specific order.

Initially, layout classification is run. Because the Layout Classifier does not rely on recognition results to perform classification, any classes that use layout classification can be run without recognition results.

If layout classification is unsuccessful, or a class does not use layout classification, recognition is performed followed by content classification. This includes the top-level content classifier, subtree classifiers if enabled, and instruction classifiers if configured.

For any projects with child classes, hierarchical rules are evaluated for all documents to determine the need for additional classification refinements. These rules are applied automatically and cannot be disabled. This is because these rules ensure that the parent-to-child relationship does not hinder the classification result. The final classification result depends on both the hierarchical rules and the classification configuration.

For example, when the Single child wins over its parent hierarchical rule is run, and a parent class and related child class have the same confidence after layout and content classification is performed, the classification result for the document is the child class. The hierarchical rule overrides the parent class in favor of the child class.

Similarly, when the Parent represents competing children rule is run, the classification result is the parent class when related child classes have the same confidence.

The following example shows the order of classification processing.

Figure 1. Classification processing order
An image that shows the order of classification processing.