Document Separation tab - Project Settings window

Use the Document Separation tab to manage how multi-page images are separated into single documents or how loose pages are grouped into multi-page documents. When document separation is enabled, the Kofax Transformation - Server performs document separation before extraction. The kind of documents you process usually determines which document separation type to use.

The following options define how the server handles unclassified pages.

No document separation

Select this option to disable document separation for this project. This option is selected by default.

Note If "No document separation" is selected and you are using Kofax Transformation Modules together with Kofax Reporting, the counters that report separation results, such as correct, missing, or wrong splits, are set to 0.

Trainable Document Separation (TDS)

Select this option to activate trainable document separation (TDS) for the project. This option is cleared by default.

Compatibility Mode

This option is disabled unless Trainable Document Separation (TDS) is selected. Select this option only if you want to use an old separation model created in Project Planner. This option is cleared by default.

How should the TDS classification result be used for document classification

This option is disabled unless Trainable Document Separation (TDS) is selected.

Select one of the following options to configure how the TDS classification result is used to classify a document:

  • Keep TDS classification result also for document classification.

    This option is only available if the "Compatibility Mode" option is selected. Used to accept the TDS classification results for the current Project Builder project, where no document classification is defined.

  • Run subtree classification based on TDS classification result if possible.

    Used to accept the TDS classification result and use it as the parent trigger for subtree classification. If there is no TDS result, the document is reclassified and that result is used for subtree classification. If no subtree classification is defined, this option behaves like "Keep TDS classification result also for document classification." This is the default value for this option.

  • Discard TDS classification result and classify document again.

    Used to ignore the TDS classification results and reclassify the current document against the document classes of the current Project Builder project.
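
The three options above can be read as a small decision procedure. The following Python sketch is only an illustration of that reading, not the product's implementation. The names TdsResultUsage, resolve_document_class, classify, and classify_subtree are hypothetical, and the order in which the fallbacks are checked is an assumption.

```python
from enum import Enum
from typing import Callable, Optional


class TdsResultUsage(Enum):
    """Hypothetical names for the three options described above."""
    KEEP = "keep"
    SUBTREE = "subtree"
    DISCARD = "discard"


def resolve_document_class(
    usage: TdsResultUsage,
    tds_class: Optional[str],
    classify: Callable[[], str],
    classify_subtree: Optional[Callable[[str], str]],
) -> Optional[str]:
    """Return the document class under the selected option.

    tds_class is the TDS classification result (None if there is none).
    classify stands in for regular document classification.
    classify_subtree stands in for subtree classification below a parent
    class; None means no subtree classification is defined.
    """
    if usage is TdsResultUsage.DISCARD:
        # Ignore the TDS result and classify the document again.
        return classify()
    if usage is TdsResultUsage.KEEP:
        # Accept the TDS result as the document class.
        return tds_class
    # SUBTREE: without a subtree classifier, behave like KEEP (assumed order).
    if classify_subtree is None:
        return tds_class
    # Use the TDS result as the parent, or reclassify if there is none.
    parent = tds_class if tds_class is not None else classify()
    return classify_subtree(parent)
```

For example, with a TDS result of "Invoice" and a subtree classifier defined, the sketch passes "Invoice" to the subtree classifier. With no TDS result, it first reclassifies the document and passes that class instead.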

TDS Page Classifier

This classifier is enabled automatically when you use Trainable Document Separation. It classifies each page, and these page classifications are then evaluated to determine the best way to separate the pages into documents.

The following buttons are available for maintaining this classifier.

  • Properties. Click this button to edit the TDS page classifier AFC properties.

  • Reset. Click this button to reset the page classifier properties to their default values.

TDS Model import path

This option is disabled unless Trainable Document Separation (TDS) and Compatibility Mode are selected.

Select the location of the TDS model file and click Import TDS Model to import the file into the project.

Important For projects created in Kofax Transformation Modules 5.0 or an earlier version, it is not possible to import a TDS model if there is more than one model. In Kofax Transformation Modules 5.5, you can test and train multiple models because only the most recently built classifiers are imported for each TDS classifier.

Standard Document Separation

Select this option to use the class properties and project settings to determine how documents are separated. This option is cleared by default.

Duplex scan mode (front and back side will never be split)

This option is disabled unless Standard Document Separation is selected.

Select this option if you have two-sided pages. The back side is ignored during separation, so the front and back of a sheet are never split. This option is cleared by default.

Unclassified pages should be handled as

This option is disabled unless Standard Document Separation is selected.

This option defines how unclassified pages are handled during Standard Document Separation:

  • first page of new document.

    Used to handle an unclassified page as the first page of a new document. For example, a multi-page document consists of four pages. Document separation processes the pages sequentially, and the first page belongs to class A while the other pages remain unclassified. If this option is selected, a new document is created for each unclassified page, so document separation produces four single-page documents.

  • attachment to previous document.

    Used to handle an unclassified page as an attachment to the previously classified document. For example, a multi-page document consists of four pages. Document separation processes the pages sequentially, and the first page belongs to class A while the other pages remain unclassified. If this option is selected, the unclassified pages are added to the current document, so document separation produces one multi-page document that consists of four pages. This is the default value for this option.

  • attachment if previous document was unclassified.

    Used to handle an unclassified page as an attachment to the previous document, if that document was unclassified. For example, a multi-page document consists of four pages. Document separation processes the pages sequentially, and the first page belongs to class A while the other pages remain unclassified. A document is created for the first page, and the second page starts a new, unclassified document. Because that document is unclassified, the remaining pages are attached to it, so document separation produces one single-page document and one three-page document.
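
The four-page example used for the three options above can be traced with a short Python sketch. This is a simplified model and not the product's separation engine. The names UnclassifiedHandling and separate are hypothetical. The sketch assumes that every classified page starts a new document, and that the third option attaches an unclassified page only when the document currently being built started with an unclassified page.

```python
from enum import Enum
from typing import List, Optional


class UnclassifiedHandling(Enum):
    """Hypothetical names for the three options described above."""
    FIRST_PAGE = "first page of new document"
    ATTACH_TO_PREVIOUS = "attachment to previous document"
    ATTACH_IF_PREVIOUS_UNCLASSIFIED = "attachment if previous document was unclassified"


def separate(
    pages: List[Optional[str]], mode: UnclassifiedHandling
) -> List[List[Optional[str]]]:
    """Group page classes into documents; None marks an unclassified page."""
    documents: List[List[Optional[str]]] = []
    for page_class in pages:
        if page_class is not None or not documents:
            # Assumption: a classified page (or the very first page)
            # always starts a new document.
            documents.append([page_class])
            continue
        previous = documents[-1]
        if mode is UnclassifiedHandling.ATTACH_TO_PREVIOUS:
            previous.append(page_class)
        elif (
            mode is UnclassifiedHandling.ATTACH_IF_PREVIOUS_UNCLASSIFIED
            and previous[0] is None
        ):
            # The previous document is unclassified, so attach to it.
            previous.append(page_class)
        else:
            # Otherwise the unclassified page starts a new document.
            documents.append([page_class])
    return documents


# The example from the text: page 1 is class A, pages 2 to 4 are unclassified.
example_pages: List[Optional[str]] = ["A", None, None, None]
page_counts = {
    mode: [len(doc) for doc in separate(example_pages, mode)]
    for mode in UnclassifiedHandling
}
```

Under these assumptions, page_counts holds the page count of each resulting document: four single-page documents for the first option, one four-page document for the second, and a single-page document followed by a three-page document for the third.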

Definitions for the buttons at the bottom of this window can be found in Common Project Builder Buttons.