Classification Locator sample
This sample project was created to be used as an "embedded" project. This means that you can configure a Classification Locator and reference this sample project rather than create a classification project from scratch. The sample project is pre-configured for layout or content classification, either instructions or Adaptive Feature Classifier and has a subdirectory structure that can be used to classify a document by its language. When processing documents, string values describing the languages are subsequently returned.
After you have extracted the CLSLoc_Language sample zip file, you can find the project file and another file needed for content classification. The multi-lingual documents used for training are not part of this sample project, because all training information is saved to the "Language_Content Classifier.dat" file.
When setting up your extraction project, a project that does not have a Classification Set yet is sufficient. Set up this project with a default classification result (1 base class), define 1 field and assign it to a Classification Locator. In the properties of the Classification Locator, reference the "CLS_loc_language" sample project and leave the rest on default. Now feed your project with documents in various languages: the configured field contains the language as estimated by the Classification Locator.