Associative search vs. fuzzy databases

Since both associative search databases and fuzzy databases use CSV files, it is not always obvious which type of database is best for your needs. Use the following information to determine which database type to use:

  • The Associative Search Database type is faster and more accurate than fuzzy databases. It is able to better differentiate between the best alternative and the second best alternative, resulting in fewer false negatives.

  • However, if you require the database source file to be accessible from a URL, or if you want to normalize a search or database records using replacement texts, only fuzzy databases have this functionality.

  • Also, only fuzzy databases support the application of an exclusion database. An exclusion database is helpful when documents have multiple addresses, including both recipient and sender. The exclusion database can ensure that the certain addresses are excluded from the list.

  • Associative Search Databases support only ANSI and Unicode little endian encoding for the .csv file. Also, the first character of the .csv file is expected to be a BOM character.

    Fuzzy databases also support UTF-8 and Big Endian encoding.

    Tip If you are using an Associative Search Database, you can use an application such as Notepad++ to save a .csv file using one of the supported encoding types.
Note When using an associative search database as input for a Database Locator, the search results are not highlighted on the document as they are for a fuzzy database.