BusinessObjects ThingFinder SDK Features

Language-Aware Tokenization
Uses language-aware tokenization, part-of-speech tagging and noun phrase identification to automatically extract/classify entities, relations, events.

Easily Extended
Can be easily extended to identify and extract custom pattern-based or list-based entities, relation and events using a GUI or RegEx language.

Relevance Scores
ThingFinder entities are given relevance scores reflecting their importance to the document as a whole, making ThingFinder a useful categorizer.

Variant Identification and Grouping
Variant Identification and Grouping provides true counts reflecting the number and location of ALL appearances of a given entity.

Normalization
Normalization creates standard formats (e.g., ISO) for entities like dates/measurements

Custom Extraction
Custom extraction can include regex, stems, part-of-speech, phrase/clause boundaries, list matching, input matching filters, case insensitive matching, etc.

Contact sales now:

Locate a sales representative

Request a demo:

U.S. & Canada 1 866 681 3435
Europe: 00800 55 11 55 11
Global contact list

Request more information
Find a reseller