Language-Aware Tokenization
Uses language-aware tokenization, part-of-speech tagging, and noun phrase identification to automatically extract and classify entities, relations, and events.
Easily Extended
Can be easily extended to identify and extract custom pattern-based or list-based entities, relations, and events using a GUI or a regular-expression language.
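To illustrate the difference between pattern-based and list-based entities, here is a minimal Python sketch. It does not use ThingFinder's GUI or its rule language; the `PATTERNS` and `LISTS` tables, the `TICKET_ID` and `PRODUCT` labels, and the `extract_custom` function are all hypothetical names chosen for the example.

```python
import re

# Hypothetical custom entity definitions (illustrative only, not ThingFinder syntax).
# Pattern-based: anything matching the regular expression is an entity.
PATTERNS = {
    "TICKET_ID": re.compile(r"\b[A-Z]{2,5}-\d+\b"),
}
# List-based: only the listed terms are entities (matched case-insensitively here).
LISTS = {
    "PRODUCT": ["ThingFinder", "Acme Router"],
}


def extract_custom(text):
    """Return (label, matched_text, start_offset) tuples sorted by offset."""
    hits = []
    for label, pattern in PATTERNS.items():
        for m in pattern.finditer(text):
            hits.append((label, m.group(), m.start()))
    for label, terms in LISTS.items():
        for term in terms:
            # re.escape keeps list terms literal; \b avoids partial-word matches.
            term_re = r"\b" + re.escape(term) + r"\b"
            for m in re.finditer(term_re, text, re.IGNORECASE):
                hits.append((label, m.group(), m.start()))
    return sorted(hits, key=lambda h: h[2])


if __name__ == "__main__":
    for hit in extract_custom("Bug OPS-42 affects the acme router."):
        print(hit)
```

A pattern suits open-ended identifiers that follow a shape, while a list suits a closed vocabulary such as product names.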
Relevance Scores
ThingFinder entities are given relevance scores reflecting their importance to the document as a whole, making ThingFinder a useful categorizer.
Variant Identification and Grouping
Variant identification and grouping provides true counts that reflect the number and location of all appearances of a given entity, whichever variant of its name is used.
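The idea of grouping variants can be sketched as follows. This is only an illustration under simple assumptions: the `ALIASES` lookup table and the `group_variants` function are hypothetical, and a real system would likely identify variants with linguistic rules rather than a hand-written table.

```python
from collections import defaultdict

# Hypothetical alias table: lowercase variant -> canonical entity name.
ALIASES = {
    "international business machines": "IBM",
    "ibm": "IBM",
    "ibm corp.": "IBM",
}


def group_variants(mentions):
    """Group (surface_text, offset) mentions under one canonical name.

    Returns a dict mapping each canonical name to the list of offsets
    where any of its variants appeared.
    """
    groups = defaultdict(list)
    for surface, offset in mentions:
        # Unknown surface forms are kept as their own canonical name.
        canonical = ALIASES.get(surface.lower(), surface)
        groups[canonical].append(offset)
    return dict(groups)


if __name__ == "__main__":
    mentions = [
        ("IBM", 0),
        ("International Business Machines", 40),
        ("IBM Corp.", 90),
        ("Oracle", 120),
    ]
    for name, offsets in group_variants(mentions).items():
        print(name, len(offsets), offsets)
```

Counting per canonical name rather than per surface string is what makes the count reflect every appearance of the entity.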
Normalization
Normalization creates standard formats (e.g., ISO) for entities such as dates and measurements.
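As a rough illustration of date normalization, the sketch below converts a few surface forms to ISO 8601 using only the Python standard library. The list of accepted input formats and the `normalize_date` function are assumptions made for the example (including the choice to read `03/05/2021` as month first), not a description of how ThingFinder normalizes dates.

```python
from datetime import datetime

# Assumed input formats for this sketch; slash dates are read as month/day/year.
DATE_FORMATS = ["%B %d, %Y", "%d %B %Y", "%m/%d/%Y"]


def normalize_date(text):
    """Return the date as an ISO 8601 string (YYYY-MM-DD), or None if unrecognized."""
    for fmt in DATE_FORMATS:
        try:
            return datetime.strptime(text.strip(), fmt).date().isoformat()
        except ValueError:
            continue  # try the next candidate format
    return None


if __name__ == "__main__":
    for raw in ["March 5, 2021", "5 March 2021", "03/05/2021", "sometime soon"]:
        print(repr(raw), "->", normalize_date(raw))
```

Measurements could be handled in the same spirit, by mapping each recognized unit to one standard unit before storing the value.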
Custom Extraction
Custom extraction can include regular expressions, stems, parts of speech, phrase and clause boundaries, list matching, input matching filters, case-insensitive matching, and more.