M.A.I.™ Components
Whether you're running a machine or an indexing system, optimal operation depends on the quality of the individual components and the smooth interaction (including the human one) between them.
M.A.I. (Machine Aided Indexer) runs on four components which interact seamlessly to maximize the quality of indexing from the start, and to support its evolution as your vocabulary grows in scale and complexity.
Rule Builder
The Rule Builder is an interactive module that allows the user to create rules using a large selection of language terms denoting spatial relationship (near, around, with, etc.), format (all caps, truncate, wild cards, etc.), location (begin sentence, end sentence, in title, etc.) and match, among others. The user can edit, add to, and review rules in the Rule Builder, or search for a set of terms or rules.
Rule Base
The Rule Base is a collection of the rules and the valid terms to be used in the indexing of the data set, and is built using the Rule Builder. It is used by the Concept Extractor to select suggested indexing terms.
Concept Extractor
The Concept Extractor compares the text in the document with the Rule Base and presents the suggested terms to the user for selecting or discarding. It recognizes all of the conditions set forth in the Rule Base and also recognizes data in tagged strings for special treatment.
Once the Concept Extractor selects the terms, it ranks them and presents the 20 most frequently mentioned terms to the user for review. (This number may be changed for individual applications.)
Statistics Collector
The Statistics Collector keeps a record of documents that have been passed through the Concept Extractor, together with the M.A.I.-suggested terms and those used by the editor. It stores, then gathers the information into groups of editors, terms, M.A.I. suggestions, and terms suggested but not used by the editor.
The "make statistics" function allows the user to create a list of "miss" and "noise" terms to review for building new rules. By employing the Statistics Collector in tandem with the Rule Builder, the user can gather terms needing a rule and review the relevant text for meaning and context.

