MarkLogic Server’s ability to convert and store data as xml records and to search huge quantities of records—in record-breaking time—is enhanced by the semantic metadata that can be added to each by Data Harmony tools and retrieved with a user-friendly search application tied into the super-efficient MarkLogic Xquery engine.
Inline tagging adds subject terms from an organization’s taxonomy and identifies bibliographic information, also as tags, within the xml document. Data Harmony’s Thesaurus Master® maintains an authority file that enables adding the preferred form of the metadata value (e.g., full first, correctly spelled last name, and department) to the tag when an alternate form is located.
Subject terms are applied, either with the author’s review, or automatically by Access Innovations’ Machine Aided Indexer (M.A.I.™), using a rule base to put terms in context for high-precision indexing. The combined thesaurus and rule base hold and use thousands of synonyms and near-synonyms to point to the preferred expression of each concept in the thesaurus.
Data Harmony’s Search Harmony application uses both synonyms and thesaurus terms to assist a user in choosing the best term to hand off to the Xquery search engine, suggesting them as the user types. With standardized metadata easily accessed, Search Harmony queries can be targeted for even more precise returns. And for the searcher who is exploring, related terms are presented to enable a broader search. Too many results? Narrower terms are presented to enable a more specific search.
It is also possible to display the entire thesaurus hierarchy to allow users to “drill down” in the appropriate branch to the level of specificity required and initiate a search with a single click.