Data Harmony "plugs into" MarkLogic Server
MarkLogic Server, the industry's leading XML Server, includes a unique set of capabilities to store, manage, enrich, search, navigate, and dynamically deliver content. Designed and optimized for handling XML content, MarkLogic Server is simply unmatched in its ability to maximize your information assets, at the highest levels of performance and scalability.
With their major upgrade to 4.0, the Mark Logic platform added built-in support for entity identification and inline markup, identifying by default 18 different types of entities, including person, organization, location, credit card number, email address, latitude/longitude, date, and time.
Also with the upgrade came an enhanced ability for integration with 3rd party entity extraction engines to identify and markup other types of entities, called the
Open Enrichment Framework.
See the Mark Logic press release
Data Harmony's M.A.I. uses this facility to identify words and character strings in text that suggest Subject Terms contained in an organization's taxonomy (thesaurus or ontology). The subject terms can then be added to the document with unique tags.Once "indexed" in this way, the Mark Logic search engine's performance is significantly improved in both relevancy of returns and completeness. With more effective results, users' productivity increases.
“By enhancing MarkLogic Server with our powerful suite of Data Harmony products, we've found that organizations can expect to reduce the time to conduct individual queries by more than 50 percent and achieve seven-fold productivity increases, said Marjorie Hlava, president of Access Innovations, Inc., which offers Data Harmony software products.“This results in a higher degree of user satisfaction and search effectiveness. We’re delighted to be teaming with Mark Logic to provide this powerful solution for customers.”
Another feature of Mark Logic 4.0 are Alerts - the ability to contact a user when a content item added to the collection meets specified criteria. Data Harmony's Thesaurus Master product can be used to maintain the hierachy used in the creation of these criteria.
For organizations that need metadata to populate catalogs or websites, and summaries of longer content items, Data Harmony's suite includes a Meta-data Extractor with automatic summarization tool as well. The Automatic Summarizer constructs a summary by extracting key, representative sentences from the source document. These summaries are tunable to accommodate different writing styles and different document types. Positional and formatting information is also used.
Complementing the unique advantages of the Mark Logic ECM system, Data Harmony adds important enhancements that improve any organization's "bottom line".

