Notes
Slide Show
Outline
1
Linking a Thesaurus To SharePoint for Content Management

  • Scott Denning
  • Tao Liu


  • Access Innovations, Inc.
2
ASRT Taxonomy
  • American Society of Radiologic Technologists
  • Membership organization, more than 100,000 members
  • Access Innovations, Inc.
  • Taxonomy to encompass
    • Knowledge domain
    • Organizational structure
3
ASRT Taxonomy
  • Intent was to have the taxonomy serve both as a structure for indexing documents, and eventually as a tool which would facilitate keyword suggestion for documents at time of generation.
  • Thus, terms needed to be linked to content, as well as descriptive of content
4
ASRT Taxonomy
  • Not just for indexing, but in support of total content management of documents from many different sources
5
Requirements
  • Use metadata from existing documents, as well as providing/suggesting metadata for created documents
  • ASRT is a “MicroSoft Shop”
  • Support storage as XML documents
  • MS Office 2003, XML support features
  • SharePoint™


6
SharePoint
  • Supports taxonomies, but does not            provide taxonomies
  • SharePoint’s strengths are collaboration, version control, and searching.
  • Provides some basic hierarchical structure:
    • Categories
    • Keywords
    • “Best Bets”

7
The Challenges:
  • Integrate ASRT taxonomy with SharePoint, allowing users to exploit familiar features while capitalizing on the hierarchical structure of the taxonomy.
  • Use M.A.I.™ (Machine Aided Indexer) to suggest terms from the taxonomy as keywords at the time of document generation.
8
The Challenges – cont’d
  • M.A.I. to run quietly in the background until needed
  • Provide/suggest indexing terms as document is versioned or finalized
9
Requirements
  • Encompass full trajectory of documents: creation – search – repurposing - archiving
  • Broad range of documents – administrative, accounting, archival, educational, etc.
  • Different document formats
  • Flexible for content management
10
Interlink


  • Colorado-based group specializing in technology architecture, including SharePoint
11
M.A.I. Considerations
  • M.A.I. is a text-based tool; documents are in many formats
  • Should allow familiar SharePoint search features to be used, while also suggesting indexing terms/keywords


12
Access work
  • Programs written to allow M.A.I. to handle documents in different formats:


    • Word (.doc)
    • Excel (.xls)
    • PowerPoint (.ppt)


  • Collaborative work: .pdf soon…
13
The Future?
  • SharePoint/M.A.I. used to identify “expert users” within ASRT, based upon congruency of individuals’ keyword usage with taxonomy terms
  • M.A.I. embedded within/merged with other programs, using versions of code written for this project