SlideShare ist ein Scribd-Unternehmen logo
1 von 25
Downloaden Sie, um offline zu lesen
Managing Taxonomy Tagging
Taxonomy Boot Camp
Washington, DC
November 4, 2019
Heather Hedden & Terry Casey
Terry Casey
▪ Taxonomist
– Independent consultant, Casey
Indexing and Information Service
– Currently contract staff taxonomist
▪ Back-of-the book indexer
– Textbook, scholarly, trade book and
periodical indexer.
– Embedded indexes for digital
publications
About Us
2
Heather Hedden
▪ Taxonomist
– Independent consultant, Hedden
Information Management
– Previously employed and contract
consultant and staff taxonomist
▪ Former indexer
– Periodical article indexer at library
vendor IAC (acquired by Gale)
– Freelance back-of-the-book indexer
▪ Author of The Accidental Taxonomist
(2010, 2016, Information Today, Inc.)
▪ Introduction: Tagging, Indexing, Categorizing
▪ Taxonomy Design and Display for Indexing
▪ Indexing Policy, Documentation, and Training
▪ Automated Indexing Methods
▪ Manual Indexing
▪ Finding Indexers
▪ Examples of Indexing Projects
Outline
3
Tagging vs. indexing vs. categorizing/classifying
Tagging – assigning metadata labels (“tags”)
▪ By identifying topics and names within a document or content item
▪ By content creators or editors (minimally trained in tagging),
not as their primary job responsibility
▪ For metadata both with and without controlled vocabularies
▪ To support search
▪ Can also be semi-automated
Indexing – assigning index terms (subject metadata and related elements)
▪ By identifying topics and names within a document or content item
▪ By trained indexers, often as their primary job responsibility
▪ By selecting terms from a large controlled vocabulary/thesaurus/taxonomy
▪ To create a browsable index (and now also to support search)
▪ Can also be semi-automated
Introduction
4
Tagging vs. indexing vs. categorizing/classifying
Categorizing/classifying – organizing & assigning content into named categories
▪ By identifying which category a document or content item belongs within
▪ A feature of most content management systems, in addition to tagging.
▪ Often represented as virtual folders and subfolders.
▪ May be appropriate for Subjects or for Document Types.
▪ Content items can usually go into only one category, like classification.
▪ Categories are multi-level hierarchical.
▪ Category hierarchy is designed as a hierarchical taxonomy.
▪ Categories may or may not be metadata.
▪ Can also be automated or semi-automated.
Introduction
5
Introduction
6
Categories vs. Tags
Examples of both categories and tags within the same applications
Introduction
7
• What topics the
content contains
• Like an index
• More specific
• More numerous
• Overlapping
• Unstructured
• Less controlled
• Ad hoc
• Supports searching &
filtering
• What “buckets” the
content goes into
• Like a table of contents
• Relatively broad
• Limited in number
• Mutually exclusive
• Sometimes hierarchical
• More controlled
• Pre-planned
• Supports browsing &
filtering
Categories vs. Tags vs. Index terms
Categories Tags Index terms
• What topics the
content contains
• For an index
• More specific
• More numerous
• Overlapping
• Structured
• More controlled
• Pre-planned
• Supports browsing,
searching & filtering
Introduction
8
• Assigning any terms
desired
• Used by authors and
editors
• Tends to inconsistent
terms and indexing
• Responsive to trends
and dynamic
• May supplement a
controlled vocabulary
• Using only pre-approved
terms
• Used by indexers and
content managers
• Ensures consistent
indexing
• Slower to change and
updates
Indexing or tagging with a controlled vocabulary or not
Controlled vocabulary Keywords Folksonomy
• Assigning any terms and
reusing terms
• Used by authors, editors,
content managers, users
• Tends to inconsistent
terms and indexing
• Responsive to trends
and dynamic
• May supplement a
controlled vocabulary
• More collaborative as
“social tagging”
Taxonomy design for manual indexing
▪ Use of alternative labels/nonpreferred terms
(considering also search or browse UI, from start of term)
▪ Use of associative (related term – RT) relationships in addition to hierarchical
▪ Scope notes, dedicated Indexer notes, occasional definitions of terms
▪ Grouped distinct term sets, hierarchies, or facets for comprehensive indexing
(even if distinct term sets or facets are not supported in the end-user interface)
Taxonomy Design and Display for Indexing
9
Indexing user interface and experience (UI/UX) with taxonomy
Tagging interfaces of a commercial CMS are not user friendly.
For large volume manual tagging, develop your own.
Desirable features
▪ Both alphabetical and hierarchical browse options
▪ Alphabetical browse with alternative labels/nonpreferred terms
▪ Various search options: Begins with, Word/phrase within, Exact, Smart
▪ Exact term matches are validated and don’t require searching/browsing
▪ Shortcuts (abbreviations) for commonly indexed terms
▪ Auto-conversion of selected alternative labels/nonpreferred terms to preferred
▪ Indexing steps with keyboard shortcuts, and not just mouse, for speed
Taxonomy Design and Display for Indexing
10
Indexing UI display
Screenshot example
(Gale/Cengage internal)
Taxonomy Design and Display for Indexing
11
Indexing policy, rules, documentation, should cover:
▪ Criteria for determining topic or name relevancy for indexing
▪ Depth, level of detail
▪ Comprehensiveness of aspects (what, who, where, when, how, why, etc.)
▪ Required term types/facets (and any dependencies)
▪ Number of terms (of each type)
▪ Indexing of certain terms in combination
e.g.: a parent/broader term in addition to its narrower/child term
▪ Other required metadata to enter
➢ Recommendations/guidelines and rules/requirements
Indexing Policy, Documentation and Training
12
Indexer training
▪ Instructing the indexing policy/guidelines as a live or web presentation
▪ Training with examples on indexing that captures the “aboutness” of a
document rather than matching words in the text to taxonomy terms.
▪ Reviewing sample indexing and providing feedback.
Indexing Policy, Documentation and Training
13
Feedback from indexing to improve the taxonomy
Often based on statistics on term usage in indexing
▪ Underused terms may need added alternative labels or relationships.
▪ Overused terms may need to be split into more specific terms.
▪ Misused terms may need rewording, scope note, and/or alternative labels.
▪ Correctly used low-use terms can be dropped.
Also based on indexers’ individual requests and queries
Indexing Policy, Documentation and Training
14
Indexer-taxonomist communication for new terms
▪ Taxonomist informs indexers of new and changed terms, and indexing tips
(combinations of terms) for indexing new or recurring topics
▪ Indexers request taxonomist to clarify terms or create new terms
Methods:
▪ email distribution lists
▪ Intranet bulletin posts
▪ collaboration workspace posts
▪ indexing software feature for new term nomination
Indexing Policy, Documentation and Training
15
Automated indexing/Auto-categorization/Auto-classification
2 primary methods: machine-learning and rules-based
Machine-learning based
Automatically categorizes/tags based on previous examples.
▪ System has complex mathematical algorithms.
▪ Content managers must provide multiple (10’s or more) representative sample
documents for each taxonomy term to “train” the system. Results are reviewed
and training sets are “tuned.”
▪ Matches are to terms and alternative labels, which can be individually weighted.
▪ System may also “suggest” additional terms to add to taxonomy.
▪ Best if large body of pre-indexed records already exists
(such as migrating from manual to automated indexing)
Automated Indexing Methods
16
Rules-based auto-indexing
Rules are created for each taxonomy term.
▪ Rules are based on synonyms with more conditions.
▪ Some systems feature weighting of synonyms.
▪ Some systems feature more sophisticated rule-writing, like advanced Boolean
searching (in reverse) and proximity operators or regular expressions.
▪ Some systems feature auto-generated suggested rules for each term/synonym
which can be manually edited in addition to writing rules from scratch.
Automated Indexing Methods
17
Manual tasks for automated indexing
Continual update work is needed for each new term created.
▪ New training documents need to be added and taxonomy terms tuned.
▪ New rules need to created or edited.
➢ Identifying and tuning training documents is more appropriate for subject
matter experts, editors, indexers.
➢ Writing rules is more appropriate for information professionals, taxonomists,
knowledge engineers.
Automated Indexing Methods
18
Benefits of manual indexing
▪ Can audit and check indexers’ work immediately and make corrections or give
instructions
▪ Can respond to indexers request for new tags quickly
▪ Can make and/or use compound headings
▪ Can handle terms that could go under multiple headings and make educated
and nuanced decision of where to index the information correctly
▪ Human interpretation of complex subjects is hard to automate with rules.
Autoindexing can lead to inconsistent and uncertain results for complex
subjects
▪ Higher levels of precision and recall: indexers’ inconsistencies are minor,
compared with potential automated indexing errors.
Manual Indexing
19
Benefits of manual indexing:
Process advantages
▪ Handles complex documents that require human interpretation to analyze
terms
▪ Can figure out indexing parameters as you go. Everything does not need to be
decided ahead of time-very responsive to change
▪ Make major structural changes right away because decision maker is right
there
▪ Clean out old, out-of-date, unusable tags while tagging
▪ Create new tags immediately when needed. Able to adapt and change
taxonomy while it is evolving with new information instead of later coming back
to find information to tag
Manual Indexing
20
Considerations in choosing an indexing method
Manual versus Automated Indexing
21
Manual methods
➢ Manageable number of documents
➢ Higher accuracy in indexing
➢ May include non-text files
➢ Investing in people
➢ Low-tech: can build your own
indexing tool/user interface
➢ Internal control
Automated methods
➢ Very large number of documents
➢ Greater speed in indexing
➢ Text files only
➢ Investing in technology
➢ High-tech: must purchase auto-
indexing/classification software
➢ Software vendor relationship
Who are indexers and where to find them
▪ Full-time staff (for ongoing indexing)
‒ Editorial or subject specialization background + thorough indexing training
▪ Freelance or contractors (for temporary or part-time projects)
‒ Look for those with periodical article/database indexing experience
‒ Many have back-of-the-book indexing experience only
(so would require some additional training)
▪ Subject matter experts or not
‒ Scholarly or highly technical documents require subject expertise
‒ General enterprise or public content does not require subject expertise
▪ Rates can be hourly or per indexed record/document
▪ American Society for Indexing -- for finding freelance/contract indexers
Finding Indexers
22
American Society for Indexing
▪ ASI Website screen shot
Indexers
23
www.asindexing.org
▪ Educational institution: unique scholarly article for public research access
▪ Professional indexer for first phase; will train others on subsequent phases
▪ Taxonomy consultant remained available throughout first phase indexing
▪ International organization: SharePoint intranet taxonomy
▪ Request for not just guidelines but also training for tagging
▪ Fortune 500 firm: enterprise taxonomy for tagging articles
Examples of Indexing Projects
24
Questions/Contact
25
Terry Casey
Taxonomy Consultant
Casey Indexing and Information
Service
Saint Paul, MN
651-278-2023
terry@caseyindex.com
www.caseyindex.com
www.linkedin.com/in/terry-casey
Heather Hedden
Taxonomy Consultant
Hedden Information Management
Carlisle, MA
978-467-5195
heather@hedden.net
www.hedden-information.com
accidental-taxonomist.blogspot.com
www.linkedin.com/in/hedden
Twitter: @hhedden

Weitere ähnliche Inhalte

Was ist angesagt?

A Brief Introduction to SKOS
A Brief Introduction to SKOSA Brief Introduction to SKOS
A Brief Introduction to SKOSHeather Hedden
 
Taxonomy Design for SharePoint
Taxonomy Design for SharePointTaxonomy Design for SharePoint
Taxonomy Design for SharePointHeather Hedden
 
Successful Content Management Through Taxonomy And Metadata Design
Successful Content Management Through Taxonomy And Metadata DesignSuccessful Content Management Through Taxonomy And Metadata Design
Successful Content Management Through Taxonomy And Metadata Designsarakirsten
 
Organizing Knowledge: A Knowledge Manager’s Primer to Taxonomy Development
Organizing Knowledge: A Knowledge Manager’s Primer to Taxonomy DevelopmentOrganizing Knowledge: A Knowledge Manager’s Primer to Taxonomy Development
Organizing Knowledge: A Knowledge Manager’s Primer to Taxonomy DevelopmentArt Schlussel
 
Taxonomies for E-commerce
Taxonomies for E-commerceTaxonomies for E-commerce
Taxonomies for E-commerceHeather Hedden
 
Taxonomies for Text Analytics and Auto-indexing
Taxonomies for Text Analytics and Auto-indexingTaxonomies for Text Analytics and Auto-indexing
Taxonomies for Text Analytics and Auto-indexingHeather Hedden
 
Taxonomies, Categories, and Tags in WordPress
Taxonomies, Categories, and Tags in WordPressTaxonomies, Categories, and Tags in WordPress
Taxonomies, Categories, and Tags in WordPressHeather Hedden
 
Taxonomies And Search Aiim Mn
Taxonomies And Search Aiim MnTaxonomies And Search Aiim Mn
Taxonomies And Search Aiim MnAIIM Minnesota
 
Why Are Taxonomies Necessary?
Why Are Taxonomies Necessary?Why Are Taxonomies Necessary?
Why Are Taxonomies Necessary?Fred Leise
 
Customer-Focused Thesauri
Customer-Focused ThesauriCustomer-Focused Thesauri
Customer-Focused ThesauriHeather Hedden
 
Synonyms, Alternative Labels, and Nonpreferred Terms
Synonyms, Alternative Labels, and Nonpreferred TermsSynonyms, Alternative Labels, and Nonpreferred Terms
Synonyms, Alternative Labels, and Nonpreferred TermsHeather Hedden
 
Taxonomies for Human vs Auto-Indexing
Taxonomies for Human vs Auto-IndexingTaxonomies for Human vs Auto-Indexing
Taxonomies for Human vs Auto-IndexingHeather Hedden
 
Exploring Web Pages' Semantics and a Subject Hierarchy for Supporting Persona...
Exploring Web Pages' Semantics and a Subject Hierarchy for Supporting Persona...Exploring Web Pages' Semantics and a Subject Hierarchy for Supporting Persona...
Exploring Web Pages' Semantics and a Subject Hierarchy for Supporting Persona...inscit2006
 
Analyse and present week 1 - topic 1.1
Analyse and present   week 1 - topic 1.1Analyse and present   week 1 - topic 1.1
Analyse and present week 1 - topic 1.110shilcl
 
Evaluating Taxonomies
Evaluating TaxonomiesEvaluating Taxonomies
Evaluating TaxonomiesJoseph Busch
 
NISO/NFAIS Supplemental Journal Article Materials Working Group (2011 CrossRe...
NISO/NFAIS Supplemental Journal Article Materials Working Group (2011 CrossRe...NISO/NFAIS Supplemental Journal Article Materials Working Group (2011 CrossRe...
NISO/NFAIS Supplemental Journal Article Materials Working Group (2011 CrossRe...Crossref
 
Taxonomy Development and Digital Projects
Taxonomy Development and Digital ProjectsTaxonomy Development and Digital Projects
Taxonomy Development and Digital Projects daniela barbosa
 
Practice point resource guide feb 2016
Practice point resource guide feb 2016Practice point resource guide feb 2016
Practice point resource guide feb 2016Jean-Paul de Laureal
 

Was ist angesagt? (20)

A Brief Introduction to SKOS
A Brief Introduction to SKOSA Brief Introduction to SKOS
A Brief Introduction to SKOS
 
Taxonomy Design for SharePoint
Taxonomy Design for SharePointTaxonomy Design for SharePoint
Taxonomy Design for SharePoint
 
Tools for Taxonomies
Tools for TaxonomiesTools for Taxonomies
Tools for Taxonomies
 
Successful Content Management Through Taxonomy And Metadata Design
Successful Content Management Through Taxonomy And Metadata DesignSuccessful Content Management Through Taxonomy And Metadata Design
Successful Content Management Through Taxonomy And Metadata Design
 
Taxonomy Fundamentals Workshop 2013
Taxonomy Fundamentals Workshop 2013Taxonomy Fundamentals Workshop 2013
Taxonomy Fundamentals Workshop 2013
 
Organizing Knowledge: A Knowledge Manager’s Primer to Taxonomy Development
Organizing Knowledge: A Knowledge Manager’s Primer to Taxonomy DevelopmentOrganizing Knowledge: A Knowledge Manager’s Primer to Taxonomy Development
Organizing Knowledge: A Knowledge Manager’s Primer to Taxonomy Development
 
Taxonomies for E-commerce
Taxonomies for E-commerceTaxonomies for E-commerce
Taxonomies for E-commerce
 
Taxonomies for Text Analytics and Auto-indexing
Taxonomies for Text Analytics and Auto-indexingTaxonomies for Text Analytics and Auto-indexing
Taxonomies for Text Analytics and Auto-indexing
 
Taxonomies, Categories, and Tags in WordPress
Taxonomies, Categories, and Tags in WordPressTaxonomies, Categories, and Tags in WordPress
Taxonomies, Categories, and Tags in WordPress
 
Taxonomies And Search Aiim Mn
Taxonomies And Search Aiim MnTaxonomies And Search Aiim Mn
Taxonomies And Search Aiim Mn
 
Why Are Taxonomies Necessary?
Why Are Taxonomies Necessary?Why Are Taxonomies Necessary?
Why Are Taxonomies Necessary?
 
Customer-Focused Thesauri
Customer-Focused ThesauriCustomer-Focused Thesauri
Customer-Focused Thesauri
 
Synonyms, Alternative Labels, and Nonpreferred Terms
Synonyms, Alternative Labels, and Nonpreferred TermsSynonyms, Alternative Labels, and Nonpreferred Terms
Synonyms, Alternative Labels, and Nonpreferred Terms
 
Taxonomies for Human vs Auto-Indexing
Taxonomies for Human vs Auto-IndexingTaxonomies for Human vs Auto-Indexing
Taxonomies for Human vs Auto-Indexing
 
Exploring Web Pages' Semantics and a Subject Hierarchy for Supporting Persona...
Exploring Web Pages' Semantics and a Subject Hierarchy for Supporting Persona...Exploring Web Pages' Semantics and a Subject Hierarchy for Supporting Persona...
Exploring Web Pages' Semantics and a Subject Hierarchy for Supporting Persona...
 
Analyse and present week 1 - topic 1.1
Analyse and present   week 1 - topic 1.1Analyse and present   week 1 - topic 1.1
Analyse and present week 1 - topic 1.1
 
Evaluating Taxonomies
Evaluating TaxonomiesEvaluating Taxonomies
Evaluating Taxonomies
 
NISO/NFAIS Supplemental Journal Article Materials Working Group (2011 CrossRe...
NISO/NFAIS Supplemental Journal Article Materials Working Group (2011 CrossRe...NISO/NFAIS Supplemental Journal Article Materials Working Group (2011 CrossRe...
NISO/NFAIS Supplemental Journal Article Materials Working Group (2011 CrossRe...
 
Taxonomy Development and Digital Projects
Taxonomy Development and Digital ProjectsTaxonomy Development and Digital Projects
Taxonomy Development and Digital Projects
 
Practice point resource guide feb 2016
Practice point resource guide feb 2016Practice point resource guide feb 2016
Practice point resource guide feb 2016
 

Ähnlich wie Managing Taxonomy Tagging

[AIIM17] Data Categorization You Can Live With - Monica Crocker
[AIIM17]  Data Categorization You Can Live With - Monica Crocker [AIIM17]  Data Categorization You Can Live With - Monica Crocker
[AIIM17] Data Categorization You Can Live With - Monica Crocker AIIM International
 
Why You Need Intelligent Metadata and Auto-classification in Records Management
Why You Need Intelligent Metadata and Auto-classification in Records ManagementWhy You Need Intelligent Metadata and Auto-classification in Records Management
Why You Need Intelligent Metadata and Auto-classification in Records ManagementConcept Searching, Inc
 
Introduction to Intranet Planning
Introduction to Intranet PlanningIntroduction to Intranet Planning
Introduction to Intranet PlanningHaaron Gonzalez
 
Information Architecture Exposing the Secret Sauce for Success
Information Architecture Exposing the Secret Sauce for Success Information Architecture Exposing the Secret Sauce for Success
Information Architecture Exposing the Secret Sauce for Success Baltimore SharePoint (BSPUG)
 
SharePoint Saturday London - The Nuts and Bolts of Metadata Tagging and Taxon...
SharePoint Saturday London - The Nuts and Bolts of Metadata Tagging and Taxon...SharePoint Saturday London - The Nuts and Bolts of Metadata Tagging and Taxon...
SharePoint Saturday London - The Nuts and Bolts of Metadata Tagging and Taxon...Concept Searching, Inc
 
Groundbreaking and Game-changing Enterprise Search Webinar
Groundbreaking and Game-changing Enterprise Search WebinarGroundbreaking and Game-changing Enterprise Search Webinar
Groundbreaking and Game-changing Enterprise Search WebinarConcept Searching, Inc
 
The Nuts and Bolts of Metadata Tagging and Taxonomies Made Easy Webinar
The Nuts and Bolts of Metadata Tagging and Taxonomies Made Easy WebinarThe Nuts and Bolts of Metadata Tagging and Taxonomies Made Easy Webinar
The Nuts and Bolts of Metadata Tagging and Taxonomies Made Easy WebinarConcept Searching, Inc
 
FEDSPUG Meeting: Intelligent Metadata and Auto-classification in Records Mana...
FEDSPUG Meeting: Intelligent Metadata and Auto-classification in Records Mana...FEDSPUG Meeting: Intelligent Metadata and Auto-classification in Records Mana...
FEDSPUG Meeting: Intelligent Metadata and Auto-classification in Records Mana...Concept Searching, Inc
 
SPTechCon Austin 2015 - Perfecting Information Architecture
SPTechCon Austin 2015 - Perfecting Information ArchitectureSPTechCon Austin 2015 - Perfecting Information Architecture
SPTechCon Austin 2015 - Perfecting Information ArchitectureJill Hannemann
 
SharePoint Information Architecture Best Practices
SharePoint Information Architecture Best PracticesSharePoint Information Architecture Best Practices
SharePoint Information Architecture Best PracticesStephanie Lemieux
 
DC SPUG Feb 2015 The Secret Sauce to Information Architecture
DC SPUG Feb 2015 The Secret Sauce to Information ArchitectureDC SPUG Feb 2015 The Secret Sauce to Information Architecture
DC SPUG Feb 2015 The Secret Sauce to Information ArchitectureJill Hannemann
 
Optimizing DITA Content for Search Engine Optimization tekom tcworld 2016
Optimizing DITA Content for Search Engine Optimization tekom tcworld 2016Optimizing DITA Content for Search Engine Optimization tekom tcworld 2016
Optimizing DITA Content for Search Engine Optimization tekom tcworld 2016IXIASOFT
 
Optimizing Your Content for Search
Optimizing Your Content for SearchOptimizing Your Content for Search
Optimizing Your Content for SearchSharon Weaver
 
How did you find that?! Optimizing your SharePoint content for search
How did you find that?! Optimizing your SharePoint content for search How did you find that?! Optimizing your SharePoint content for search
How did you find that?! Optimizing your SharePoint content for search Sharon Weaver
 
Developing retention rules that work
Developing retention rules that workDeveloping retention rules that work
Developing retention rules that workBecky Bertram
 
Enhancing Enterprise Search with Machine Learning - Simon Hughes, Dice.com
Enhancing Enterprise Search with Machine Learning - Simon Hughes, Dice.comEnhancing Enterprise Search with Machine Learning - Simon Hughes, Dice.com
Enhancing Enterprise Search with Machine Learning - Simon Hughes, Dice.comSimon Hughes
 
Going Meta – How to use Metadata in SharePoint
Going Meta – How to use Metadata in SharePointGoing Meta – How to use Metadata in SharePoint
Going Meta – How to use Metadata in SharePointConcept Searching, Inc
 
Jason lax sap - from seo practitioner to seo evangelist
Jason lax   sap - from seo practitioner to seo evangelistJason lax   sap - from seo practitioner to seo evangelist
Jason lax sap - from seo practitioner to seo evangelistBarry Schwartz
 
CapitalCamp DC 2012: Taxonomy
CapitalCamp DC 2012: TaxonomyCapitalCamp DC 2012: Taxonomy
CapitalCamp DC 2012: TaxonomyNatalya Minkovsky
 

Ähnlich wie Managing Taxonomy Tagging (20)

[AIIM17] Data Categorization You Can Live With - Monica Crocker
[AIIM17]  Data Categorization You Can Live With - Monica Crocker [AIIM17]  Data Categorization You Can Live With - Monica Crocker
[AIIM17] Data Categorization You Can Live With - Monica Crocker
 
Why You Need Intelligent Metadata and Auto-classification in Records Management
Why You Need Intelligent Metadata and Auto-classification in Records ManagementWhy You Need Intelligent Metadata and Auto-classification in Records Management
Why You Need Intelligent Metadata and Auto-classification in Records Management
 
Introduction to Intranet Planning
Introduction to Intranet PlanningIntroduction to Intranet Planning
Introduction to Intranet Planning
 
Information Architecture Exposing the Secret Sauce for Success
Information Architecture Exposing the Secret Sauce for Success Information Architecture Exposing the Secret Sauce for Success
Information Architecture Exposing the Secret Sauce for Success
 
SharePoint Saturday London - The Nuts and Bolts of Metadata Tagging and Taxon...
SharePoint Saturday London - The Nuts and Bolts of Metadata Tagging and Taxon...SharePoint Saturday London - The Nuts and Bolts of Metadata Tagging and Taxon...
SharePoint Saturday London - The Nuts and Bolts of Metadata Tagging and Taxon...
 
Groundbreaking and Game-changing Enterprise Search Webinar
Groundbreaking and Game-changing Enterprise Search WebinarGroundbreaking and Game-changing Enterprise Search Webinar
Groundbreaking and Game-changing Enterprise Search Webinar
 
The Nuts and Bolts of Metadata Tagging and Taxonomies Made Easy Webinar
The Nuts and Bolts of Metadata Tagging and Taxonomies Made Easy WebinarThe Nuts and Bolts of Metadata Tagging and Taxonomies Made Easy Webinar
The Nuts and Bolts of Metadata Tagging and Taxonomies Made Easy Webinar
 
FEDSPUG Meeting: Intelligent Metadata and Auto-classification in Records Mana...
FEDSPUG Meeting: Intelligent Metadata and Auto-classification in Records Mana...FEDSPUG Meeting: Intelligent Metadata and Auto-classification in Records Mana...
FEDSPUG Meeting: Intelligent Metadata and Auto-classification in Records Mana...
 
SPTechCon Austin 2015 - Perfecting Information Architecture
SPTechCon Austin 2015 - Perfecting Information ArchitectureSPTechCon Austin 2015 - Perfecting Information Architecture
SPTechCon Austin 2015 - Perfecting Information Architecture
 
SharePoint Information Architecture Best Practices
SharePoint Information Architecture Best PracticesSharePoint Information Architecture Best Practices
SharePoint Information Architecture Best Practices
 
DC SPUG Feb 2015 The Secret Sauce to Information Architecture
DC SPUG Feb 2015 The Secret Sauce to Information ArchitectureDC SPUG Feb 2015 The Secret Sauce to Information Architecture
DC SPUG Feb 2015 The Secret Sauce to Information Architecture
 
Testing Taxonomies
Testing TaxonomiesTesting Taxonomies
Testing Taxonomies
 
Optimizing DITA Content for Search Engine Optimization tekom tcworld 2016
Optimizing DITA Content for Search Engine Optimization tekom tcworld 2016Optimizing DITA Content for Search Engine Optimization tekom tcworld 2016
Optimizing DITA Content for Search Engine Optimization tekom tcworld 2016
 
Optimizing Your Content for Search
Optimizing Your Content for SearchOptimizing Your Content for Search
Optimizing Your Content for Search
 
How did you find that?! Optimizing your SharePoint content for search
How did you find that?! Optimizing your SharePoint content for search How did you find that?! Optimizing your SharePoint content for search
How did you find that?! Optimizing your SharePoint content for search
 
Developing retention rules that work
Developing retention rules that workDeveloping retention rules that work
Developing retention rules that work
 
Enhancing Enterprise Search with Machine Learning - Simon Hughes, Dice.com
Enhancing Enterprise Search with Machine Learning - Simon Hughes, Dice.comEnhancing Enterprise Search with Machine Learning - Simon Hughes, Dice.com
Enhancing Enterprise Search with Machine Learning - Simon Hughes, Dice.com
 
Going Meta – How to use Metadata in SharePoint
Going Meta – How to use Metadata in SharePointGoing Meta – How to use Metadata in SharePoint
Going Meta – How to use Metadata in SharePoint
 
Jason lax sap - from seo practitioner to seo evangelist
Jason lax   sap - from seo practitioner to seo evangelistJason lax   sap - from seo practitioner to seo evangelist
Jason lax sap - from seo practitioner to seo evangelist
 
CapitalCamp DC 2012: Taxonomy
CapitalCamp DC 2012: TaxonomyCapitalCamp DC 2012: Taxonomy
CapitalCamp DC 2012: Taxonomy
 

Mehr von Heather Hedden

Introduction to Knowledge Graphs for Information Architects.pdf
Introduction to Knowledge Graphs for Information Architects.pdfIntroduction to Knowledge Graphs for Information Architects.pdf
Introduction to Knowledge Graphs for Information Architects.pdfHeather Hedden
 
Thesauri for Indexing Support / Thesauri zur Unterstützung der Registererstel...
Thesauri for Indexing Support / Thesauri zur Unterstützung der Registererstel...Thesauri for Indexing Support / Thesauri zur Unterstützung der Registererstel...
Thesauri for Indexing Support / Thesauri zur Unterstützung der Registererstel...Heather Hedden
 
Managing Mature Taxonomies: Resolving Orphan Terms
Managing Mature Taxonomies: Resolving Orphan TermsManaging Mature Taxonomies: Resolving Orphan Terms
Managing Mature Taxonomies: Resolving Orphan TermsHeather Hedden
 
Taxonomy Displays: Bridging UX & Taxonomy Design
Taxonomy Displays: Bridging UX & Taxonomy DesignTaxonomy Displays: Bridging UX & Taxonomy Design
Taxonomy Displays: Bridging UX & Taxonomy DesignHeather Hedden
 
Mapping, Merging, and Multilingual Taxonomies
Mapping, Merging, and Multilingual TaxonomiesMapping, Merging, and Multilingual Taxonomies
Mapping, Merging, and Multilingual TaxonomiesHeather Hedden
 
Making Decisions in Creating Taxonomies
Making Decisions in Creating TaxonomiesMaking Decisions in Creating Taxonomies
Making Decisions in Creating TaxonomiesHeather Hedden
 

Mehr von Heather Hedden (6)

Introduction to Knowledge Graphs for Information Architects.pdf
Introduction to Knowledge Graphs for Information Architects.pdfIntroduction to Knowledge Graphs for Information Architects.pdf
Introduction to Knowledge Graphs for Information Architects.pdf
 
Thesauri for Indexing Support / Thesauri zur Unterstützung der Registererstel...
Thesauri for Indexing Support / Thesauri zur Unterstützung der Registererstel...Thesauri for Indexing Support / Thesauri zur Unterstützung der Registererstel...
Thesauri for Indexing Support / Thesauri zur Unterstützung der Registererstel...
 
Managing Mature Taxonomies: Resolving Orphan Terms
Managing Mature Taxonomies: Resolving Orphan TermsManaging Mature Taxonomies: Resolving Orphan Terms
Managing Mature Taxonomies: Resolving Orphan Terms
 
Taxonomy Displays: Bridging UX & Taxonomy Design
Taxonomy Displays: Bridging UX & Taxonomy DesignTaxonomy Displays: Bridging UX & Taxonomy Design
Taxonomy Displays: Bridging UX & Taxonomy Design
 
Mapping, Merging, and Multilingual Taxonomies
Mapping, Merging, and Multilingual TaxonomiesMapping, Merging, and Multilingual Taxonomies
Mapping, Merging, and Multilingual Taxonomies
 
Making Decisions in Creating Taxonomies
Making Decisions in Creating TaxonomiesMaking Decisions in Creating Taxonomies
Making Decisions in Creating Taxonomies
 

Kürzlich hochgeladen

Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 

Kürzlich hochgeladen (20)

Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 

Managing Taxonomy Tagging

  • 1. Managing Taxonomy Tagging Taxonomy Boot Camp Washington, DC November 4, 2019 Heather Hedden & Terry Casey
  • 2. Terry Casey ▪ Taxonomist – Independent consultant, Casey Indexing and Information Service – Currently contract staff taxonomist ▪ Back-of-the book indexer – Textbook, scholarly, trade book and periodical indexer. – Embedded indexes for digital publications About Us 2 Heather Hedden ▪ Taxonomist – Independent consultant, Hedden Information Management – Previously employed and contract consultant and staff taxonomist ▪ Former indexer – Periodical article indexer at library vendor IAC (acquired by Gale) – Freelance back-of-the-book indexer ▪ Author of The Accidental Taxonomist (2010, 2016, Information Today, Inc.)
  • 3. ▪ Introduction: Tagging, Indexing, Categorizing ▪ Taxonomy Design and Display for Indexing ▪ Indexing Policy, Documentation, and Training ▪ Automated Indexing Methods ▪ Manual Indexing ▪ Finding Indexers ▪ Examples of Indexing Projects Outline 3
  • 4. Tagging vs. indexing vs. categorizing/classifying Tagging – assigning metadata labels (“tags”) ▪ By identifying topics and names within a document or content item ▪ By content creators or editors (minimally trained in tagging), not as their primary job responsibility ▪ For metadata both with and without controlled vocabularies ▪ To support search ▪ Can also be semi-automated Indexing – assigning index terms (subject metadata and related elements) ▪ By identifying topics and names within a document or content item ▪ By trained indexers, often as their primary job responsibility ▪ By selecting terms from a large controlled vocabulary/thesaurus/taxonomy ▪ To create a browsable index (and now also to support search) ▪ Can also be semi-automated Introduction 4
  • 5. Tagging vs. indexing vs. categorizing/classifying Categorizing/classifying – organizing & assigning content into named categories ▪ By identifying which category a document or content item belongs within ▪ A feature of most content management systems, in addition to tagging. ▪ Often represented as virtual folders and subfolders. ▪ May be appropriate for Subjects or for Document Types. ▪ Content items can usually go into only one category, like classification. ▪ Categories are multi-level hierarchical. ▪ Category hierarchy is designed as a hierarchical taxonomy. ▪ Categories may or may not be metadata. ▪ Can also be automated or semi-automated. Introduction 5
  • 6. Introduction 6 Categories vs. Tags Examples of both categories and tags within the same applications
  • 7. Introduction 7 • What topics the content contains • Like an index • More specific • More numerous • Overlapping • Unstructured • Less controlled • Ad hoc • Supports searching & filtering • What “buckets” the content goes into • Like a table of contents • Relatively broad • Limited in number • Mutually exclusive • Sometimes hierarchical • More controlled • Pre-planned • Supports browsing & filtering Categories vs. Tags vs. Index terms Categories Tags Index terms • What topics the content contains • For an index • More specific • More numerous • Overlapping • Structured • More controlled • Pre-planned • Supports browsing, searching & filtering
  • 8. Introduction 8 • Assigning any terms desired • Used by authors and editors • Tends to inconsistent terms and indexing • Responsive to trends and dynamic • May supplement a controlled vocabulary • Using only pre-approved terms • Used by indexers and content managers • Ensures consistent indexing • Slower to change and updates Indexing or tagging with a controlled vocabulary or not Controlled vocabulary Keywords Folksonomy • Assigning any terms and reusing terms • Used by authors, editors, content managers, users • Tends to inconsistent terms and indexing • Responsive to trends and dynamic • May supplement a controlled vocabulary • More collaborative as “social tagging”
  • 9. Taxonomy design for manual indexing ▪ Use of alternative labels/nonpreferred terms (considering also search or browse UI, from start of term) ▪ Use of associative (related term – RT) relationships in addition to hierarchical ▪ Scope notes, dedicated Indexer notes, occasional definitions of terms ▪ Grouped distinct term sets, hierarchies, or facets for comprehensive indexing (even if distinct term sets or facets are not supported in the end-user interface) Taxonomy Design and Display for Indexing 9
  • 10. Indexing user interface and experience (UI/UX) with taxonomy Tagging interfaces of a commercial CMS are not user friendly. For large volume manual tagging, develop your own. Desirable features ▪ Both alphabetical and hierarchical browse options ▪ Alphabetical browse with alternative labels/nonpreferred terms ▪ Various search options: Begins with, Word/phrase within, Exact, Smart ▪ Exact term matches are validated and don’t require searching/browsing ▪ Shortcuts (abbreviations) for commonly indexed terms ▪ Auto-conversion of selected alternative labels/nonpreferred terms to preferred ▪ Indexing steps with keyboard shortcuts, and not just mouse, for speed Taxonomy Design and Display for Indexing 10
  • 11. Indexing UI display Screenshot example (Gale/Cengage internal) Taxonomy Design and Display for Indexing 11
  • 12. Indexing policy, rules, documentation, should cover: ▪ Criteria for determining topic or name relevancy for indexing ▪ Depth, level of detail ▪ Comprehensiveness of aspects (what, who, where, when, how, why, etc.) ▪ Required term types/facets (and any dependencies) ▪ Number of terms (of each type) ▪ Indexing of certain terms in combination e.g.: a parent/broader term in addition to its narrower/child term ▪ Other required metadata to enter ➢ Recommendations/guidelines and rules/requirements Indexing Policy, Documentation and Training 12
  • 13. Indexer training ▪ Instructing the indexing policy/guidelines as a live or web presentation ▪ Training with examples on indexing that captures the “aboutness” of a document rather than matching words in the text to taxonomy terms. ▪ Reviewing sample indexing and providing feedback. Indexing Policy, Documentation and Training 13
  • 14. Feedback from indexing to improve the taxonomy Often based on statistics on term usage in indexing ▪ Underused terms may need added alternative labels or relationships. ▪ Overused terms may need to be split into more specific terms. ▪ Misused terms may need rewording, scope note, and/or alternative labels. ▪ Correctly used low-use terms can be dropped. Also based on indexers’ individual requests and queries Indexing Policy, Documentation and Training 14
  • 15. Indexer-taxonomist communication for new terms ▪ Taxonomist informs indexers of new and changed terms, and indexing tips (combinations of terms) for indexing new or recurring topics ▪ Indexers request taxonomist to clarify terms or create new terms Methods: ▪ email distribution lists ▪ Intranet bulletin posts ▪ collaboration workspace posts ▪ indexing software feature for new term nomination Indexing Policy, Documentation and Training 15
  • 16. Automated indexing/Auto-categorization/Auto-classification 2 primary methods: machine-learning and rules-based Machine-learning based Automatically categorizes/tags based on previous examples. ▪ System has complex mathematical algorithms. ▪ Content managers must provide multiple (10’s or more) representative sample documents for each taxonomy term to “train” the system. Results are reviewed and training sets are “tuned.” ▪ Matches are to terms and alternative labels, which can be individually weighted. ▪ System may also “suggest” additional terms to add to taxonomy. ▪ Best if large body of pre-indexed records already exists (such as migrating from manual to automated indexing) Automated Indexing Methods 16
  • 17. Rules-based auto-indexing Rules are created for each taxonomy term. ▪ Rules are based on synonyms with more conditions. ▪ Some systems feature weighting of synonyms. ▪ Some systems feature more sophisticated rule-writing, like advanced Boolean searching (in reverse) and proximity operators or regular expressions. ▪ Some systems feature auto-generated suggested rules for each term/synonym which can be manually edited in addition to writing rules from scratch. Automated Indexing Methods 17
  • 18. Manual tasks for automated indexing Continual update work is needed for each new term created. ▪ New training documents need to be added and taxonomy terms tuned. ▪ New rules need to created or edited. ➢ Identifying and tuning training documents is more appropriate for subject matter experts, editors, indexers. ➢ Writing rules is more appropriate for information professionals, taxonomists, knowledge engineers. Automated Indexing Methods 18
  • 19. Benefits of manual indexing ▪ Can audit and check indexers’ work immediately and make corrections or give instructions ▪ Can respond to indexers request for new tags quickly ▪ Can make and/or use compound headings ▪ Can handle terms that could go under multiple headings and make educated and nuanced decision of where to index the information correctly ▪ Human interpretation of complex subjects is hard to automate with rules. Autoindexing can lead to inconsistent and uncertain results for complex subjects ▪ Higher levels of precision and recall: indexers’ inconsistencies are minor, compared with potential automated indexing errors. Manual Indexing 19
  • 20. Benefits of manual indexing: Process advantages ▪ Handles complex documents that require human interpretation to analyze terms ▪ Can figure out indexing parameters as you go. Everything does not need to be decided ahead of time-very responsive to change ▪ Make major structural changes right away because decision maker is right there ▪ Clean out old, out-of-date, unusable tags while tagging ▪ Create new tags immediately when needed. Able to adapt and change taxonomy while it is evolving with new information instead of later coming back to find information to tag Manual Indexing 20
  • 21. Considerations in choosing an indexing method Manual versus Automated Indexing 21 Manual methods ➢ Manageable number of documents ➢ Higher accuracy in indexing ➢ May include non-text files ➢ Investing in people ➢ Low-tech: can build your own indexing tool/user interface ➢ Internal control Automated methods ➢ Very large number of documents ➢ Greater speed in indexing ➢ Text files only ➢ Investing in technology ➢ High-tech: must purchase auto- indexing/classification software ➢ Software vendor relationship
  • 22. Who are indexers and where to find them ▪ Full-time staff (for ongoing indexing) ‒ Editorial or subject specialization background + thorough indexing training ▪ Freelance or contractors (for temporary or part-time projects) ‒ Look for those with periodical article/database indexing experience ‒ Many have back-of-the-book indexing experience only (so would require some additional training) ▪ Subject matter experts or not ‒ Scholarly or highly technical documents require subject expertise ‒ General enterprise or public content does not require subject expertise ▪ Rates can be hourly or per indexed record/document ▪ American Society for Indexing -- for finding freelance/contract indexers Finding Indexers 22
  • 23. American Society for Indexing ▪ ASI Website screen shot Indexers 23 www.asindexing.org
  • 24. ▪ Educational institution: unique scholarly article for public research access ▪ Professional indexer for first phase; will train others on subsequent phases ▪ Taxonomy consultant remained available throughout first phase indexing ▪ International organization: SharePoint intranet taxonomy ▪ Request for not just guidelines but also training for tagging ▪ Fortune 500 firm: enterprise taxonomy for tagging articles Examples of Indexing Projects 24
  • 25. Questions/Contact 25 Terry Casey Taxonomy Consultant Casey Indexing and Information Service Saint Paul, MN 651-278-2023 terry@caseyindex.com www.caseyindex.com www.linkedin.com/in/terry-casey Heather Hedden Taxonomy Consultant Hedden Information Management Carlisle, MA 978-467-5195 heather@hedden.net www.hedden-information.com accidental-taxonomist.blogspot.com www.linkedin.com/in/hedden Twitter: @hhedden