SlideShare ist ein Scribd-Unternehmen logo
1 von 24
ROI & Impact: Quantitative &
Qualitative Measures for
Taxonomies
Wednesday, 11 February 2009
12:00 – 12:30 PM MST
Presented by Jay Ven Eman, Ph.D., CEO
Access Innovations, Inc. / Data Harmony
505.998.0800 / www.accessinn.com / www.dataharmony.com
j_ven_eman@accessinn.com
DHUG 2009
First, some questions
 Do you know what a taxonomy is?
 Does your boss’s boss know? Care?
 What are YOU trying to accomplish?
 What are your objectives?
 What isn’t working? What is?
 How badly?
 How much?
 Who? Where?
Copyright © 2007 Access Innovations, Inc.
First, some questions - 2
 Who are your searchers?
 Internal? Intranet?
 External? Web? Fee based (commercial)?
 How many?
 What do they do? How do they do it?
 What are they seeking?
 Why?
Copyright © 2007 Access Innovations, Inc.
First, some questions - 3
 Where are they looking?
 How many searching environments?
 Physical?
 Internal resources?
 External resources?
 Search interfaces?
 And so on…
Copyright © 2007 Access Innovations, Inc.
Copyright © 2007 Access Innovations, Inc.
“Meaning” starts with a knowledge
organization system (KOS)
 Uncontrolled list
 Name authority file
 Synonym set/ring
 Controlled vocabulary
 Taxonomy
 Thesaurus
Not complex - $
Highly complex - $$$$
LOTS OF OVERLAP!
Topic MapOntology
SKOS
The Pain of Search
Copyright © 2007 Access Innovations, Inc.
The Pain of
Search
Percent
Number of
Employees
Search &
Use Timel
Per Week
Time
Searching
Per Week
Time
Analysing
Per Week
Average
Loaded
Salary
Annual Cost
of Looking
Search Time
Reduction Difference
Mission
critical 1000 Hours Hours Hours
$ Per
Hour 10%
High 10 100 14 8.4 5.6 200 8,736,000 7,862,400 873,600
Medium 80 800 12 7.2 4.8 150 44,928,000 40,435,200 4,492,800
Low 10 100 10 6 4 100 3,120,000 2,808,000 312,000
$56,784,000 $51,105,600 $5,678,400
ROI - Segments
 Cost of taxonomy system
 Indexing costs
 Cost of getting system ready
 Ongoing maintenance
 Increased efficiency
 Increased quality of retrieval
 Cost of legacy system maintenance
Copyright © 2005 Access Innovations, Inc.
Taxonomy construction
Process Terms/hr # of
terms
Cost/hr Cost
From scratch 4 5000 $75 $93,750
License 0 - 100K
License & customize 6 5000 75 62,500+
5,000
Auto-
generate/cleanup +
tool
6 5000 75 62,500+
100,000
Mapping 8 5000 75 46,875
Indexing & Search Metrics
 Hit, Miss, Noise
 Subjective
 Relevance
 Aboutness
 Statistical
 Precision
 Recall
 Level of effort
Hit, Miss, Noise
 Hit – exactly what a human indexer would use
 Miss – human indexer would use but system
did not assign
 Noise – system assigned but human did not
 Relevant noise – could have been assigned
 Irrelevant noise – just plain wrong
Subjective
 Relevance
 Reflects how akin it is to the users request
 Aboutness
 Reflects the topical match between the document
content and the term
 How well the topic describes what the document is
about
 Varies with level of conceptual terms vs. factual
terms in the thesaurus
Subjective
 “There is now a 92% accuracy rating accuracy on accounting and
regulatory document search based on hit, miss and noise or
relevance, precision and recall statistics…Access Innovations.”
USGAO
 “IEEE had their system up and running in three days, in full
production in less than two weeks.” Institute of Electrical and
Electronics Engineers (IEEE)
 “The American Economic Association said its editors think using it
is fun and makes time fly!” American Economic Association (AEA)
 “ ProQuest CSA have achieved a 7 fold increase in productivity –
thus they have four licenses.” ProQuest CSA
 “Weather Channel finds things 50% faster using Data Harmony. A
significant saving in time.” The Weather Channel
Statistical
 Precision
 Correct retrieval / Total retrieval
 Hits / hits + noise
 Recall
 Correct retrieval / Total correct in system
 Hits / Hits + misses
 Level of effort
 Hits / Hits + misses + noise
Cost Goals
 Cost Savings
 Software/hardware
 More efficient delivery systems
 Retirement of legacy systems
 Cost Avoidance
 Additional staff not needed to scale
 Lower training costs
Productivity Goals
 Productivity gains
 Employee productivity – fourfold
 Get up to speed faster
 Learn vocabulary faster
 Able to capture peoples knowledge in the
rule base
 Staff savings / redeployment
 Elimination of new hires
Additional Benefits
 Revenue Generation
 Higher hit rates
 More purchases off the site
 Competitive advantage
 Shorter product / sales cycles
 Faster implementation
 Better search experience
 Ability to meet regulatory requirements
Go – No Go
 Reach 85% precision to launch for
productivity - assisted
 Reach 85% for filtering or categorization
 Sorting for production
 Level of effort to get to 85%
 Integration into the workflow is efficient
Benchmarks
 15 – 20% irrelevant returns / noise
 Amount of work needed to achieve 85%
level
 How good is good enough?
 Satisfice = satisfaction + suffice
 How much error can you put up with?
Example ROI Calculation
 Assume – 5,000 term thesaurus
 1.5 synonyms per terms
 7,500 terms total
 Assume 85% accuracy
 Use assisted for indexing
 Use automatically for filtering
 Assume $75 per hour for staff
 Assume 10,000 records for test batch
Indexing costs with Data Harmony
 80% of rules built automatically
 7,500 x .8 = 6,000
 20% require complex rules
 Average rule takes 5 minutes
 (Actually MUCH faster using M.A.I. GUI)
 5 x 1,500 = 7,500 minutes
 125 hours x $75 = $9,375
Indexing Costs
 Base cost of MAIstro EE - $60,000
 Cost of getting system ready
 Programming support and integration
 Estimated at 2 weeks programming $125 / hour = $10,000
 Rule building
 Estimated at 125 hours $75 / hour = $9,375
 Possible need to re-run training set several times
 Ongoing maintenance
 Estimated at 15% of purchase price for license = $9,000
 Rule building for new terms 50 terms per quarter
 200 terms x .8 = 160 automatic
 40 at 5 minutes per term = 200 minutes /60 = 3.33 hours x $75 =
$250
 Targeted initial accuracy at 85%
Indexing costs
 Year one
 $60,000 + $10,000 + $9,375 = $79,375
 Years thereafter
 9000 + 250 = $9250
 85% accuracy
ROI
 Taxonomy costs = $67,500
 Indexing costs = $79,375
 Pain of search – difference = $5,678,400
 If off by factor of 4, then a positive ROI of
241%
Copyright © 2007 Access Innovations, Inc.
ROI & Impact: Quantitative &
Qualitative Measures for
Taxonomies
Wednesday, 11 February 2009
12:00 – 12:30 PM MST
Presented by Jay Ven Eman, Ph.D., CEO
Access Innovations, Inc. / Data Harmony
505.998.0800 / www.accessinn.com / www.dataharmony.com
j_ven_eman@accessinn.com
Thank you!

Weitere ähnliche Inhalte

Andere mochten auch

The JTHES as Part of the Intelligence Layer for the Sustainability Collection...
The JTHES as Part of the Intelligence Layer for the Sustainability Collection...The JTHES as Part of the Intelligence Layer for the Sustainability Collection...
The JTHES as Part of the Intelligence Layer for the Sustainability Collection...Access Innovations, Inc.
 
Case Study: Integrating Data Harmony Terms and the eJournalPress Peer Review...
Case Study:  Integrating Data Harmony Terms and the eJournalPress Peer Review...Case Study:  Integrating Data Harmony Terms and the eJournalPress Peer Review...
Case Study: Integrating Data Harmony Terms and the eJournalPress Peer Review...Access Innovations, Inc.
 
NFAIS 2014 Miles Conrad Award Lecture, Presented by Marjorie M.K. Hlava
NFAIS 2014 Miles Conrad Award Lecture, Presented by Marjorie M.K. HlavaNFAIS 2014 Miles Conrad Award Lecture, Presented by Marjorie M.K. Hlava
NFAIS 2014 Miles Conrad Award Lecture, Presented by Marjorie M.K. HlavaAccess Innovations, Inc.
 
Leveraging Your Taxonomy With Navtree and MAIQuery
Leveraging Your Taxonomy With Navtree and MAIQueryLeveraging Your Taxonomy With Navtree and MAIQuery
Leveraging Your Taxonomy With Navtree and MAIQueryAccess Innovations, Inc.
 
Developing the AIP Thesaurus: The Platform for an Ontology
Developing the AIP Thesaurus: The Platform for an OntologyDeveloping the AIP Thesaurus: The Platform for an Ontology
Developing the AIP Thesaurus: The Platform for an OntologyAccess Innovations, Inc.
 
The Business Case for Enterprise Search
The Business Case for Enterprise SearchThe Business Case for Enterprise Search
The Business Case for Enterprise SearchRBC
 
Implementing a Taxonomy in a Content Management Portal
Implementing a Taxonomy in a Content Management PortalImplementing a Taxonomy in a Content Management Portal
Implementing a Taxonomy in a Content Management PortalAccess Innovations, Inc.
 
Sampling Methods in Qualitative and Quantitative Research
Sampling Methods in Qualitative and Quantitative ResearchSampling Methods in Qualitative and Quantitative Research
Sampling Methods in Qualitative and Quantitative ResearchSam Ladner
 

Andere mochten auch (10)

The JTHES as Part of the Intelligence Layer for the Sustainability Collection...
The JTHES as Part of the Intelligence Layer for the Sustainability Collection...The JTHES as Part of the Intelligence Layer for the Sustainability Collection...
The JTHES as Part of the Intelligence Layer for the Sustainability Collection...
 
Case Study: Integrating Data Harmony Terms and the eJournalPress Peer Review...
Case Study:  Integrating Data Harmony Terms and the eJournalPress Peer Review...Case Study:  Integrating Data Harmony Terms and the eJournalPress Peer Review...
Case Study: Integrating Data Harmony Terms and the eJournalPress Peer Review...
 
NFAIS 2014 Miles Conrad Award Lecture, Presented by Marjorie M.K. Hlava
NFAIS 2014 Miles Conrad Award Lecture, Presented by Marjorie M.K. HlavaNFAIS 2014 Miles Conrad Award Lecture, Presented by Marjorie M.K. Hlava
NFAIS 2014 Miles Conrad Award Lecture, Presented by Marjorie M.K. Hlava
 
Leveraging Your Taxonomy With Navtree and MAIQuery
Leveraging Your Taxonomy With Navtree and MAIQueryLeveraging Your Taxonomy With Navtree and MAIQuery
Leveraging Your Taxonomy With Navtree and MAIQuery
 
I Don’t Have Time for Metadata!
I Don’t Have Time for Metadata!I Don’t Have Time for Metadata!
I Don’t Have Time for Metadata!
 
Developing the AIP Thesaurus: The Platform for an Ontology
Developing the AIP Thesaurus: The Platform for an OntologyDeveloping the AIP Thesaurus: The Platform for an Ontology
Developing the AIP Thesaurus: The Platform for an Ontology
 
The Business Case for Enterprise Search
The Business Case for Enterprise SearchThe Business Case for Enterprise Search
The Business Case for Enterprise Search
 
Implementing a Taxonomy in a Content Management Portal
Implementing a Taxonomy in a Content Management PortalImplementing a Taxonomy in a Content Management Portal
Implementing a Taxonomy in a Content Management Portal
 
Taxonomy Fundamentals - SLA 2014
Taxonomy Fundamentals - SLA 2014Taxonomy Fundamentals - SLA 2014
Taxonomy Fundamentals - SLA 2014
 
Sampling Methods in Qualitative and Quantitative Research
Sampling Methods in Qualitative and Quantitative ResearchSampling Methods in Qualitative and Quantitative Research
Sampling Methods in Qualitative and Quantitative Research
 

Ähnlich wie ROI & Impact Measures for Taxonomies

Presentation Tms Webinar V08
Presentation Tms Webinar V08Presentation Tms Webinar V08
Presentation Tms Webinar V08CarolaMoore
 
Presentation Tms Webinar V08
Presentation Tms Webinar V08Presentation Tms Webinar V08
Presentation Tms Webinar V08CarolaMoore
 
How Judson ISD Implemented and Tracks IT Metrics & Key Performance Indicators
How Judson ISD Implemented and Tracks IT Metrics & Key Performance IndicatorsHow Judson ISD Implemented and Tracks IT Metrics & Key Performance Indicators
How Judson ISD Implemented and Tracks IT Metrics & Key Performance IndicatorsSteve Young
 
Justifying Taxonomy Projects: Taxonomy Boot Camp 2009
Justifying Taxonomy Projects: Taxonomy Boot Camp 2009Justifying Taxonomy Projects: Taxonomy Boot Camp 2009
Justifying Taxonomy Projects: Taxonomy Boot Camp 2009Earley Information Science
 
Finding The Agile Sweet Spot
Finding The Agile Sweet SpotFinding The Agile Sweet Spot
Finding The Agile Sweet SpotCharles Husemann
 
Coradiant
CoradiantCoradiant
Coradiantgigamon
 
Web Performance Analysis - TCF Pro 2009
Web Performance Analysis - TCF Pro 2009Web Performance Analysis - TCF Pro 2009
Web Performance Analysis - TCF Pro 2009Guy Ferraiolo
 
Show Me the Money: Connecting Performance Engineering to Real Business Results
Show Me the Money: Connecting Performance Engineering to Real Business ResultsShow Me the Money: Connecting Performance Engineering to Real Business Results
Show Me the Money: Connecting Performance Engineering to Real Business ResultsCorrelsense
 
The case for continuous delivery
The case for continuous deliveryThe case for continuous delivery
The case for continuous deliveryCodecamp Romania
 
The case for continuous delivery
The case for continuous deliveryThe case for continuous delivery
The case for continuous deliveryCodecamp Romania
 
DevOps Deep Dive Webinar: Building a business case for agile and devops
DevOps Deep Dive Webinar: Building a business case for agile and devopsDevOps Deep Dive Webinar: Building a business case for agile and devops
DevOps Deep Dive Webinar: Building a business case for agile and devopsBasis Technologies
 
Applicant Tracking System Business Case
Applicant Tracking System Business CaseApplicant Tracking System Business Case
Applicant Tracking System Business CaseHolly DeMuro, MBA
 
Testing – Why We Do It Badly2
Testing – Why We Do It Badly2Testing – Why We Do It Badly2
Testing – Why We Do It Badly2adevney
 
Best Practices for Rating and Policy Administration System Replacement
Best Practices for Rating and Policy Administration System ReplacementBest Practices for Rating and Policy Administration System Replacement
Best Practices for Rating and Policy Administration System ReplacementEdgewater
 
Service industry metrics
Service industry metricsService industry metrics
Service industry metricsDan Wilson
 
Business Process Improvement
Business Process ImprovementBusiness Process Improvement
Business Process ImprovementAnand Subramaniam
 
IT Alignment and The Cloud
IT Alignment and The CloudIT Alignment and The Cloud
IT Alignment and The CloudSteve McDonell
 
Work Measurement and Operational Effectiveness
Work Measurement and Operational EffectivenessWork Measurement and Operational Effectiveness
Work Measurement and Operational Effectivenessgrubinm
 
Live Conversation: Cut your customer interview costs by up to 90%
Live Conversation: Cut your customer interview costs by up to 90%Live Conversation: Cut your customer interview costs by up to 90%
Live Conversation: Cut your customer interview costs by up to 90%UserTesting
 

Ähnlich wie ROI & Impact Measures for Taxonomies (20)

Presentation Tms Webinar V08
Presentation Tms Webinar V08Presentation Tms Webinar V08
Presentation Tms Webinar V08
 
Presentation Tms Webinar V08
Presentation Tms Webinar V08Presentation Tms Webinar V08
Presentation Tms Webinar V08
 
How Judson ISD Implemented and Tracks IT Metrics & Key Performance Indicators
How Judson ISD Implemented and Tracks IT Metrics & Key Performance IndicatorsHow Judson ISD Implemented and Tracks IT Metrics & Key Performance Indicators
How Judson ISD Implemented and Tracks IT Metrics & Key Performance Indicators
 
Justifying Taxonomy Projects: Taxonomy Boot Camp 2009
Justifying Taxonomy Projects: Taxonomy Boot Camp 2009Justifying Taxonomy Projects: Taxonomy Boot Camp 2009
Justifying Taxonomy Projects: Taxonomy Boot Camp 2009
 
Finding The Agile Sweet Spot
Finding The Agile Sweet SpotFinding The Agile Sweet Spot
Finding The Agile Sweet Spot
 
Coradiant
CoradiantCoradiant
Coradiant
 
Web Performance Analysis - TCF Pro 2009
Web Performance Analysis - TCF Pro 2009Web Performance Analysis - TCF Pro 2009
Web Performance Analysis - TCF Pro 2009
 
Show Me the Money: Connecting Performance Engineering to Real Business Results
Show Me the Money: Connecting Performance Engineering to Real Business ResultsShow Me the Money: Connecting Performance Engineering to Real Business Results
Show Me the Money: Connecting Performance Engineering to Real Business Results
 
The case for continuous delivery
The case for continuous deliveryThe case for continuous delivery
The case for continuous delivery
 
The case for continuous delivery
The case for continuous deliveryThe case for continuous delivery
The case for continuous delivery
 
DevOps Deep Dive Webinar: Building a business case for agile and devops
DevOps Deep Dive Webinar: Building a business case for agile and devopsDevOps Deep Dive Webinar: Building a business case for agile and devops
DevOps Deep Dive Webinar: Building a business case for agile and devops
 
Applicant Tracking System Business Case
Applicant Tracking System Business CaseApplicant Tracking System Business Case
Applicant Tracking System Business Case
 
Testing – Why We Do It Badly2
Testing – Why We Do It Badly2Testing – Why We Do It Badly2
Testing – Why We Do It Badly2
 
Best Practices for Rating and Policy Administration System Replacement
Best Practices for Rating and Policy Administration System ReplacementBest Practices for Rating and Policy Administration System Replacement
Best Practices for Rating and Policy Administration System Replacement
 
Service industry metrics
Service industry metricsService industry metrics
Service industry metrics
 
Business Process Improvement
Business Process ImprovementBusiness Process Improvement
Business Process Improvement
 
IT Alignment and The Cloud
IT Alignment and The CloudIT Alignment and The Cloud
IT Alignment and The Cloud
 
Work Measurement and Operational Effectiveness
Work Measurement and Operational EffectivenessWork Measurement and Operational Effectiveness
Work Measurement and Operational Effectiveness
 
Building Your Roadmap Sucessful Identity And Access Management
Building Your Roadmap Sucessful Identity And Access ManagementBuilding Your Roadmap Sucessful Identity And Access Management
Building Your Roadmap Sucessful Identity And Access Management
 
Live Conversation: Cut your customer interview costs by up to 90%
Live Conversation: Cut your customer interview costs by up to 90%Live Conversation: Cut your customer interview costs by up to 90%
Live Conversation: Cut your customer interview costs by up to 90%
 

Mehr von Access Innovations, Inc.

Making AI Behave: Using Knowledge Domains to Produce Useful, Trustworthy Results
Making AI Behave: Using Knowledge Domains to Produce Useful, Trustworthy ResultsMaking AI Behave: Using Knowledge Domains to Produce Useful, Trustworthy Results
Making AI Behave: Using Knowledge Domains to Produce Useful, Trustworthy ResultsAccess Innovations, Inc.
 
ISO 25964-1Working Group ISO/TC 46/SC 9/WG 8
ISO 25964-1Working Group ISO/TC 46/SC 9/WG 8ISO 25964-1Working Group ISO/TC 46/SC 9/WG 8
ISO 25964-1Working Group ISO/TC 46/SC 9/WG 8Access Innovations, Inc.
 
Hindawi taxonomy and personalization 27.10 (1)
Hindawi taxonomy and personalization 27.10 (1)Hindawi taxonomy and personalization 27.10 (1)
Hindawi taxonomy and personalization 27.10 (1)Access Innovations, Inc.
 
Asco using ai-taxos-for meta-titles-february-2021
Asco using ai-taxos-for meta-titles-february-2021Asco using ai-taxos-for meta-titles-february-2021
Asco using ai-taxos-for meta-titles-february-2021Access Innovations, Inc.
 
Ai webinar 2 -what's in a name (consolidated pdf)
Ai webinar 2 -what's in a name (consolidated pdf)Ai webinar 2 -what's in a name (consolidated pdf)
Ai webinar 2 -what's in a name (consolidated pdf)Access Innovations, Inc.
 
Tagging overview - Why Keywords Don't Cut It
Tagging overview  - Why Keywords Don't Cut ItTagging overview  - Why Keywords Don't Cut It
Tagging overview - Why Keywords Don't Cut ItAccess Innovations, Inc.
 
DHUG 2018: Towards Web-Centric Repository Interoperability
DHUG 2018: Towards Web-Centric Repository InteroperabilityDHUG 2018: Towards Web-Centric Repository Interoperability
DHUG 2018: Towards Web-Centric Repository InteroperabilityAccess Innovations, Inc.
 
DHUG 2017 - Understanding ROI Just Enough to Get Your Project Funded
DHUG 2017 - Understanding ROI Just Enough to Get Your Project FundedDHUG 2017 - Understanding ROI Just Enough to Get Your Project Funded
DHUG 2017 - Understanding ROI Just Enough to Get Your Project FundedAccess Innovations, Inc.
 

Mehr von Access Innovations, Inc. (20)

Making AI Behave: Using Knowledge Domains to Produce Useful, Trustworthy Results
Making AI Behave: Using Knowledge Domains to Produce Useful, Trustworthy ResultsMaking AI Behave: Using Knowledge Domains to Produce Useful, Trustworthy Results
Making AI Behave: Using Knowledge Domains to Produce Useful, Trustworthy Results
 
ISO 25964-1Working Group ISO/TC 46/SC 9/WG 8
ISO 25964-1Working Group ISO/TC 46/SC 9/WG 8ISO 25964-1Working Group ISO/TC 46/SC 9/WG 8
ISO 25964-1Working Group ISO/TC 46/SC 9/WG 8
 
Smart submit
Smart submitSmart submit
Smart submit
 
Plos taxonomy beyond search dhug 2021
Plos taxonomy beyond search   dhug 2021Plos taxonomy beyond search   dhug 2021
Plos taxonomy beyond search dhug 2021
 
Hindawi taxonomy and personalization 27.10 (1)
Hindawi taxonomy and personalization 27.10 (1)Hindawi taxonomy and personalization 27.10 (1)
Hindawi taxonomy and personalization 27.10 (1)
 
Data harmonycloudpowerpointclientfacing
Data harmonycloudpowerpointclientfacingData harmonycloudpowerpointclientfacing
Data harmonycloudpowerpointclientfacing
 
Data harmony update 2021
Data harmony update 2021 Data harmony update 2021
Data harmony update 2021
 
Atypon dhug2021
Atypon dhug2021Atypon dhug2021
Atypon dhug2021
 
Asco using ai-taxos-for meta-titles-february-2021
Asco using ai-taxos-for meta-titles-february-2021Asco using ai-taxos-for meta-titles-february-2021
Asco using ai-taxos-for meta-titles-february-2021
 
Asce more than just topic taxonomies
Asce more than just topic taxonomiesAsce more than just topic taxonomies
Asce more than just topic taxonomies
 
Acs discoverability-dhug2021
Acs discoverability-dhug2021Acs discoverability-dhug2021
Acs discoverability-dhug2021
 
Ai webinar 2 -what's in a name (consolidated pdf)
Ai webinar 2 -what's in a name (consolidated pdf)Ai webinar 2 -what's in a name (consolidated pdf)
Ai webinar 2 -what's in a name (consolidated pdf)
 
Tagging overview - Why Keywords Don't Cut It
Tagging overview  - Why Keywords Don't Cut ItTagging overview  - Why Keywords Don't Cut It
Tagging overview - Why Keywords Don't Cut It
 
Health Affairs - Why Keywords Don't Cut It
Health Affairs - Why Keywords Don't Cut ItHealth Affairs - Why Keywords Don't Cut It
Health Affairs - Why Keywords Don't Cut It
 
Why Keywords Don't Cut It
Why Keywords Don't Cut ItWhy Keywords Don't Cut It
Why Keywords Don't Cut It
 
Data Harmony update 2020 final
Data Harmony update 2020 finalData Harmony update 2020 final
Data Harmony update 2020 final
 
Data Harmony Update 2020 final
Data Harmony Update 2020 finalData Harmony Update 2020 final
Data Harmony Update 2020 final
 
DHUG 2018: Towards Web-Centric Repository Interoperability
DHUG 2018: Towards Web-Centric Repository InteroperabilityDHUG 2018: Towards Web-Centric Repository Interoperability
DHUG 2018: Towards Web-Centric Repository Interoperability
 
DHUG 2018 - Florida Thesis OCR
DHUG 2018 - Florida Thesis OCRDHUG 2018 - Florida Thesis OCR
DHUG 2018 - Florida Thesis OCR
 
DHUG 2017 - Understanding ROI Just Enough to Get Your Project Funded
DHUG 2017 - Understanding ROI Just Enough to Get Your Project FundedDHUG 2017 - Understanding ROI Just Enough to Get Your Project Funded
DHUG 2017 - Understanding ROI Just Enough to Get Your Project Funded
 

Kürzlich hochgeladen

AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptxAUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptxiammrhaywood
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17Celine George
 
Integumentary System SMP B. Pharm Sem I.ppt
Integumentary System SMP B. Pharm Sem I.pptIntegumentary System SMP B. Pharm Sem I.ppt
Integumentary System SMP B. Pharm Sem I.pptshraddhaparab530
 
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfGrade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfJemuel Francisco
 
TEACHER REFLECTION FORM (NEW SET........).docx
TEACHER REFLECTION FORM (NEW SET........).docxTEACHER REFLECTION FORM (NEW SET........).docx
TEACHER REFLECTION FORM (NEW SET........).docxruthvilladarez
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxAnupkumar Sharma
 
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptxQ4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptxlancelewisportillo
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Celine George
 
ClimART Action | eTwinning Project
ClimART Action    |    eTwinning ProjectClimART Action    |    eTwinning Project
ClimART Action | eTwinning Projectjordimapav
 
The Contemporary World: The Globalization of World Politics
The Contemporary World: The Globalization of World PoliticsThe Contemporary World: The Globalization of World Politics
The Contemporary World: The Globalization of World PoliticsRommel Regala
 
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONTHEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONHumphrey A Beña
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPCeline George
 
Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Seán Kennedy
 
4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptxmary850239
 
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Celine George
 
ICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfVanessa Camilleri
 
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...JojoEDelaCruz
 
Activity 2-unit 2-update 2024. English translation
Activity 2-unit 2-update 2024. English translationActivity 2-unit 2-update 2024. English translation
Activity 2-unit 2-update 2024. English translationRosabel UA
 

Kürzlich hochgeladen (20)

AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptxAUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptx
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17
 
Integumentary System SMP B. Pharm Sem I.ppt
Integumentary System SMP B. Pharm Sem I.pptIntegumentary System SMP B. Pharm Sem I.ppt
Integumentary System SMP B. Pharm Sem I.ppt
 
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfGrade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
 
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptxLEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
 
TEACHER REFLECTION FORM (NEW SET........).docx
TEACHER REFLECTION FORM (NEW SET........).docxTEACHER REFLECTION FORM (NEW SET........).docx
TEACHER REFLECTION FORM (NEW SET........).docx
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
 
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptxQ4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17
 
ClimART Action | eTwinning Project
ClimART Action    |    eTwinning ProjectClimART Action    |    eTwinning Project
ClimART Action | eTwinning Project
 
The Contemporary World: The Globalization of World Politics
The Contemporary World: The Globalization of World PoliticsThe Contemporary World: The Globalization of World Politics
The Contemporary World: The Globalization of World Politics
 
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONTHEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERP
 
Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...
 
4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx
 
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
 
ICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdf
 
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
 
Activity 2-unit 2-update 2024. English translation
Activity 2-unit 2-update 2024. English translationActivity 2-unit 2-update 2024. English translation
Activity 2-unit 2-update 2024. English translation
 
INCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptx
INCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptxINCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptx
INCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptx
 

ROI & Impact Measures for Taxonomies

  • 1. ROI & Impact: Quantitative & Qualitative Measures for Taxonomies Wednesday, 11 February 2009 12:00 – 12:30 PM MST Presented by Jay Ven Eman, Ph.D., CEO Access Innovations, Inc. / Data Harmony 505.998.0800 / www.accessinn.com / www.dataharmony.com j_ven_eman@accessinn.com DHUG 2009
  • 2. First, some questions  Do you know what a taxonomy is?  Does your boss’s boss know? Care?  What are YOU trying to accomplish?  What are your objectives?  What isn’t working? What is?  How badly?  How much?  Who? Where? Copyright © 2007 Access Innovations, Inc.
  • 3. First, some questions - 2  Who are your searchers?  Internal? Intranet?  External? Web? Fee based (commercial)?  How many?  What do they do? How do they do it?  What are they seeking?  Why? Copyright © 2007 Access Innovations, Inc.
  • 4. First, some questions - 3  Where are they looking?  How many searching environments?  Physical?  Internal resources?  External resources?  Search interfaces?  And so on… Copyright © 2007 Access Innovations, Inc.
  • 5. Copyright © 2007 Access Innovations, Inc. “Meaning” starts with a knowledge organization system (KOS)  Uncontrolled list  Name authority file  Synonym set/ring  Controlled vocabulary  Taxonomy  Thesaurus Not complex - $ Highly complex - $$$$ LOTS OF OVERLAP! Topic MapOntology SKOS
  • 6. The Pain of Search Copyright © 2007 Access Innovations, Inc. The Pain of Search Percent Number of Employees Search & Use Timel Per Week Time Searching Per Week Time Analysing Per Week Average Loaded Salary Annual Cost of Looking Search Time Reduction Difference Mission critical 1000 Hours Hours Hours $ Per Hour 10% High 10 100 14 8.4 5.6 200 8,736,000 7,862,400 873,600 Medium 80 800 12 7.2 4.8 150 44,928,000 40,435,200 4,492,800 Low 10 100 10 6 4 100 3,120,000 2,808,000 312,000 $56,784,000 $51,105,600 $5,678,400
  • 7. ROI - Segments  Cost of taxonomy system  Indexing costs  Cost of getting system ready  Ongoing maintenance  Increased efficiency  Increased quality of retrieval  Cost of legacy system maintenance
  • 8. Copyright © 2005 Access Innovations, Inc. Taxonomy construction Process Terms/hr # of terms Cost/hr Cost From scratch 4 5000 $75 $93,750 License 0 - 100K License & customize 6 5000 75 62,500+ 5,000 Auto- generate/cleanup + tool 6 5000 75 62,500+ 100,000 Mapping 8 5000 75 46,875
  • 9. Indexing & Search Metrics  Hit, Miss, Noise  Subjective  Relevance  Aboutness  Statistical  Precision  Recall  Level of effort
  • 10. Hit, Miss, Noise  Hit – exactly what a human indexer would use  Miss – human indexer would use but system did not assign  Noise – system assigned but human did not  Relevant noise – could have been assigned  Irrelevant noise – just plain wrong
  • 11. Subjective  Relevance  Reflects how akin it is to the users request  Aboutness  Reflects the topical match between the document content and the term  How well the topic describes what the document is about  Varies with level of conceptual terms vs. factual terms in the thesaurus
  • 12. Subjective  “There is now a 92% accuracy rating accuracy on accounting and regulatory document search based on hit, miss and noise or relevance, precision and recall statistics…Access Innovations.” USGAO  “IEEE had their system up and running in three days, in full production in less than two weeks.” Institute of Electrical and Electronics Engineers (IEEE)  “The American Economic Association said its editors think using it is fun and makes time fly!” American Economic Association (AEA)  “ ProQuest CSA have achieved a 7 fold increase in productivity – thus they have four licenses.” ProQuest CSA  “Weather Channel finds things 50% faster using Data Harmony. A significant saving in time.” The Weather Channel
  • 13. Statistical  Precision  Correct retrieval / Total retrieval  Hits / hits + noise  Recall  Correct retrieval / Total correct in system  Hits / Hits + misses  Level of effort  Hits / Hits + misses + noise
  • 14. Cost Goals  Cost Savings  Software/hardware  More efficient delivery systems  Retirement of legacy systems  Cost Avoidance  Additional staff not needed to scale  Lower training costs
  • 15. Productivity Goals  Productivity gains  Employee productivity – fourfold  Get up to speed faster  Learn vocabulary faster  Able to capture peoples knowledge in the rule base  Staff savings / redeployment  Elimination of new hires
  • 16. Additional Benefits  Revenue Generation  Higher hit rates  More purchases off the site  Competitive advantage  Shorter product / sales cycles  Faster implementation  Better search experience  Ability to meet regulatory requirements
  • 17. Go – No Go  Reach 85% precision to launch for productivity - assisted  Reach 85% for filtering or categorization  Sorting for production  Level of effort to get to 85%  Integration into the workflow is efficient
  • 18. Benchmarks  15 – 20% irrelevant returns / noise  Amount of work needed to achieve 85% level  How good is good enough?  Satisfice = satisfaction + suffice  How much error can you put up with?
  • 19. Example ROI Calculation  Assume – 5,000 term thesaurus  1.5 synonyms per terms  7,500 terms total  Assume 85% accuracy  Use assisted for indexing  Use automatically for filtering  Assume $75 per hour for staff  Assume 10,000 records for test batch
  • 20. Indexing costs with Data Harmony  80% of rules built automatically  7,500 x .8 = 6,000  20% require complex rules  Average rule takes 5 minutes  (Actually MUCH faster using M.A.I. GUI)  5 x 1,500 = 7,500 minutes  125 hours x $75 = $9,375
  • 21. Indexing Costs  Base cost of MAIstro EE - $60,000  Cost of getting system ready  Programming support and integration  Estimated at 2 weeks programming $125 / hour = $10,000  Rule building  Estimated at 125 hours $75 / hour = $9,375  Possible need to re-run training set several times  Ongoing maintenance  Estimated at 15% of purchase price for license = $9,000  Rule building for new terms 50 terms per quarter  200 terms x .8 = 160 automatic  40 at 5 minutes per term = 200 minutes /60 = 3.33 hours x $75 = $250  Targeted initial accuracy at 85%
  • 22. Indexing costs  Year one  $60,000 + $10,000 + $9,375 = $79,375  Years thereafter  9000 + 250 = $9250  85% accuracy
  • 23. ROI  Taxonomy costs = $67,500  Indexing costs = $79,375  Pain of search – difference = $5,678,400  If off by factor of 4, then a positive ROI of 241% Copyright © 2007 Access Innovations, Inc.
  • 24. ROI & Impact: Quantitative & Qualitative Measures for Taxonomies Wednesday, 11 February 2009 12:00 – 12:30 PM MST Presented by Jay Ven Eman, Ph.D., CEO Access Innovations, Inc. / Data Harmony 505.998.0800 / www.accessinn.com / www.dataharmony.com j_ven_eman@accessinn.com Thank you!