SlideShare ist ein Scribd-Unternehmen logo
1 von 25
Data Management for Citizen Science

Challenges & Opportunities for USGS Leadership


Andrea Wiggins
Postdoctoral Fellow
DataONE & Cornell Lab of Ornithology

12 September, 2012
USGS CDI Citizen Science workshop
DataONE PPSR Working Group
Purpose:
 • Improve quality, quantity, and accessibility of PPSR data
 • Advance integration of PPSR data in conventional science


Products:
 • Data Management Guide for PPSR - coming soon!
 • Articles in August FREE special issue
 • Data quality & validation paper




                                                               2
How long will it                      What is a data
          take to get                         management
         enough data?                            plan?
                                   Plan

                     Analyze                Collect       How can I assure
                                                        quality of volunteers’
 What tools
                                                                data?
  do I use?


               Integrate                          Assure

                                                            What data about
                                                           volunteers should I
Who can help
                                                             keep or share?
   me?
                     Discover               Describe

                                 Preserve         Should I share
          What if the data are                    raw data with
          used for commercial                     known errors?
                 profit?
How long will it                      What is a data
          take to get                         management
         enough data?                            plan?
                                   Plan

                     Analyze                Collect       How can I assure
                                                             quality of
 What tools
                                                         volunteers’ data?
  do I use?


               Integrate                          Assure

                                                            What data about
                                                           volunteers should
Who can help
                                                            I keep or share?
   me?
                     Discover               Describe

                                 Preserve         Should I share
          What if the data are                    raw data with
          used for commercial                     known errors?
                 profit?
Citizen science data challenges
Data policies

Cyberinfrastructure

Data quality




                                  5
Policy? What policy?
Data policies = boring




          http://www.flickr.com/photos/escapist/107455718/




                                                             6
Policy? What policy?
Data policies = boring

Data policies = hard
 • Ownership, sharing, use, access, challenge, etc.
 • Lots of decisions, vague consequences




                                                      7
Policy? What policy?
Data policies = boring

Data policies = hard
 • Ownership, sharing, use, access, challenge, etc.
 • Lots of decisions, vague consequences


Need examples of carefully crafted policies
 • Story of the data + policy that resulted
 • USGS is way ahead of the game!

                                                      8
Cyberinfrastructure
Technology is a major pain point




                                   9
Cyberinfrastructure
Technology is a major pain point

Platforms needed
  • Transcription, observation, processing
  • Ongoing support & development required




                                             10
Cyberinfrastructure
Technology is a major pain point

Platforms needed
  • Transcription, observation, processing
  • Ongoing support & development required

Who is going to pay?
 • <insert sound of crickets here>



                              http://www.flickr.com/photos/gravitywave/1303504847/   11
Data quality perceptions
No more reinvention
 • The data are as good as your project design
 • Reuse protocols & technologies
 • Replicability -> reliability




                                                 12
Data quality perceptions
No more reinvention
 • The data are as good as your project design
 • Reuse protocols & technologies
 • Replicability -> reliability


No more excuses
 • All scientific data have errors
 • Our data are just like yours...except we have more friends
 • Document data collection & QA/QC in excruciating detail


                                                                13
Survey says...




                 14
Survey says...
Least satisfied with current:
  • Process for sharing project data with colleagues,
    researchers, and/or participants
  • Ways of presenting project data/results to participants




                                                              15
Survey says...
Least satisfied with current:
  • Process for sharing project data with colleagues,
    researchers, and/or participants
  • Ways of presenting project data/results to participants

Better data management planning than average
 • 1/3 had NO data management plan at all!
 • Government-funded projects: yes, for some data




                                                              16
Survey says...
Tools & resources strongly desired across categories,
especially:
 • Analyzing & visualizing data
 • Documenting & describing data
 • Training




                                                        17
Survey says...
Tools & resources strongly desired across categories,
especially:
 • Analyzing & visualizing data
 • Documenting & describing data
 • Training


Top priorities for improvement (high agreement)
 1. Analyzing & visualizing data
 2. Documenting & describing data
 3. Long-term storage
 4. Establishing & updating data policies
                                                        18
Leading the way




                  19
Leading the way
Be an exemplar in data sharing & community building




                                                      20
Leading the way
Be an exemplar in data sharing & community building

Make your data policies easy to find & emulate




                                                      21
Leading the way
Be an exemplar in data sharing & community building

Make your data policies easy to find & emulate

Share your platforms with everyone, not just New Zealand!




                                                            22
Leading the way
Be an exemplar in data sharing & community building

Make your data policies easy to find & emulate

Share your platforms with everyone, not just New Zealand!

Make data quality obvious




                                                            23
Leading the way
Be an exemplar in data sharing & community building

Make your data policies easy to find & emulate

Share your platforms with everyone, not just New Zealand!

Make data quality obvious

USGS brings more credibility to citizen science


                                                            24
Thanks!
andrea.wiggins@cornell.edu
@AndreaWiggins

dataone.org
birds.cornell.edu
citizenscience.org
andreawiggins.com




                             25

Weitere ähnliche Inhalte

Was ist angesagt?

Citizen Science Phenotypes
Citizen Science PhenotypesCitizen Science Phenotypes
Citizen Science PhenotypesAndrea Wiggins
 
Free as in Puppies: Compensating for ICT Constraints in Citizen Science
Free as in Puppies: Compensating for ICT Constraints in Citizen ScienceFree as in Puppies: Compensating for ICT Constraints in Citizen Science
Free as in Puppies: Compensating for ICT Constraints in Citizen ScienceAndrea Wiggins
 
Data Intensive Collaboration in Science and Engineering: CSCW workshop themes
Data Intensive Collaboration in Science and Engineering: CSCW workshop themesData Intensive Collaboration in Science and Engineering: CSCW workshop themes
Data Intensive Collaboration in Science and Engineering: CSCW workshop themesAndrea Wiggins
 
Citizen science
Citizen scienceCitizen science
Citizen sciencesamar1407
 
4-H and Citizen Science Basics
4-H and Citizen Science Basics4-H and Citizen Science Basics
4-H and Citizen Science BasicsCitizenScience.org
 
Outcomes for citizen science at science centers
Outcomes for citizen science at science centersOutcomes for citizen science at science centers
Outcomes for citizen science at science centersCitizenScience.org
 
Ian Thornhill Citizen Science Training Day
Ian Thornhill Citizen Science Training DayIan Thornhill Citizen Science Training Day
Ian Thornhill Citizen Science Training DayAlice Sheppard
 
What's up at Kno.e.sis?
What's up at Kno.e.sis? What's up at Kno.e.sis?
What's up at Kno.e.sis? Amit Sheth
 
Why do citizen science at science centers?
Why do citizen science at science centers?Why do citizen science at science centers?
Why do citizen science at science centers?CitizenScience.org
 
Activities for citizen science at science centers
Activities for citizen science at science centersActivities for citizen science at science centers
Activities for citizen science at science centersCitizenScience.org
 
EPA 2013 Air Sensors Meeting Big Data Talk
EPA 2013 Air Sensors Meeting Big Data TalkEPA 2013 Air Sensors Meeting Big Data Talk
EPA 2013 Air Sensors Meeting Big Data TalkAdina Chuang Howe
 
Data Processing and Semantics for Advanced Internet of Things (IoT) Applicati...
Data Processing and Semantics for Advanced Internet of Things (IoT) Applicati...Data Processing and Semantics for Advanced Internet of Things (IoT) Applicati...
Data Processing and Semantics for Advanced Internet of Things (IoT) Applicati...Artificial Intelligence Institute at UofSC
 
Developing Staff Competencies in Emerging Technologies
Developing Staff Competencies in Emerging TechnologiesDeveloping Staff Competencies in Emerging Technologies
Developing Staff Competencies in Emerging TechnologiesDouglas Joubert
 
Newsletter 2013-fall
Newsletter 2013-fallNewsletter 2013-fall
Newsletter 2013-fallHoa Bien
 
Supporting Libraries in Leading the Way in Research Data Management
Supporting Libraries in Leading the Way in Research Data ManagementSupporting Libraries in Leading the Way in Research Data Management
Supporting Libraries in Leading the Way in Research Data ManagementMarieke Guy
 

Was ist angesagt? (20)

Citizen Science Phenotypes
Citizen Science PhenotypesCitizen Science Phenotypes
Citizen Science Phenotypes
 
Free as in Puppies: Compensating for ICT Constraints in Citizen Science
Free as in Puppies: Compensating for ICT Constraints in Citizen ScienceFree as in Puppies: Compensating for ICT Constraints in Citizen Science
Free as in Puppies: Compensating for ICT Constraints in Citizen Science
 
Data Intensive Collaboration in Science and Engineering: CSCW workshop themes
Data Intensive Collaboration in Science and Engineering: CSCW workshop themesData Intensive Collaboration in Science and Engineering: CSCW workshop themes
Data Intensive Collaboration in Science and Engineering: CSCW workshop themes
 
Little eScience
Little eScienceLittle eScience
Little eScience
 
Crowdsourcing Science
Crowdsourcing ScienceCrowdsourcing Science
Crowdsourcing Science
 
Citizen science
Citizen scienceCitizen science
Citizen science
 
4-H and Citizen Science Basics
4-H and Citizen Science Basics4-H and Citizen Science Basics
4-H and Citizen Science Basics
 
Engaging the software in research community
Engaging the software in research communityEngaging the software in research community
Engaging the software in research community
 
Oess NCRM Festival
Oess NCRM FestivalOess NCRM Festival
Oess NCRM Festival
 
Outcomes for citizen science at science centers
Outcomes for citizen science at science centersOutcomes for citizen science at science centers
Outcomes for citizen science at science centers
 
Ian Thornhill Citizen Science Training Day
Ian Thornhill Citizen Science Training DayIan Thornhill Citizen Science Training Day
Ian Thornhill Citizen Science Training Day
 
What's up at Kno.e.sis?
What's up at Kno.e.sis? What's up at Kno.e.sis?
What's up at Kno.e.sis?
 
Why do citizen science at science centers?
Why do citizen science at science centers?Why do citizen science at science centers?
Why do citizen science at science centers?
 
Activities for citizen science at science centers
Activities for citizen science at science centersActivities for citizen science at science centers
Activities for citizen science at science centers
 
EPA 2013 Air Sensors Meeting Big Data Talk
EPA 2013 Air Sensors Meeting Big Data TalkEPA 2013 Air Sensors Meeting Big Data Talk
EPA 2013 Air Sensors Meeting Big Data Talk
 
Data Processing and Semantics for Advanced Internet of Things (IoT) Applicati...
Data Processing and Semantics for Advanced Internet of Things (IoT) Applicati...Data Processing and Semantics for Advanced Internet of Things (IoT) Applicati...
Data Processing and Semantics for Advanced Internet of Things (IoT) Applicati...
 
Knoesis Student Achievement
Knoesis Student AchievementKnoesis Student Achievement
Knoesis Student Achievement
 
Developing Staff Competencies in Emerging Technologies
Developing Staff Competencies in Emerging TechnologiesDeveloping Staff Competencies in Emerging Technologies
Developing Staff Competencies in Emerging Technologies
 
Newsletter 2013-fall
Newsletter 2013-fallNewsletter 2013-fall
Newsletter 2013-fall
 
Supporting Libraries in Leading the Way in Research Data Management
Supporting Libraries in Leading the Way in Research Data ManagementSupporting Libraries in Leading the Way in Research Data Management
Supporting Libraries in Leading the Way in Research Data Management
 

Andere mochten auch

Crowdsourcing Citizen Science Data Quality with a Human-Computer Learning Net...
Crowdsourcing Citizen Science Data Quality with a Human-Computer Learning Net...Crowdsourcing Citizen Science Data Quality with a Human-Computer Learning Net...
Crowdsourcing Citizen Science Data Quality with a Human-Computer Learning Net...Andrea Wiggins
 
Code for Africa - Building Demand-driven + Citizen-focused Open Data Ecosystems
Code for Africa - Building Demand-driven + Citizen-focused Open Data EcosystemsCode for Africa - Building Demand-driven + Citizen-focused Open Data Ecosystems
Code for Africa - Building Demand-driven + Citizen-focused Open Data EcosystemsJustin Arenstein
 
Enterprise 2.0 - Enabling change or part of the problem?
Enterprise 2.0 - Enabling change or part of the problem?Enterprise 2.0 - Enabling change or part of the problem?
Enterprise 2.0 - Enabling change or part of the problem?Stephen Collins
 
The Road to Identity 2.0
The Road to Identity 2.0The Road to Identity 2.0
The Road to Identity 2.0Adam Lewis
 
digital identity 2.0: how technology is transforming behaviours and raising c...
digital identity 2.0: how technology is transforming behaviours and raising c...digital identity 2.0: how technology is transforming behaviours and raising c...
digital identity 2.0: how technology is transforming behaviours and raising c...Patrick McCormick
 
National identity strategy presentation may 10, 2016
National identity strategy  presentation may 10, 2016National identity strategy  presentation may 10, 2016
National identity strategy presentation may 10, 2016Guy Huntington
 
Canberra Executive Breakfast - A Citizen-Centric Approach to Identity
Canberra Executive Breakfast - A Citizen-Centric Approach to Identity Canberra Executive Breakfast - A Citizen-Centric Approach to Identity
Canberra Executive Breakfast - A Citizen-Centric Approach to Identity ForgeRock
 
Trends in IRM: Internet of Things
Trends in IRM: Internet of ThingsTrends in IRM: Internet of Things
Trends in IRM: Internet of ThingsForgeRock
 
User Authentication for Government
User Authentication for GovernmentUser Authentication for Government
User Authentication for GovernmentCarahsoft
 
The Rise of the Citizen Data Scientist
The Rise of the Citizen Data ScientistThe Rise of the Citizen Data Scientist
The Rise of the Citizen Data ScientistPlatfora
 
The connected economy mark skilton july 15 bright talk v2
The connected economy mark skilton july 15   bright talk v2The connected economy mark skilton july 15   bright talk v2
The connected economy mark skilton july 15 bright talk v2Mark Skilton
 
Digital Transformation: Connected API Ecosystems
Digital Transformation: Connected API EcosystemsDigital Transformation: Connected API Ecosystems
Digital Transformation: Connected API EcosystemsHARMAN Services
 
New Zealand: Proactively Preparing for a More Sustainable Future
New Zealand: Proactively Preparing for a More Sustainable FutureNew Zealand: Proactively Preparing for a More Sustainable Future
New Zealand: Proactively Preparing for a More Sustainable FutureLCANZ
 
IR-website. Investor Relations. What to do online? Nov 2010 - eng
IR-website. Investor Relations. What to do online? Nov 2010 - engIR-website. Investor Relations. What to do online? Nov 2010 - eng
IR-website. Investor Relations. What to do online? Nov 2010 - engAndrey Podderegin
 
World Economic Forum Global Risks 2015 Report - A Review
World Economic Forum Global Risks 2015 Report - A ReviewWorld Economic Forum Global Risks 2015 Report - A Review
World Economic Forum Global Risks 2015 Report - A ReviewNavDhami
 
A Collective, merit-based approach to Managing Workforce Adjustment, Canada
A Collective, merit-based approach to Managing Workforce Adjustment, CanadaA Collective, merit-based approach to Managing Workforce Adjustment, Canada
A Collective, merit-based approach to Managing Workforce Adjustment, CanadaUNDP India
 
National Trade Facilitation Strategy and Roadmap
National Trade Facilitation Strategy and RoadmapNational Trade Facilitation Strategy and Roadmap
National Trade Facilitation Strategy and RoadmapNotis Mitarachi
 

Andere mochten auch (20)

Crowdsourcing Citizen Science Data Quality with a Human-Computer Learning Net...
Crowdsourcing Citizen Science Data Quality with a Human-Computer Learning Net...Crowdsourcing Citizen Science Data Quality with a Human-Computer Learning Net...
Crowdsourcing Citizen Science Data Quality with a Human-Computer Learning Net...
 
Code for Africa - Building Demand-driven + Citizen-focused Open Data Ecosystems
Code for Africa - Building Demand-driven + Citizen-focused Open Data EcosystemsCode for Africa - Building Demand-driven + Citizen-focused Open Data Ecosystems
Code for Africa - Building Demand-driven + Citizen-focused Open Data Ecosystems
 
Enterprise 2.0 - Enabling change or part of the problem?
Enterprise 2.0 - Enabling change or part of the problem?Enterprise 2.0 - Enabling change or part of the problem?
Enterprise 2.0 - Enabling change or part of the problem?
 
SCC2013 - Citizen science - Helen Roy
SCC2013 - Citizen science - Helen RoySCC2013 - Citizen science - Helen Roy
SCC2013 - Citizen science - Helen Roy
 
The Road to Identity 2.0
The Road to Identity 2.0The Road to Identity 2.0
The Road to Identity 2.0
 
digital identity 2.0: how technology is transforming behaviours and raising c...
digital identity 2.0: how technology is transforming behaviours and raising c...digital identity 2.0: how technology is transforming behaviours and raising c...
digital identity 2.0: how technology is transforming behaviours and raising c...
 
National identity strategy presentation may 10, 2016
National identity strategy  presentation may 10, 2016National identity strategy  presentation may 10, 2016
National identity strategy presentation may 10, 2016
 
Canberra Executive Breakfast - A Citizen-Centric Approach to Identity
Canberra Executive Breakfast - A Citizen-Centric Approach to Identity Canberra Executive Breakfast - A Citizen-Centric Approach to Identity
Canberra Executive Breakfast - A Citizen-Centric Approach to Identity
 
Trends in IRM: Internet of Things
Trends in IRM: Internet of ThingsTrends in IRM: Internet of Things
Trends in IRM: Internet of Things
 
User Authentication for Government
User Authentication for GovernmentUser Authentication for Government
User Authentication for Government
 
The Rise of the Citizen Data Scientist
The Rise of the Citizen Data ScientistThe Rise of the Citizen Data Scientist
The Rise of the Citizen Data Scientist
 
The connected economy mark skilton july 15 bright talk v2
The connected economy mark skilton july 15   bright talk v2The connected economy mark skilton july 15   bright talk v2
The connected economy mark skilton july 15 bright talk v2
 
Digital Transformation: Connected API Ecosystems
Digital Transformation: Connected API EcosystemsDigital Transformation: Connected API Ecosystems
Digital Transformation: Connected API Ecosystems
 
Project Management 2.0
Project Management 2.0Project Management 2.0
Project Management 2.0
 
Humanity 2.0
Humanity 2.0Humanity 2.0
Humanity 2.0
 
New Zealand: Proactively Preparing for a More Sustainable Future
New Zealand: Proactively Preparing for a More Sustainable FutureNew Zealand: Proactively Preparing for a More Sustainable Future
New Zealand: Proactively Preparing for a More Sustainable Future
 
IR-website. Investor Relations. What to do online? Nov 2010 - eng
IR-website. Investor Relations. What to do online? Nov 2010 - engIR-website. Investor Relations. What to do online? Nov 2010 - eng
IR-website. Investor Relations. What to do online? Nov 2010 - eng
 
World Economic Forum Global Risks 2015 Report - A Review
World Economic Forum Global Risks 2015 Report - A ReviewWorld Economic Forum Global Risks 2015 Report - A Review
World Economic Forum Global Risks 2015 Report - A Review
 
A Collective, merit-based approach to Managing Workforce Adjustment, Canada
A Collective, merit-based approach to Managing Workforce Adjustment, CanadaA Collective, merit-based approach to Managing Workforce Adjustment, Canada
A Collective, merit-based approach to Managing Workforce Adjustment, Canada
 
National Trade Facilitation Strategy and Roadmap
National Trade Facilitation Strategy and RoadmapNational Trade Facilitation Strategy and Roadmap
National Trade Facilitation Strategy and Roadmap
 

Ähnlich wie Data Management for Citizen Science

Data Is Eating The World
Data Is Eating The WorldData Is Eating The World
Data Is Eating The WorldUday Kumar
 
E-Metrics: Embrace the Data, Change the World
E-Metrics:  Embrace the Data, Change the WorldE-Metrics:  Embrace the Data, Change the World
E-Metrics: Embrace the Data, Change the WorldBeth Kanter
 
Foundation Center DC
Foundation Center DCFoundation Center DC
Foundation Center DCBeth Kanter
 
Data-Ed: Unlock Business Value through Data Governance
Data-Ed: Unlock Business Value through Data GovernanceData-Ed: Unlock Business Value through Data Governance
Data-Ed: Unlock Business Value through Data GovernanceData Blueprint
 
DataEd Online: Unlock Business Value through Data Governance
DataEd Online: Unlock Business Value through Data GovernanceDataEd Online: Unlock Business Value through Data Governance
DataEd Online: Unlock Business Value through Data GovernanceDATAVERSITY
 
Allstate Foundation
Allstate FoundationAllstate Foundation
Allstate FoundationBeth Kanter
 
ETE 2013: Going Big with Big Data...one step at a time
ETE 2013:  Going Big with Big Data...one step at a timeETE 2013:  Going Big with Big Data...one step at a time
ETE 2013: Going Big with Big Data...one step at a timeAnita Andrews
 
Getting Started in Data Science
Getting Started in Data ScienceGetting Started in Data Science
Getting Started in Data ScienceThinkful
 
Career in Data Science (July 2017, DTLA)
Career in Data Science (July 2017, DTLA)Career in Data Science (July 2017, DTLA)
Career in Data Science (July 2017, DTLA)Thinkful
 
Key Notes Slides
Key Notes SlidesKey Notes Slides
Key Notes SlidesBeth Kanter
 
Getting started in Data Science (April 2017, Los Angeles)
Getting started in Data Science (April 2017, Los Angeles)Getting started in Data Science (April 2017, Los Angeles)
Getting started in Data Science (April 2017, Los Angeles)Thinkful
 
Data science and ethics in fundraising
Data science and ethics in fundraisingData science and ethics in fundraising
Data science and ethics in fundraisingJames Orton
 
Cisco Foundation Presentation
Cisco Foundation PresentationCisco Foundation Presentation
Cisco Foundation PresentationBeth Kanter
 
Curlew Research Brussels 2014 Electronic Data & Knowledge Management
Curlew Research Brussels 2014 Electronic Data & Knowledge ManagementCurlew Research Brussels 2014 Electronic Data & Knowledge Management
Curlew Research Brussels 2014 Electronic Data & Knowledge ManagementNick Lynch
 
Compasspoint Measurement Workshop
Compasspoint Measurement WorkshopCompasspoint Measurement Workshop
Compasspoint Measurement WorkshopBeth Kanter
 
Managing data responsibly to enable research interity
Managing data responsibly to enable research interityManaging data responsibly to enable research interity
Managing data responsibly to enable research interityIUPUI
 
Creating a data driven culture
Creating a data driven cultureCreating a data driven culture
Creating a data driven culturePoojitha B
 
Drinking from the Digital Data Fire Hose
Drinking from the Digital Data Fire HoseDrinking from the Digital Data Fire Hose
Drinking from the Digital Data Fire HoseGigi Johnson
 

Ähnlich wie Data Management for Citizen Science (20)

Data Is Eating The World
Data Is Eating The WorldData Is Eating The World
Data Is Eating The World
 
E-Metrics: Embrace the Data, Change the World
E-Metrics:  Embrace the Data, Change the WorldE-Metrics:  Embrace the Data, Change the World
E-Metrics: Embrace the Data, Change the World
 
Foundation Center DC
Foundation Center DCFoundation Center DC
Foundation Center DC
 
Data-Ed: Unlock Business Value through Data Governance
Data-Ed: Unlock Business Value through Data GovernanceData-Ed: Unlock Business Value through Data Governance
Data-Ed: Unlock Business Value through Data Governance
 
DataEd Online: Unlock Business Value through Data Governance
DataEd Online: Unlock Business Value through Data GovernanceDataEd Online: Unlock Business Value through Data Governance
DataEd Online: Unlock Business Value through Data Governance
 
Allstate Foundation
Allstate FoundationAllstate Foundation
Allstate Foundation
 
ETE 2013: Going Big with Big Data...one step at a time
ETE 2013:  Going Big with Big Data...one step at a timeETE 2013:  Going Big with Big Data...one step at a time
ETE 2013: Going Big with Big Data...one step at a time
 
Getting Started in Data Science
Getting Started in Data ScienceGetting Started in Data Science
Getting Started in Data Science
 
Career in Data Science (July 2017, DTLA)
Career in Data Science (July 2017, DTLA)Career in Data Science (July 2017, DTLA)
Career in Data Science (July 2017, DTLA)
 
Key Notes Slides
Key Notes SlidesKey Notes Slides
Key Notes Slides
 
Getting started in Data Science (April 2017, Los Angeles)
Getting started in Data Science (April 2017, Los Angeles)Getting started in Data Science (April 2017, Los Angeles)
Getting started in Data Science (April 2017, Los Angeles)
 
Data science and ethics in fundraising
Data science and ethics in fundraisingData science and ethics in fundraising
Data science and ethics in fundraising
 
Cisco Foundation Presentation
Cisco Foundation PresentationCisco Foundation Presentation
Cisco Foundation Presentation
 
Curlew Research Brussels 2014 Electronic Data & Knowledge Management
Curlew Research Brussels 2014 Electronic Data & Knowledge ManagementCurlew Research Brussels 2014 Electronic Data & Knowledge Management
Curlew Research Brussels 2014 Electronic Data & Knowledge Management
 
Compasspoint Measurement Workshop
Compasspoint Measurement WorkshopCompasspoint Measurement Workshop
Compasspoint Measurement Workshop
 
Managing data responsibly to enable research interity
Managing data responsibly to enable research interityManaging data responsibly to enable research interity
Managing data responsibly to enable research interity
 
Self-Service Analytics
Self-Service AnalyticsSelf-Service Analytics
Self-Service Analytics
 
Creating a data driven culture
Creating a data driven cultureCreating a data driven culture
Creating a data driven culture
 
Drinking from the Digital Data Fire Hose
Drinking from the Digital Data Fire HoseDrinking from the Digital Data Fire Hose
Drinking from the Digital Data Fire Hose
 
Make data more human
Make data more humanMake data more human
Make data more human
 

Mehr von Andrea Wiggins

With Great Data Comes Great Responsibility
With Great Data Comes Great ResponsibilityWith Great Data Comes Great Responsibility
With Great Data Comes Great ResponsibilityAndrea Wiggins
 
Mechanisms for Data Quality and Validation in Citizen Science
Mechanisms for Data Quality and Validation in Citizen ScienceMechanisms for Data Quality and Validation in Citizen Science
Mechanisms for Data Quality and Validation in Citizen ScienceAndrea Wiggins
 
Open Source & Citizen Science
Open Source & Citizen ScienceOpen Source & Citizen Science
Open Source & Citizen ScienceAndrea Wiggins
 
From Conservation to Crowdsourcing: A Typology of Citizen Science
From Conservation to Crowdsourcing: A Typology of Citizen ScienceFrom Conservation to Crowdsourcing: A Typology of Citizen Science
From Conservation to Crowdsourcing: A Typology of Citizen ScienceAndrea Wiggins
 
Motivation by Design: Technologies, Experiences, and Incentives
Motivation by Design: Technologies, Experiences, and IncentivesMotivation by Design: Technologies, Experiences, and Incentives
Motivation by Design: Technologies, Experiences, and IncentivesAndrea Wiggins
 
Secondary data analysis with digital trace data
Secondary data analysis with digital trace dataSecondary data analysis with digital trace data
Secondary data analysis with digital trace dataAndrea Wiggins
 
Reclassifying Success and Tragedy in FLOSS Projects
Reclassifying Success and Tragedy in FLOSS ProjectsReclassifying Success and Tragedy in FLOSS Projects
Reclassifying Success and Tragedy in FLOSS ProjectsAndrea Wiggins
 
Intellectual Diversity in the iSchools: Past, Present and Future
Intellectual Diversity in the iSchools: Past, Present and FutureIntellectual Diversity in the iSchools: Past, Present and Future
Intellectual Diversity in the iSchools: Past, Present and FutureAndrea Wiggins
 
Distributed Scientific Collaboration: Research Opportunities in Citizen Science
Distributed Scientific Collaboration: Research Opportunities in Citizen ScienceDistributed Scientific Collaboration: Research Opportunities in Citizen Science
Distributed Scientific Collaboration: Research Opportunities in Citizen ScienceAndrea Wiggins
 
Designing Virtual Organizations for Citizen Science
Designing Virtual Organizations for Citizen ScienceDesigning Virtual Organizations for Citizen Science
Designing Virtual Organizations for Citizen ScienceAndrea Wiggins
 
National Park System Property Designations
National Park System Property DesignationsNational Park System Property Designations
National Park System Property DesignationsAndrea Wiggins
 
Collaborative Data Analysis with Taverna Workflows
Collaborative Data Analysis with Taverna WorkflowsCollaborative Data Analysis with Taverna Workflows
Collaborative Data Analysis with Taverna WorkflowsAndrea Wiggins
 
Tales of the Field: Building Small Science Cyberinfrastructure
Tales of the Field: Building Small Science CyberinfrastructureTales of the Field: Building Small Science Cyberinfrastructure
Tales of the Field: Building Small Science CyberinfrastructureAndrea Wiggins
 
Coordination Dynamics in Free/Libre and Open Source Software
Coordination Dynamics in Free/Libre and Open Source SoftwareCoordination Dynamics in Free/Libre and Open Source Software
Coordination Dynamics in Free/Libre and Open Source SoftwareAndrea Wiggins
 
Heartbeat: Measuring Active User Base and Potential User Interest
Heartbeat: Measuring Active User Base and Potential User InterestHeartbeat: Measuring Active User Base and Potential User Interest
Heartbeat: Measuring Active User Base and Potential User InterestAndrea Wiggins
 
Replicating FLOSS Research as eResearch
Replicating FLOSS Research as eResearchReplicating FLOSS Research as eResearch
Replicating FLOSS Research as eResearchAndrea Wiggins
 
Social dynamics of FLOSS team communication across channels
Social dynamics of FLOSS team communication across channelsSocial dynamics of FLOSS team communication across channels
Social dynamics of FLOSS team communication across channelsAndrea Wiggins
 
eResearch workflows for studying free and open source software development
eResearch workflows for studying free and open source software developmenteResearch workflows for studying free and open source software development
eResearch workflows for studying free and open source software developmentAndrea Wiggins
 

Mehr von Andrea Wiggins (18)

With Great Data Comes Great Responsibility
With Great Data Comes Great ResponsibilityWith Great Data Comes Great Responsibility
With Great Data Comes Great Responsibility
 
Mechanisms for Data Quality and Validation in Citizen Science
Mechanisms for Data Quality and Validation in Citizen ScienceMechanisms for Data Quality and Validation in Citizen Science
Mechanisms for Data Quality and Validation in Citizen Science
 
Open Source & Citizen Science
Open Source & Citizen ScienceOpen Source & Citizen Science
Open Source & Citizen Science
 
From Conservation to Crowdsourcing: A Typology of Citizen Science
From Conservation to Crowdsourcing: A Typology of Citizen ScienceFrom Conservation to Crowdsourcing: A Typology of Citizen Science
From Conservation to Crowdsourcing: A Typology of Citizen Science
 
Motivation by Design: Technologies, Experiences, and Incentives
Motivation by Design: Technologies, Experiences, and IncentivesMotivation by Design: Technologies, Experiences, and Incentives
Motivation by Design: Technologies, Experiences, and Incentives
 
Secondary data analysis with digital trace data
Secondary data analysis with digital trace dataSecondary data analysis with digital trace data
Secondary data analysis with digital trace data
 
Reclassifying Success and Tragedy in FLOSS Projects
Reclassifying Success and Tragedy in FLOSS ProjectsReclassifying Success and Tragedy in FLOSS Projects
Reclassifying Success and Tragedy in FLOSS Projects
 
Intellectual Diversity in the iSchools: Past, Present and Future
Intellectual Diversity in the iSchools: Past, Present and FutureIntellectual Diversity in the iSchools: Past, Present and Future
Intellectual Diversity in the iSchools: Past, Present and Future
 
Distributed Scientific Collaboration: Research Opportunities in Citizen Science
Distributed Scientific Collaboration: Research Opportunities in Citizen ScienceDistributed Scientific Collaboration: Research Opportunities in Citizen Science
Distributed Scientific Collaboration: Research Opportunities in Citizen Science
 
Designing Virtual Organizations for Citizen Science
Designing Virtual Organizations for Citizen ScienceDesigning Virtual Organizations for Citizen Science
Designing Virtual Organizations for Citizen Science
 
National Park System Property Designations
National Park System Property DesignationsNational Park System Property Designations
National Park System Property Designations
 
Collaborative Data Analysis with Taverna Workflows
Collaborative Data Analysis with Taverna WorkflowsCollaborative Data Analysis with Taverna Workflows
Collaborative Data Analysis with Taverna Workflows
 
Tales of the Field: Building Small Science Cyberinfrastructure
Tales of the Field: Building Small Science CyberinfrastructureTales of the Field: Building Small Science Cyberinfrastructure
Tales of the Field: Building Small Science Cyberinfrastructure
 
Coordination Dynamics in Free/Libre and Open Source Software
Coordination Dynamics in Free/Libre and Open Source SoftwareCoordination Dynamics in Free/Libre and Open Source Software
Coordination Dynamics in Free/Libre and Open Source Software
 
Heartbeat: Measuring Active User Base and Potential User Interest
Heartbeat: Measuring Active User Base and Potential User InterestHeartbeat: Measuring Active User Base and Potential User Interest
Heartbeat: Measuring Active User Base and Potential User Interest
 
Replicating FLOSS Research as eResearch
Replicating FLOSS Research as eResearchReplicating FLOSS Research as eResearch
Replicating FLOSS Research as eResearch
 
Social dynamics of FLOSS team communication across channels
Social dynamics of FLOSS team communication across channelsSocial dynamics of FLOSS team communication across channels
Social dynamics of FLOSS team communication across channels
 
eResearch workflows for studying free and open source software development
eResearch workflows for studying free and open source software developmenteResearch workflows for studying free and open source software development
eResearch workflows for studying free and open source software development
 

Kürzlich hochgeladen

Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 

Kürzlich hochgeladen (20)

Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 

Data Management for Citizen Science

  • 1. Data Management for Citizen Science Challenges & Opportunities for USGS Leadership Andrea Wiggins Postdoctoral Fellow DataONE & Cornell Lab of Ornithology 12 September, 2012 USGS CDI Citizen Science workshop
  • 2. DataONE PPSR Working Group Purpose: • Improve quality, quantity, and accessibility of PPSR data • Advance integration of PPSR data in conventional science Products: • Data Management Guide for PPSR - coming soon! • Articles in August FREE special issue • Data quality & validation paper 2
  • 3. How long will it What is a data take to get management enough data? plan? Plan Analyze Collect How can I assure quality of volunteers’ What tools data? do I use? Integrate Assure What data about volunteers should I Who can help keep or share? me? Discover Describe Preserve Should I share What if the data are raw data with used for commercial known errors? profit?
  • 4. How long will it What is a data take to get management enough data? plan? Plan Analyze Collect How can I assure quality of What tools volunteers’ data? do I use? Integrate Assure What data about volunteers should Who can help I keep or share? me? Discover Describe Preserve Should I share What if the data are raw data with used for commercial known errors? profit?
  • 5. Citizen science data challenges Data policies Cyberinfrastructure Data quality 5
  • 6. Policy? What policy? Data policies = boring http://www.flickr.com/photos/escapist/107455718/ 6
  • 7. Policy? What policy? Data policies = boring Data policies = hard • Ownership, sharing, use, access, challenge, etc. • Lots of decisions, vague consequences 7
  • 8. Policy? What policy? Data policies = boring Data policies = hard • Ownership, sharing, use, access, challenge, etc. • Lots of decisions, vague consequences Need examples of carefully crafted policies • Story of the data + policy that resulted • USGS is way ahead of the game! 8
  • 10. Cyberinfrastructure Technology is a major pain point Platforms needed • Transcription, observation, processing • Ongoing support & development required 10
  • 11. Cyberinfrastructure Technology is a major pain point Platforms needed • Transcription, observation, processing • Ongoing support & development required Who is going to pay? • <insert sound of crickets here> http://www.flickr.com/photos/gravitywave/1303504847/ 11
  • 12. Data quality perceptions No more reinvention • The data are as good as your project design • Reuse protocols & technologies • Replicability -> reliability 12
  • 13. Data quality perceptions No more reinvention • The data are as good as your project design • Reuse protocols & technologies • Replicability -> reliability No more excuses • All scientific data have errors • Our data are just like yours...except we have more friends • Document data collection & QA/QC in excruciating detail 13
  • 15. Survey says... Least satisfied with current: • Process for sharing project data with colleagues, researchers, and/or participants • Ways of presenting project data/results to participants 15
  • 16. Survey says... Least satisfied with current: • Process for sharing project data with colleagues, researchers, and/or participants • Ways of presenting project data/results to participants Better data management planning than average • 1/3 had NO data management plan at all! • Government-funded projects: yes, for some data 16
  • 17. Survey says... Tools & resources strongly desired across categories, especially: • Analyzing & visualizing data • Documenting & describing data • Training 17
  • 18. Survey says... Tools & resources strongly desired across categories, especially: • Analyzing & visualizing data • Documenting & describing data • Training Top priorities for improvement (high agreement) 1. Analyzing & visualizing data 2. Documenting & describing data 3. Long-term storage 4. Establishing & updating data policies 18
  • 20. Leading the way Be an exemplar in data sharing & community building 20
  • 21. Leading the way Be an exemplar in data sharing & community building Make your data policies easy to find & emulate 21
  • 22. Leading the way Be an exemplar in data sharing & community building Make your data policies easy to find & emulate Share your platforms with everyone, not just New Zealand! 22
  • 23. Leading the way Be an exemplar in data sharing & community building Make your data policies easy to find & emulate Share your platforms with everyone, not just New Zealand! Make data quality obvious 23
  • 24. Leading the way Be an exemplar in data sharing & community building Make your data policies easy to find & emulate Share your platforms with everyone, not just New Zealand! Make data quality obvious USGS brings more credibility to citizen science 24

Hinweis der Redaktion

  1. When it comes to the data life cycle that Bill mentioned yesterday, many scientists are grappling with questions about data management. Questions like... [READ OFF] These are just a few questions out of many that PPSR project leaders have discussed with me, but as you might have noticed, most of them are questions that are equally applicable to conventional scientific research.
  2. In fact, the only thing I can see that is truly unique about PPSR data is the involvement of volunteers. At the end of the day, data is data. So I hope it comes as some comfort for everyone here to know that there ’ s nothing unusual in these challenges, with the exception of needing to manage aspects of the data that are directly related to volunteers.