SlideShare ist ein Scribd-Unternehmen logo
1 von 40
Downloaden Sie, um offline zu lesen
Intro to Open Data 
for OpenCon 2014 
November 4, 2014 
Ross Mounce, Ph.D. (@RMounce) 
Postdoc, University of Bath
These slides are on Slideshare here: 
bit.ly/opendataintro 
All textual content is
With thanks to SPARC & R2RC for organising this webcast series 
Previous talks in this series have been given on 
Open Access http://youtu.be/5YwASIziPIQ 
and 
Open Educational Resources http://youtu.be/5Dauh_PeAzI
Outline 
●Why care about data? 
●Definition: exactly what is open data? 
●Open data in scholarship, and beyond... 
● Aspects of open data, inc. HowTo 
● Legal , Technical 
● Privacy! Not all data should be be open 
● The 5 stars of open data
Why care about data? 
Data is absolutely fundamental to most research 
Science without data isn't science * 
*Entirely theoretical, data-free contributions to science are possible, but rare
By sharing data we can see further 
Data are the building blocks of 
science 
Shared, re-used data allow us to 
more rigorously test hypotheses; 
“to see further” 
...and to do it all more quickly and 
easily. 
Also, for validation & reproducibility 
(more of which later...)
What exactly is open data? 
Open means anyone can 
freely access, use, modify, 
and share for any purpose 
(subject, at most, to requirements 
that preserve provenance and 
openness) 
From http://opendefinition.org/, 
see http://opendefinition.org/od/ for more detail
Legally, what is open data? 
There are a great many open knowledge definition (OKD) conformant licences, 
including: 
CC0 
http://creativecommons.org/ 
publicdomain/zero/1.0/ 
CC BY 
https://creativecommons.org/ 
licenses/by/4.0/ 
CC BY-SA 
https://creativecommons.org/ 
licenses/by-sa/4.0/ 
See here for the comprehensive list: http://opendefinition.org/licenses/
Not all Creative Commons licences are 
'open' 
} NC -- You “may not use this work for 
commercial purposes”. 
Work under this licence cannot be used 
for any purpose, therefore it is not open. 
Can have significant, often unexpected 
negative impact on potential re-use. 
ND -- “No Derivative Works”. 
Work under this licence cannot be 
adapted if it is re-used. Not very helpful 
for research! 
NC & ND – An extremely restrictive re-use 
licence, neither commercial purposes nor 
adaptations are allowed.
Non-open licencing causes real 
problems for research & education 
The Creative Commons non-commercial (-NC) restriction is poorly defined in 
most jurisdictions, and even more poorly understood by many of its users. 
“non-commercial” != “non-profit” 
A) Non-commercial actually excludes many teaching purposes: 
In the UK, university students typically pay expensive tuition fees to attend. 
Thus university teaching is often a commercial activity, -NC restricted 
materials cannot be used to teach students in these circumstances. 
B) Licence incompatibility – NC licences are not compatible with licences 
used on major collaboration platforms like Wikipedia or Wikimedia Commons 
C) Non-commercial organizations (e.g. Deutschlandradio) 
have been successfully sued for re-using CC BY-NC 
content without permission. 
http://zookeys.pensoft.net/articles.php?id=3036 
https://www.techdirt.com/articles/20140326/11405526695/german-court-says-creat 
ive-commons-non-commercial-licenses-must-be-purely-personal-use.shtml
Real problems of non-open data: 
GBIF & biodiversity data 
Desmet, P. (2013) Showing you this map of aggregated bullfrog occurrences 
would be illegal http://peterdesmet.com/posts/illegal-bullfrogs.html
Open data in scholarship, and beyond 
The open data movement is much broader than just academia/research 
It's been successful & popular in areas like open government data: 
For transparency, detecting & discouraging corruption 
For releasing social & commercial value (governments collect a lot of data 
already, why not make wider use of it, at little or no extra cost?) 
For participatory governance – 
citizens can be more informed, a “read/write” society 
Each of these has clear parallels with open 
research data: transparency & fraud detection, 
extra value through research data re-use, 
participatory citizen science 
Some text adapted from http://opengovernmentdata.org/
Open data in scholarship, and beyond 
Similarly, and with some overlap to open research data, 
there's the open GLAM movement 
(GLAM = Galleries, Libraries, Archives & Museums) 
In this case, their data is typically collections metadata 
but also digital images of their collections 
See http://openglam.org/ for more
Technical aspects of open data 
So, you understand the imporance of licensing... 
What next? 
How best can we make our data openly available? 
Where should I upload to? 
What format(s) should I make the data available in?
Data Standards & Data File Formats 
Adhere to existing standards!
Data Standards & Data File Formats 
Take note of community standards: 
e.g. the Bermuda Principles for sharing DNA seq. data 
● Automatic release of sequence 
assemblies larger than 1 kb 
(preferably within 24 hours). 
● Immediate publication of finished 
annotated sequences. 
● Aim to make the entire sequence 
freely available in the public domain
Data Standards & Data File Formats 
If there are no formally agreed community standards, 
canvas the community to create/formalise a standard 
e.g. Best Practices for Data Sharing in Phylogenetic Research 
(2014) PLOS Currents Tree of Life 
e.g. The 1st Open Economics International Workshop 
(Cambridge, 2013) bringing together academic 
economists from around the world to discuss data 
sharing in economics research.
Data Standards & Data File Formats 
If there are multiple, competing file formats: 
Opt for file formats based on open standards 
https://en.wikipedia.org/wiki/Open_standard 
e.g. 
Avoid proprietary formats 
https://en.wikipedia.org/wiki/Proprietary_format 
e.g.
Data Standards & Data File Formats 
A real example: recent creation of a new data 
standard for exchange of 3-dimensional reconstruction 
of objects from tomographic imaging data 
SPIERS software 
+ VAXML data standard 
Sutton et al (2012) SPIERS and VAXML: A software 
toolkit for tomographic visualisation and a format for 
virtual specimen interchange. 
Palaeontologia Electronica
Where to upload open data? 
http://www.crystallography.net/ 
Genbank, 
SRA, 
1000's more!
'Data paper' journals 
http://www.mdpi.com/journal/data/about
Your data is NOT 'too big' to share 
http://gigadb.org/dataset/100124 
39 Gigabytes (GB) 
of MRI scans
Intelligent data papers allow databases 
to automatically pull-in your data 
Many publishers (e.g. Pensoft) intelligently 
markup data papers so that the data can be 
automatically ingested into appropriate db's 
on the day of publication! 
Data 
data
Data sharing benefits authors & re-users 
Piwowar HA, Vision TJ. (2013) 
Data reuse and the open data 
citation advantage. PeerJ 
1:e175 
“...open data citation 
benefit for this sample 
to be 9%” 
relative to papers 
providing no public 
data, for gene 
expression microarray 
data 
10.7717/peerj.175/fig-2 
See also previous work by 
Piwowar: 
10.1371/journal.pone.0000308
Those who share data, do better science 
Wicherts, J. M., Bakker, M. & Molenaar, D. (2011) 
Willingness to share research data is related to the 
strength of the evidence and the quality of reporting of 
statistical results. PLoS ONE 6, e26828+ URL 
http://dx.doi.org/10.1371/journal.pone.0026828 
The authors examined psychological papers for the quality of statistical 
reporting & asked the authors of those papers for the full data underlying 
the reported results. Generally, those who shared, had more statistically 
robust, reproducible results.
“Email the author for data” - doesnt work 
A well-known problem, which 
I myself have also faced 
many times!!! 
Many legacy journals 
unfortunately still pretend 
that “email the author” is 
still acceptable. 
Wicherts JM, Borsboom D, 
Kats J, Molenaar D (2006) 
The poor availability of 
psychological research 
data for reanalysis. 
American Psychologist 61: 
726–728 link
Best practice open data is time consuming 
(but still worth the extra effort!) 
Emilio M. Bruna recently provided an estimate of the amount of 
time it took him to prepare & upload open data related to 
publication to figshare & dryad. 
11 
Hours 
& $90 
(for Dryad) 
Providing open-source code was the most time consuming part (25.5 hours), 
and Open Access publication the most expensive ($600). 
http://brunalab.org/blog/2014/09/04/the-opportunity-cost-of-my-openscience-was-35- 
hours-690/
Not all data should be open! 
Obviously, there are some types of 
data which should NOT be made 
mandatorily open e.g. sensitive 
medical data 
However, with informed consent, if patients really want to, 
they should be allowed to publish their own medical data
But with informed consent, sharing 
sensitive health data can have good 
outcomes 
http://blog.ted.com/2013/06/14/why-i-opensourced-cures-for-my-cancer-sa 
lvatore-iaconesi-at-tedglobal-2013/
Other exceptions to the open default 
Sensitive species conservation data 
e.g. exact geocoordinates of home range 
Certain species of wild orchids, cacti & carnivorous plants 
are highly endangered by illegal harvesting. 
Publishing the exact geolocation data of the remaining 
populations of commercially-desirable, endangered 
species is really dumb thing to do. 
Such data is typically held privately in databases (not 
publicly available).
The 5 stars of open data 
Most research data would get 
ZERO (not available online) 
Or just ONE star 
http://5stardata.info/
3-star open research data is achievable 
This is where research data publication 
should be aiming for in the short term. 
Publishing .csv / non-proprietary open data is 
NOT actually that hard! 
http://5stardata.info/
Open Access & Open Data have similar goals 
80% of research is publicly 
funded 
1 
e.g. maximising the return on investment, provided to research by 
taxpayers and charities 
Source: “Academic Publishing: Survey of funders supports the benign Open Access outcome priced into shares, HSBC Global Research,” 
February 11, 2013: https://www.research.hsbc.com/midas/Res/RDV?ao=20&key=RxArFbnG1P&n=360010.PDF
Further Reading 
1.Editor’s Introduction - Samuel A. Moore 
2.Open Content Mining - Peter Murray-Rust, 
Jennifer C. Molloy, Diane Cabell 
3.The Need to Humanize Open Science - Eric C. 
Kansa 
4.Data Sharing in a Humanitarian Organization: The 
Experience of Médecins Sans Frontières - Unni 
Karunakara 
5.Why Open Drug Discovery Needs Four Simple 
Rules for Licensing Data and Models - Antony J. 
Williams, John Wilbanks, Sean Ekins 
6.Open Data in the Earth and Climate Sciences - 
Sarah Callaghan 
7.Open Minded Psychology - Wouter van den Bos, 
Mirjam Jenny, Dirk Wulff 
8.Open Data in Health Sciences - Tom Pollard 
9.Open Research Data in Economics - Velichka 
Dimitrova 
10.Open Data and Palaeontology - Ross Mounce 
Out soon this November, 
2014 
Published by Ubiquity 
Press
Further Reading 
● The Open Data Handbook - http://opendatahandbook.org/ 
● 5 star Open Data - http://5stardata.info/ 
● Science as an open enterprise (2012) A Royal Society report 
● Caetano, D. S. & Aisenberg, A. 2014 Forgotten treasures: the fate of data in animal 
behaviour studies Animal Behaviour 
Data sharing in phylogenetics 
● Magee et al 2014 The Dawn of Open Access to Phylogenetic Data PLOS ONE 
● Drew et al 2013 Lost Branches on the Tree of Life. PLOS Biology 
● Stoltzfus et al 2012 Sharing and re-use of phylogenetic trees (and associated data) 
to facilitate synthesis. BMC Research Notes 
On licencing & legal issues with re-use 
● Hagedorn et al 2011 Creative commons licenses and the non-commercial 
condition: Implications for the re-use of biodiversity information. ZooKeys 
● Mounce 2012. Life as a palaeontologist: Academia, the internet and creative 
commons. Palaeontology Online 
● Klimpel, P. 2012 Consequences, Risks, and side-effects of the license module 
Non-Commercial – NC [PDF]
Further Reading 
● Murray-Rust, P. Open data in science. Serials Review 34, 52-64 (2008). URL 
http://dx.doi.org/10.1016/j.serrev.2008.01.001 
● Leonelli, S., Smirnoff, N., Moore, J., Cook, C. & Bastow, R. Making open data work 
for plant scientists. Journal of Experimental Botany 64, 4109-4117 (2013). URL 
http://dx.doi.org/10.1093/jxb/ert273 
● Hrynaszkiewicz, I. & Cockerill, M. Open by default: a proposed copyright license 
and waiver agreement for open access research and data in peer-reviewed 
journals. BMC Research Notes 5, 494+ (2012). URL 
http://dx.doi.org/10.1186/1756-0500-5-494 
● Boulton, G., Rawlins, M., Vallance, P. & Walport, M. Science as a public enterprise: 
the case for open data. The Lancet 377, 1633-1635 (2011). URL 
http://dx.doi.org/10.1016/s0140-6736(11)60647-8 
● Parr, C. S. Open sourcing ecological data. BioScience 57, 309-310 (2007). URL 
http://dx.doi.org/10.1641/b570402 
● Poisot, T., Mounce, R. & Gravel, D. Moving toward a sustainable ecological 
science: don't let data go to waste! Ideas in Ecology and Evolution 6 (2013). URL 
http://dx.doi.org/10.4033/iee.2013.6b.14.f
November 15-17, 2014 
Washington, D.C. & Online 
www.opencon2014.org 
@open_con #opencon2014 
See you there if you're lucky enough to be going!
Thank you! 
Happy to answer all questions 
ross@righttoresearch.org 
@RMounce 
www.righttoresearch.org 
www.sparc.arl.org

Weitere ähnliche Inhalte

Was ist angesagt?

Open Access and Research Integrity Workshop Introduction - 2014
Open Access and Research Integrity Workshop Introduction - 2014Open Access and Research Integrity Workshop Introduction - 2014
Open Access and Research Integrity Workshop Introduction - 2014Right to Research
 
CUA Humanities Lecture on Scholarly Communications LSC634 Fall2014
CUA Humanities Lecture on Scholarly Communications LSC634 Fall2014CUA Humanities Lecture on Scholarly Communications LSC634 Fall2014
CUA Humanities Lecture on Scholarly Communications LSC634 Fall2014Kimberly Hoffman
 
From DARPA to Shakespeare: All the Data we Can Handle
From DARPA to Shakespeare: All the Data we Can Handle From DARPA to Shakespeare: All the Data we Can Handle
From DARPA to Shakespeare: All the Data we Can Handle Kimberly Hoffman
 
Open access for researchers, policy makers and research managers - Short ver...
Open access  for researchers, policy makers and research managers - Short ver...Open access  for researchers, policy makers and research managers - Short ver...
Open access for researchers, policy makers and research managers - Short ver...Iryna Kuchma
 
Illustrative Global Endorsements of my article Future of Ethically Effective ...
Illustrative Global Endorsements of my article Future of Ethically Effective ...Illustrative Global Endorsements of my article Future of Ethically Effective ...
Illustrative Global Endorsements of my article Future of Ethically Effective ...Chaudhary Imran Sarwar
 
Fsci 2018 thursday2_august_am6
Fsci 2018 thursday2_august_am6Fsci 2018 thursday2_august_am6
Fsci 2018 thursday2_august_am6ARDC
 
The world of research data: when should data be closed, shared or open
The world of research data: when should data be closed, shared or openThe world of research data: when should data be closed, shared or open
The world of research data: when should data be closed, shared or openheila1
 
SciDataCon 2014 Data Papers and their applications workshop - NPG Scientific ...
SciDataCon 2014 Data Papers and their applications workshop - NPG Scientific ...SciDataCon 2014 Data Papers and their applications workshop - NPG Scientific ...
SciDataCon 2014 Data Papers and their applications workshop - NPG Scientific ...Susanna-Assunta Sansone
 
La Ciencia Abierta en la práctica: infraestructuras y políticas en Europa, se...
La Ciencia Abierta en la práctica: infraestructuras y políticas en Europa, se...La Ciencia Abierta en la práctica: infraestructuras y políticas en Europa, se...
La Ciencia Abierta en la práctica: infraestructuras y políticas en Europa, se...Pedro Príncipe
 
Benefits and practice of open science
Benefits and practice of open scienceBenefits and practice of open science
Benefits and practice of open scienceSarah Jones
 
WikiFactMine: Science for Everyone
WikiFactMine: Science for EveryoneWikiFactMine: Science for Everyone
WikiFactMine: Science for Everyonepetermurrayrust
 
ContentMining and Copyright at CopyCamp2017
ContentMining and Copyright at CopyCamp2017ContentMining and Copyright at CopyCamp2017
ContentMining and Copyright at CopyCamp2017petermurrayrust
 
Open access for researchers and students, research managers and publishers
Open access  for researchers and students, research managers and publishersOpen access  for researchers and students, research managers and publishers
Open access for researchers and students, research managers and publishersIryna Kuchma
 
ContentMining for France and Europe; Lessons from 2 years in UK
ContentMining for France and Europe; Lessons from 2 years in UKContentMining for France and Europe; Lessons from 2 years in UK
ContentMining for France and Europe; Lessons from 2 years in UKpetermurrayrust
 
Everyday digital scholarship: Using web-based tools for research
Everyday digital scholarship: Using web-based tools for researchEveryday digital scholarship: Using web-based tools for research
Everyday digital scholarship: Using web-based tools for researchFrancesca Di Donato
 
Research data management: a tale of two paradigms:
Research data management: a tale of two paradigms: Research data management: a tale of two paradigms:
Research data management: a tale of two paradigms: Martin Donnelly
 
Open Access and Research Integrity Workshop 2014 - Advocacy
Open Access and Research Integrity Workshop 2014 - AdvocacyOpen Access and Research Integrity Workshop 2014 - Advocacy
Open Access and Research Integrity Workshop 2014 - AdvocacyRight to Research
 

Was ist angesagt? (20)

Open Access and Research Integrity Workshop Introduction - 2014
Open Access and Research Integrity Workshop Introduction - 2014Open Access and Research Integrity Workshop Introduction - 2014
Open Access and Research Integrity Workshop Introduction - 2014
 
CUA Humanities Lecture on Scholarly Communications LSC634 Fall2014
CUA Humanities Lecture on Scholarly Communications LSC634 Fall2014CUA Humanities Lecture on Scholarly Communications LSC634 Fall2014
CUA Humanities Lecture on Scholarly Communications LSC634 Fall2014
 
From DARPA to Shakespeare: All the Data we Can Handle
From DARPA to Shakespeare: All the Data we Can Handle From DARPA to Shakespeare: All the Data we Can Handle
From DARPA to Shakespeare: All the Data we Can Handle
 
Open access for researchers, policy makers and research managers - Short ver...
Open access  for researchers, policy makers and research managers - Short ver...Open access  for researchers, policy makers and research managers - Short ver...
Open access for researchers, policy makers and research managers - Short ver...
 
Information Sources for Research
Information Sources for ResearchInformation Sources for Research
Information Sources for Research
 
Illustrative Global Endorsements of my article Future of Ethically Effective ...
Illustrative Global Endorsements of my article Future of Ethically Effective ...Illustrative Global Endorsements of my article Future of Ethically Effective ...
Illustrative Global Endorsements of my article Future of Ethically Effective ...
 
Fsci 2018 thursday2_august_am6
Fsci 2018 thursday2_august_am6Fsci 2018 thursday2_august_am6
Fsci 2018 thursday2_august_am6
 
Power Searching on the Web Advanced: Finding Specialized Information
Power Searching on the Web Advanced: Finding Specialized InformationPower Searching on the Web Advanced: Finding Specialized Information
Power Searching on the Web Advanced: Finding Specialized Information
 
Power Searching on the Web: An 8-Fold Path
Power Searching on the Web: An 8-Fold PathPower Searching on the Web: An 8-Fold Path
Power Searching on the Web: An 8-Fold Path
 
The world of research data: when should data be closed, shared or open
The world of research data: when should data be closed, shared or openThe world of research data: when should data be closed, shared or open
The world of research data: when should data be closed, shared or open
 
SciDataCon 2014 Data Papers and their applications workshop - NPG Scientific ...
SciDataCon 2014 Data Papers and their applications workshop - NPG Scientific ...SciDataCon 2014 Data Papers and their applications workshop - NPG Scientific ...
SciDataCon 2014 Data Papers and their applications workshop - NPG Scientific ...
 
La Ciencia Abierta en la práctica: infraestructuras y políticas en Europa, se...
La Ciencia Abierta en la práctica: infraestructuras y políticas en Europa, se...La Ciencia Abierta en la práctica: infraestructuras y políticas en Europa, se...
La Ciencia Abierta en la práctica: infraestructuras y políticas en Europa, se...
 
Benefits and practice of open science
Benefits and practice of open scienceBenefits and practice of open science
Benefits and practice of open science
 
WikiFactMine: Science for Everyone
WikiFactMine: Science for EveryoneWikiFactMine: Science for Everyone
WikiFactMine: Science for Everyone
 
ContentMining and Copyright at CopyCamp2017
ContentMining and Copyright at CopyCamp2017ContentMining and Copyright at CopyCamp2017
ContentMining and Copyright at CopyCamp2017
 
Open access for researchers and students, research managers and publishers
Open access  for researchers and students, research managers and publishersOpen access  for researchers and students, research managers and publishers
Open access for researchers and students, research managers and publishers
 
ContentMining for France and Europe; Lessons from 2 years in UK
ContentMining for France and Europe; Lessons from 2 years in UKContentMining for France and Europe; Lessons from 2 years in UK
ContentMining for France and Europe; Lessons from 2 years in UK
 
Everyday digital scholarship: Using web-based tools for research
Everyday digital scholarship: Using web-based tools for researchEveryday digital scholarship: Using web-based tools for research
Everyday digital scholarship: Using web-based tools for research
 
Research data management: a tale of two paradigms:
Research data management: a tale of two paradigms: Research data management: a tale of two paradigms:
Research data management: a tale of two paradigms:
 
Open Access and Research Integrity Workshop 2014 - Advocacy
Open Access and Research Integrity Workshop 2014 - AdvocacyOpen Access and Research Integrity Workshop 2014 - Advocacy
Open Access and Research Integrity Workshop 2014 - Advocacy
 

Andere mochten auch

The PLUTo project @iEvoBio 2014
The PLUTo project @iEvoBio 2014The PLUTo project @iEvoBio 2014
The PLUTo project @iEvoBio 2014Ross Mounce
 
Open Research Data: Licensing | Standards | Future
Open Research Data: Licensing | Standards | FutureOpen Research Data: Licensing | Standards | Future
Open Research Data: Licensing | Standards | FutureRoss Mounce
 
Open scholarship [a FOSTER open science talk]
Open scholarship [a FOSTER open science talk]Open scholarship [a FOSTER open science talk]
Open scholarship [a FOSTER open science talk]Ross Mounce
 
Leveraging the power of the web - Rocky Mountain Advanced Computing Conference
Leveraging the power of the web - Rocky Mountain Advanced Computing Conference Leveraging the power of the web - Rocky Mountain Advanced Computing Conference
Leveraging the power of the web - Rocky Mountain Advanced Computing Conference Kaitlin Thaney
 
The State of Open Research Data
The State of Open Research DataThe State of Open Research Data
The State of Open Research DataRoss Mounce
 
Modern Tools & Rationales for 21st Century Research
Modern Tools & Rationales  for 21st Century ResearchModern Tools & Rationales  for 21st Century Research
Modern Tools & Rationales for 21st Century ResearchRoss Mounce
 
Museum impact: linking-up specimens with research published on them
Museum impact: linking-up specimens with research published on themMuseum impact: linking-up specimens with research published on them
Museum impact: linking-up specimens with research published on themRoss Mounce
 
How can repositories support the text-mining of their content and why?
How can repositories support the text-mining of their content and why? How can repositories support the text-mining of their content and why?
How can repositories support the text-mining of their content and why? Nancy Pontika
 
Subscription costs versus open access costs, & Dissolving journals' boundaries
Subscription costs versus open access costs, & Dissolving journals' boundariesSubscription costs versus open access costs, & Dissolving journals' boundaries
Subscription costs versus open access costs, & Dissolving journals' boundariesAlex Holcombe
 
SocialCite makes its debut at the HighWire Press meeting
SocialCite makes its debut at the HighWire Press meetingSocialCite makes its debut at the HighWire Press meeting
SocialCite makes its debut at the HighWire Press meetingKent Anderson
 
Specimen-level mining: bringing knowledge back 'home' to the Natural History ...
Specimen-level mining: bringing knowledge back 'home' to the Natural History ...Specimen-level mining: bringing knowledge back 'home' to the Natural History ...
Specimen-level mining: bringing knowledge back 'home' to the Natural History ...Ross Mounce
 
Open Access for Early Career Researchers
Open Access for Early Career ResearchersOpen Access for Early Career Researchers
Open Access for Early Career ResearchersRoss Mounce
 
Open software and knowledge for MIOSS
Open software and knowledge for MIOSSOpen software and knowledge for MIOSS
Open software and knowledge for MIOSSpetermurrayrust
 
Research publication support for scholars in Brazil: Rising to the challenge
Research publication support for scholars in Brazil: Rising to the challengeResearch publication support for scholars in Brazil: Rising to the challenge
Research publication support for scholars in Brazil: Rising to the challengeRon Martinez
 
Open Access: Which Side Are You On
Open Access: Which Side Are You OnOpen Access: Which Side Are You On
Open Access: Which Side Are You OnJill Cirasella
 
Fifty shades of green and gold: open access to scholarly information
Fifty shades of green and gold: open access to scholarly informationFifty shades of green and gold: open access to scholarly information
Fifty shades of green and gold: open access to scholarly informationhierohiero
 
Active actionable DMPs
Active actionable DMPsActive actionable DMPs
Active actionable DMPsSarah Jones
 

Andere mochten auch (18)

The PLUTo project @iEvoBio 2014
The PLUTo project @iEvoBio 2014The PLUTo project @iEvoBio 2014
The PLUTo project @iEvoBio 2014
 
Open Research Data: Licensing | Standards | Future
Open Research Data: Licensing | Standards | FutureOpen Research Data: Licensing | Standards | Future
Open Research Data: Licensing | Standards | Future
 
Open scholarship [a FOSTER open science talk]
Open scholarship [a FOSTER open science talk]Open scholarship [a FOSTER open science talk]
Open scholarship [a FOSTER open science talk]
 
Leveraging the power of the web - Rocky Mountain Advanced Computing Conference
Leveraging the power of the web - Rocky Mountain Advanced Computing Conference Leveraging the power of the web - Rocky Mountain Advanced Computing Conference
Leveraging the power of the web - Rocky Mountain Advanced Computing Conference
 
The State of Open Research Data
The State of Open Research DataThe State of Open Research Data
The State of Open Research Data
 
Modern Tools & Rationales for 21st Century Research
Modern Tools & Rationales  for 21st Century ResearchModern Tools & Rationales  for 21st Century Research
Modern Tools & Rationales for 21st Century Research
 
Open Access Publishing, Threat or Opportunity?
Open Access Publishing, Threat or Opportunity?Open Access Publishing, Threat or Opportunity?
Open Access Publishing, Threat or Opportunity?
 
Museum impact: linking-up specimens with research published on them
Museum impact: linking-up specimens with research published on themMuseum impact: linking-up specimens with research published on them
Museum impact: linking-up specimens with research published on them
 
How can repositories support the text-mining of their content and why?
How can repositories support the text-mining of their content and why? How can repositories support the text-mining of their content and why?
How can repositories support the text-mining of their content and why?
 
Subscription costs versus open access costs, & Dissolving journals' boundaries
Subscription costs versus open access costs, & Dissolving journals' boundariesSubscription costs versus open access costs, & Dissolving journals' boundaries
Subscription costs versus open access costs, & Dissolving journals' boundaries
 
SocialCite makes its debut at the HighWire Press meeting
SocialCite makes its debut at the HighWire Press meetingSocialCite makes its debut at the HighWire Press meeting
SocialCite makes its debut at the HighWire Press meeting
 
Specimen-level mining: bringing knowledge back 'home' to the Natural History ...
Specimen-level mining: bringing knowledge back 'home' to the Natural History ...Specimen-level mining: bringing knowledge back 'home' to the Natural History ...
Specimen-level mining: bringing knowledge back 'home' to the Natural History ...
 
Open Access for Early Career Researchers
Open Access for Early Career ResearchersOpen Access for Early Career Researchers
Open Access for Early Career Researchers
 
Open software and knowledge for MIOSS
Open software and knowledge for MIOSSOpen software and knowledge for MIOSS
Open software and knowledge for MIOSS
 
Research publication support for scholars in Brazil: Rising to the challenge
Research publication support for scholars in Brazil: Rising to the challengeResearch publication support for scholars in Brazil: Rising to the challenge
Research publication support for scholars in Brazil: Rising to the challenge
 
Open Access: Which Side Are You On
Open Access: Which Side Are You OnOpen Access: Which Side Are You On
Open Access: Which Side Are You On
 
Fifty shades of green and gold: open access to scholarly information
Fifty shades of green and gold: open access to scholarly informationFifty shades of green and gold: open access to scholarly information
Fifty shades of green and gold: open access to scholarly information
 
Active actionable DMPs
Active actionable DMPsActive actionable DMPs
Active actionable DMPs
 

Ähnlich wie The OpenCon Intro to Open Data

Data as a research output and a research asset: the case for Open Science/Sim...
Data as a research output and a research asset: the case for Open Science/Sim...Data as a research output and a research asset: the case for Open Science/Sim...
Data as a research output and a research asset: the case for Open Science/Sim...African Open Science Platform
 
Open Data in a Big Data World: easy to say, but hard to do?
Open Data in a Big Data World: easy to say, but hard to do?Open Data in a Big Data World: easy to say, but hard to do?
Open Data in a Big Data World: easy to say, but hard to do?LEARN Project
 
HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8Scott Edmunds
 
Nicole Nogoy: GigaScience...how licensing can change the way we do research
Nicole Nogoy: GigaScience...how licensing can change the way we do researchNicole Nogoy: GigaScience...how licensing can change the way we do research
Nicole Nogoy: GigaScience...how licensing can change the way we do researchGigaScience, BGI Hong Kong
 
Open science curriculum for students, June 2019
Open science curriculum for students, June 2019Open science curriculum for students, June 2019
Open science curriculum for students, June 2019Dag Endresen
 
A coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon HodsonA coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon HodsonAfrican Open Science Platform
 
Open Science Globally: Some Developments/Dr Simon Hodson
Open Science Globally: Some Developments/Dr Simon HodsonOpen Science Globally: Some Developments/Dr Simon Hodson
Open Science Globally: Some Developments/Dr Simon HodsonAfrican Open Science Platform
 
Nicole Nogoy's talk at eResearchNZ 2014: Improving data sharing, integration ...
Nicole Nogoy's talk at eResearchNZ 2014: Improving data sharing, integration ...Nicole Nogoy's talk at eResearchNZ 2014: Improving data sharing, integration ...
Nicole Nogoy's talk at eResearchNZ 2014: Improving data sharing, integration ...GigaScience, BGI Hong Kong
 
Open Data - strategies for research data management & impact of best practices
Open Data - strategies for research data management & impact of best practicesOpen Data - strategies for research data management & impact of best practices
Open Data - strategies for research data management & impact of best practicesMartin Donnelly
 
Acting as Advocate? Seven steps for libraries in the data decade
Acting as Advocate? Seven steps for libraries in the data decadeActing as Advocate? Seven steps for libraries in the data decade
Acting as Advocate? Seven steps for libraries in the data decadeLizLyon
 
Open science, open data - FOSTER training, Potsdam
Open science, open data - FOSTER training, PotsdamOpen science, open data - FOSTER training, Potsdam
Open science, open data - FOSTER training, PotsdamPlatforma Otwartej Nauki
 
Using Open Science to advance science - advancing open data
Using Open Science to advance science - advancing open data Using Open Science to advance science - advancing open data
Using Open Science to advance science - advancing open data Robert Oostenveld
 
The African Open Science Platform: Policy, Infrastructure, Skills and Incenti...
The African Open Science Platform: Policy, Infrastructure, Skills and Incenti...The African Open Science Platform: Policy, Infrastructure, Skills and Incenti...
The African Open Science Platform: Policy, Infrastructure, Skills and Incenti...African Open Science Platform
 
The Horizon2020 Open Data Pilot - OpenAIRE Webinar
The Horizon2020 Open Data Pilot - OpenAIRE WebinarThe Horizon2020 Open Data Pilot - OpenAIRE Webinar
The Horizon2020 Open Data Pilot - OpenAIRE WebinarMartin Donnelly
 
The Horizon 2020 Open Data Pilot
The Horizon 2020 Open Data PilotThe Horizon 2020 Open Data Pilot
The Horizon 2020 Open Data PilotMartin Donnelly
 
Navigating the data management ecosystem - Dan Valen
Navigating the data management ecosystem - Dan ValenNavigating the data management ecosystem - Dan Valen
Navigating the data management ecosystem - Dan ValenDigital Science
 
Open science and its advocacy
Open science and its advocacyOpen science and its advocacy
Open science and its advocacySarah Jones
 

Ähnlich wie The OpenCon Intro to Open Data (20)

Data as a research output and a research asset: the case for Open Science/Sim...
Data as a research output and a research asset: the case for Open Science/Sim...Data as a research output and a research asset: the case for Open Science/Sim...
Data as a research output and a research asset: the case for Open Science/Sim...
 
The State of Open Data Report by @figshare
The State of Open Data Report  by @figshareThe State of Open Data Report  by @figshare
The State of Open Data Report by @figshare
 
Open Data in a Big Data World: easy to say, but hard to do?
Open Data in a Big Data World: easy to say, but hard to do?Open Data in a Big Data World: easy to say, but hard to do?
Open Data in a Big Data World: easy to say, but hard to do?
 
HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8
 
Open Science Governance and Regulation/Simon Hodson
Open Science Governance and Regulation/Simon HodsonOpen Science Governance and Regulation/Simon Hodson
Open Science Governance and Regulation/Simon Hodson
 
Nicole Nogoy: GigaScience...how licensing can change the way we do research
Nicole Nogoy: GigaScience...how licensing can change the way we do researchNicole Nogoy: GigaScience...how licensing can change the way we do research
Nicole Nogoy: GigaScience...how licensing can change the way we do research
 
Open science curriculum for students, June 2019
Open science curriculum for students, June 2019Open science curriculum for students, June 2019
Open science curriculum for students, June 2019
 
A coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon HodsonA coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon Hodson
 
Open Science Globally: Some Developments/Dr Simon Hodson
Open Science Globally: Some Developments/Dr Simon HodsonOpen Science Globally: Some Developments/Dr Simon Hodson
Open Science Globally: Some Developments/Dr Simon Hodson
 
Nicole Nogoy's talk at eResearchNZ 2014: Improving data sharing, integration ...
Nicole Nogoy's talk at eResearchNZ 2014: Improving data sharing, integration ...Nicole Nogoy's talk at eResearchNZ 2014: Improving data sharing, integration ...
Nicole Nogoy's talk at eResearchNZ 2014: Improving data sharing, integration ...
 
Open Data - strategies for research data management & impact of best practices
Open Data - strategies for research data management & impact of best practicesOpen Data - strategies for research data management & impact of best practices
Open Data - strategies for research data management & impact of best practices
 
Acting as Advocate? Seven steps for libraries in the data decade
Acting as Advocate? Seven steps for libraries in the data decadeActing as Advocate? Seven steps for libraries in the data decade
Acting as Advocate? Seven steps for libraries in the data decade
 
Open science, open data - FOSTER training, Potsdam
Open science, open data - FOSTER training, PotsdamOpen science, open data - FOSTER training, Potsdam
Open science, open data - FOSTER training, Potsdam
 
Using Open Science to advance science - advancing open data
Using Open Science to advance science - advancing open data Using Open Science to advance science - advancing open data
Using Open Science to advance science - advancing open data
 
The African Open Science Platform: Policy, Infrastructure, Skills and Incenti...
The African Open Science Platform: Policy, Infrastructure, Skills and Incenti...The African Open Science Platform: Policy, Infrastructure, Skills and Incenti...
The African Open Science Platform: Policy, Infrastructure, Skills and Incenti...
 
The Horizon2020 Open Data Pilot - OpenAIRE Webinar
The Horizon2020 Open Data Pilot - OpenAIRE WebinarThe Horizon2020 Open Data Pilot - OpenAIRE Webinar
The Horizon2020 Open Data Pilot - OpenAIRE Webinar
 
The Horizon 2020 Open Data Pilot
The Horizon 2020 Open Data PilotThe Horizon 2020 Open Data Pilot
The Horizon 2020 Open Data Pilot
 
Navigating the data management ecosystem - Dan Valen
Navigating the data management ecosystem - Dan ValenNavigating the data management ecosystem - Dan Valen
Navigating the data management ecosystem - Dan Valen
 
From byte to mind
From byte to mindFrom byte to mind
From byte to mind
 
Open science and its advocacy
Open science and its advocacyOpen science and its advocacy
Open science and its advocacy
 

Mehr von Ross Mounce

Workshop 5: Uptake of, and concepts in text and data mining
Workshop 5: Uptake of, and concepts in text and data miningWorkshop 5: Uptake of, and concepts in text and data mining
Workshop 5: Uptake of, and concepts in text and data miningRoss Mounce
 
Liberating OA figures from PDF to Flickr (A Pro-iBiosphere talk)
Liberating OA figures from PDF to Flickr (A Pro-iBiosphere talk)Liberating OA figures from PDF to Flickr (A Pro-iBiosphere talk)
Liberating OA figures from PDF to Flickr (A Pro-iBiosphere talk)Ross Mounce
 
Social Media For Researchers
Social Media For ResearchersSocial Media For Researchers
Social Media For ResearchersRoss Mounce
 
Social Media for Science
Social Media for ScienceSocial Media for Science
Social Media for ScienceRoss Mounce
 
Sharing re-usable phylogenetic data: we're not there yet
Sharing re-usable phylogenetic data: we're not there yetSharing re-usable phylogenetic data: we're not there yet
Sharing re-usable phylogenetic data: we're not there yetRoss Mounce
 
Phylogenetic Congruence between Cranial and Postcranial Characters in Archosa...
Phylogenetic Congruence between Cranial and Postcranial Characters in Archosa...Phylogenetic Congruence between Cranial and Postcranial Characters in Archosa...
Phylogenetic Congruence between Cranial and Postcranial Characters in Archosa...Ross Mounce
 

Mehr von Ross Mounce (8)

Workshop 5: Uptake of, and concepts in text and data mining
Workshop 5: Uptake of, and concepts in text and data miningWorkshop 5: Uptake of, and concepts in text and data mining
Workshop 5: Uptake of, and concepts in text and data mining
 
Liberating OA figures from PDF to Flickr (A Pro-iBiosphere talk)
Liberating OA figures from PDF to Flickr (A Pro-iBiosphere talk)Liberating OA figures from PDF to Flickr (A Pro-iBiosphere talk)
Liberating OA figures from PDF to Flickr (A Pro-iBiosphere talk)
 
Social Media For Researchers
Social Media For ResearchersSocial Media For Researchers
Social Media For Researchers
 
Social Media for Science
Social Media for ScienceSocial Media for Science
Social Media for Science
 
Sharing re-usable phylogenetic data: we're not there yet
Sharing re-usable phylogenetic data: we're not there yetSharing re-usable phylogenetic data: we're not there yet
Sharing re-usable phylogenetic data: we're not there yet
 
Herding Cats
Herding CatsHerding Cats
Herding Cats
 
Phylogenetic Congruence between Cranial and Postcranial Characters in Archosa...
Phylogenetic Congruence between Cranial and Postcranial Characters in Archosa...Phylogenetic Congruence between Cranial and Postcranial Characters in Archosa...
Phylogenetic Congruence between Cranial and Postcranial Characters in Archosa...
 
ProgPal2011
ProgPal2011ProgPal2011
ProgPal2011
 

Kürzlich hochgeladen

4.9.24 School Desegregation in Boston.pptx
4.9.24 School Desegregation in Boston.pptx4.9.24 School Desegregation in Boston.pptx
4.9.24 School Desegregation in Boston.pptxmary850239
 
How to Make a Duplicate of Your Odoo 17 Database
How to Make a Duplicate of Your Odoo 17 DatabaseHow to Make a Duplicate of Your Odoo 17 Database
How to Make a Duplicate of Your Odoo 17 DatabaseCeline George
 
Q-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITWQ-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITWQuiz Club NITW
 
Grade Three -ELLNA-REVIEWER-ENGLISH.pptx
Grade Three -ELLNA-REVIEWER-ENGLISH.pptxGrade Three -ELLNA-REVIEWER-ENGLISH.pptx
Grade Three -ELLNA-REVIEWER-ENGLISH.pptxkarenfajardo43
 
Reading and Writing Skills 11 quarter 4 melc 1
Reading and Writing Skills 11 quarter 4 melc 1Reading and Writing Skills 11 quarter 4 melc 1
Reading and Writing Skills 11 quarter 4 melc 1GloryAnnCastre1
 
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnvESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnvRicaMaeCastro1
 
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...DhatriParmar
 
Expanded definition: technical and operational
Expanded definition: technical and operationalExpanded definition: technical and operational
Expanded definition: technical and operationalssuser3e220a
 
Scientific Writing :Research Discourse
Scientific  Writing :Research  DiscourseScientific  Writing :Research  Discourse
Scientific Writing :Research DiscourseAnita GoswamiGiri
 
How to Fix XML SyntaxError in Odoo the 17
How to Fix XML SyntaxError in Odoo the 17How to Fix XML SyntaxError in Odoo the 17
How to Fix XML SyntaxError in Odoo the 17Celine George
 
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptxBIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptxSayali Powar
 
Using Grammatical Signals Suitable to Patterns of Idea Development
Using Grammatical Signals Suitable to Patterns of Idea DevelopmentUsing Grammatical Signals Suitable to Patterns of Idea Development
Using Grammatical Signals Suitable to Patterns of Idea Developmentchesterberbo7
 
4.11.24 Mass Incarceration and the New Jim Crow.pptx
4.11.24 Mass Incarceration and the New Jim Crow.pptx4.11.24 Mass Incarceration and the New Jim Crow.pptx
4.11.24 Mass Incarceration and the New Jim Crow.pptxmary850239
 
Sulphonamides, mechanisms and their uses
Sulphonamides, mechanisms and their usesSulphonamides, mechanisms and their uses
Sulphonamides, mechanisms and their usesVijayaLaxmi84
 
Congestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentationCongestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentationdeepaannamalai16
 
Narcotic and Non Narcotic Analgesic..pdf
Narcotic and Non Narcotic Analgesic..pdfNarcotic and Non Narcotic Analgesic..pdf
Narcotic and Non Narcotic Analgesic..pdfPrerana Jadhav
 
ICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfVanessa Camilleri
 
4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptxmary850239
 
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITWQ-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITWQuiz Club NITW
 

Kürzlich hochgeladen (20)

4.9.24 School Desegregation in Boston.pptx
4.9.24 School Desegregation in Boston.pptx4.9.24 School Desegregation in Boston.pptx
4.9.24 School Desegregation in Boston.pptx
 
How to Make a Duplicate of Your Odoo 17 Database
How to Make a Duplicate of Your Odoo 17 DatabaseHow to Make a Duplicate of Your Odoo 17 Database
How to Make a Duplicate of Your Odoo 17 Database
 
Q-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITWQ-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITW
 
Grade Three -ELLNA-REVIEWER-ENGLISH.pptx
Grade Three -ELLNA-REVIEWER-ENGLISH.pptxGrade Three -ELLNA-REVIEWER-ENGLISH.pptx
Grade Three -ELLNA-REVIEWER-ENGLISH.pptx
 
Reading and Writing Skills 11 quarter 4 melc 1
Reading and Writing Skills 11 quarter 4 melc 1Reading and Writing Skills 11 quarter 4 melc 1
Reading and Writing Skills 11 quarter 4 melc 1
 
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnvESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
 
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
 
Expanded definition: technical and operational
Expanded definition: technical and operationalExpanded definition: technical and operational
Expanded definition: technical and operational
 
Scientific Writing :Research Discourse
Scientific  Writing :Research  DiscourseScientific  Writing :Research  Discourse
Scientific Writing :Research Discourse
 
How to Fix XML SyntaxError in Odoo the 17
How to Fix XML SyntaxError in Odoo the 17How to Fix XML SyntaxError in Odoo the 17
How to Fix XML SyntaxError in Odoo the 17
 
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptxBIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
 
Faculty Profile prashantha K EEE dept Sri Sairam college of Engineering
Faculty Profile prashantha K EEE dept Sri Sairam college of EngineeringFaculty Profile prashantha K EEE dept Sri Sairam college of Engineering
Faculty Profile prashantha K EEE dept Sri Sairam college of Engineering
 
Using Grammatical Signals Suitable to Patterns of Idea Development
Using Grammatical Signals Suitable to Patterns of Idea DevelopmentUsing Grammatical Signals Suitable to Patterns of Idea Development
Using Grammatical Signals Suitable to Patterns of Idea Development
 
4.11.24 Mass Incarceration and the New Jim Crow.pptx
4.11.24 Mass Incarceration and the New Jim Crow.pptx4.11.24 Mass Incarceration and the New Jim Crow.pptx
4.11.24 Mass Incarceration and the New Jim Crow.pptx
 
Sulphonamides, mechanisms and their uses
Sulphonamides, mechanisms and their usesSulphonamides, mechanisms and their uses
Sulphonamides, mechanisms and their uses
 
Congestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentationCongestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentation
 
Narcotic and Non Narcotic Analgesic..pdf
Narcotic and Non Narcotic Analgesic..pdfNarcotic and Non Narcotic Analgesic..pdf
Narcotic and Non Narcotic Analgesic..pdf
 
ICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdf
 
4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx
 
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITWQ-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
 

The OpenCon Intro to Open Data

  • 1. Intro to Open Data for OpenCon 2014 November 4, 2014 Ross Mounce, Ph.D. (@RMounce) Postdoc, University of Bath
  • 2. These slides are on Slideshare here: bit.ly/opendataintro All textual content is
  • 3. With thanks to SPARC & R2RC for organising this webcast series Previous talks in this series have been given on Open Access http://youtu.be/5YwASIziPIQ and Open Educational Resources http://youtu.be/5Dauh_PeAzI
  • 4. Outline ●Why care about data? ●Definition: exactly what is open data? ●Open data in scholarship, and beyond... ● Aspects of open data, inc. HowTo ● Legal , Technical ● Privacy! Not all data should be be open ● The 5 stars of open data
  • 5. Why care about data? Data is absolutely fundamental to most research Science without data isn't science * *Entirely theoretical, data-free contributions to science are possible, but rare
  • 6.
  • 7. By sharing data we can see further Data are the building blocks of science Shared, re-used data allow us to more rigorously test hypotheses; “to see further” ...and to do it all more quickly and easily. Also, for validation & reproducibility (more of which later...)
  • 8. What exactly is open data? Open means anyone can freely access, use, modify, and share for any purpose (subject, at most, to requirements that preserve provenance and openness) From http://opendefinition.org/, see http://opendefinition.org/od/ for more detail
  • 9. Legally, what is open data? There are a great many open knowledge definition (OKD) conformant licences, including: CC0 http://creativecommons.org/ publicdomain/zero/1.0/ CC BY https://creativecommons.org/ licenses/by/4.0/ CC BY-SA https://creativecommons.org/ licenses/by-sa/4.0/ See here for the comprehensive list: http://opendefinition.org/licenses/
  • 10. Not all Creative Commons licences are 'open' } NC -- You “may not use this work for commercial purposes”. Work under this licence cannot be used for any purpose, therefore it is not open. Can have significant, often unexpected negative impact on potential re-use. ND -- “No Derivative Works”. Work under this licence cannot be adapted if it is re-used. Not very helpful for research! NC & ND – An extremely restrictive re-use licence, neither commercial purposes nor adaptations are allowed.
  • 11. Non-open licencing causes real problems for research & education The Creative Commons non-commercial (-NC) restriction is poorly defined in most jurisdictions, and even more poorly understood by many of its users. “non-commercial” != “non-profit” A) Non-commercial actually excludes many teaching purposes: In the UK, university students typically pay expensive tuition fees to attend. Thus university teaching is often a commercial activity, -NC restricted materials cannot be used to teach students in these circumstances. B) Licence incompatibility – NC licences are not compatible with licences used on major collaboration platforms like Wikipedia or Wikimedia Commons C) Non-commercial organizations (e.g. Deutschlandradio) have been successfully sued for re-using CC BY-NC content without permission. http://zookeys.pensoft.net/articles.php?id=3036 https://www.techdirt.com/articles/20140326/11405526695/german-court-says-creat ive-commons-non-commercial-licenses-must-be-purely-personal-use.shtml
  • 12. Real problems of non-open data: GBIF & biodiversity data Desmet, P. (2013) Showing you this map of aggregated bullfrog occurrences would be illegal http://peterdesmet.com/posts/illegal-bullfrogs.html
  • 13. Open data in scholarship, and beyond The open data movement is much broader than just academia/research It's been successful & popular in areas like open government data: For transparency, detecting & discouraging corruption For releasing social & commercial value (governments collect a lot of data already, why not make wider use of it, at little or no extra cost?) For participatory governance – citizens can be more informed, a “read/write” society Each of these has clear parallels with open research data: transparency & fraud detection, extra value through research data re-use, participatory citizen science Some text adapted from http://opengovernmentdata.org/
  • 14. Open data in scholarship, and beyond Similarly, and with some overlap to open research data, there's the open GLAM movement (GLAM = Galleries, Libraries, Archives & Museums) In this case, their data is typically collections metadata but also digital images of their collections See http://openglam.org/ for more
  • 15. Technical aspects of open data So, you understand the imporance of licensing... What next? How best can we make our data openly available? Where should I upload to? What format(s) should I make the data available in?
  • 16. Data Standards & Data File Formats Adhere to existing standards!
  • 17. Data Standards & Data File Formats Take note of community standards: e.g. the Bermuda Principles for sharing DNA seq. data ● Automatic release of sequence assemblies larger than 1 kb (preferably within 24 hours). ● Immediate publication of finished annotated sequences. ● Aim to make the entire sequence freely available in the public domain
  • 18. Data Standards & Data File Formats If there are no formally agreed community standards, canvas the community to create/formalise a standard e.g. Best Practices for Data Sharing in Phylogenetic Research (2014) PLOS Currents Tree of Life e.g. The 1st Open Economics International Workshop (Cambridge, 2013) bringing together academic economists from around the world to discuss data sharing in economics research.
  • 19. Data Standards & Data File Formats If there are multiple, competing file formats: Opt for file formats based on open standards https://en.wikipedia.org/wiki/Open_standard e.g. Avoid proprietary formats https://en.wikipedia.org/wiki/Proprietary_format e.g.
  • 20.
  • 21. Data Standards & Data File Formats A real example: recent creation of a new data standard for exchange of 3-dimensional reconstruction of objects from tomographic imaging data SPIERS software + VAXML data standard Sutton et al (2012) SPIERS and VAXML: A software toolkit for tomographic visualisation and a format for virtual specimen interchange. Palaeontologia Electronica
  • 22. Where to upload open data? http://www.crystallography.net/ Genbank, SRA, 1000's more!
  • 23. 'Data paper' journals http://www.mdpi.com/journal/data/about
  • 24. Your data is NOT 'too big' to share http://gigadb.org/dataset/100124 39 Gigabytes (GB) of MRI scans
  • 25. Intelligent data papers allow databases to automatically pull-in your data Many publishers (e.g. Pensoft) intelligently markup data papers so that the data can be automatically ingested into appropriate db's on the day of publication! Data data
  • 26. Data sharing benefits authors & re-users Piwowar HA, Vision TJ. (2013) Data reuse and the open data citation advantage. PeerJ 1:e175 “...open data citation benefit for this sample to be 9%” relative to papers providing no public data, for gene expression microarray data 10.7717/peerj.175/fig-2 See also previous work by Piwowar: 10.1371/journal.pone.0000308
  • 27. Those who share data, do better science Wicherts, J. M., Bakker, M. & Molenaar, D. (2011) Willingness to share research data is related to the strength of the evidence and the quality of reporting of statistical results. PLoS ONE 6, e26828+ URL http://dx.doi.org/10.1371/journal.pone.0026828 The authors examined psychological papers for the quality of statistical reporting & asked the authors of those papers for the full data underlying the reported results. Generally, those who shared, had more statistically robust, reproducible results.
  • 28. “Email the author for data” - doesnt work A well-known problem, which I myself have also faced many times!!! Many legacy journals unfortunately still pretend that “email the author” is still acceptable. Wicherts JM, Borsboom D, Kats J, Molenaar D (2006) The poor availability of psychological research data for reanalysis. American Psychologist 61: 726–728 link
  • 29. Best practice open data is time consuming (but still worth the extra effort!) Emilio M. Bruna recently provided an estimate of the amount of time it took him to prepare & upload open data related to publication to figshare & dryad. 11 Hours & $90 (for Dryad) Providing open-source code was the most time consuming part (25.5 hours), and Open Access publication the most expensive ($600). http://brunalab.org/blog/2014/09/04/the-opportunity-cost-of-my-openscience-was-35- hours-690/
  • 30. Not all data should be open! Obviously, there are some types of data which should NOT be made mandatorily open e.g. sensitive medical data However, with informed consent, if patients really want to, they should be allowed to publish their own medical data
  • 31. But with informed consent, sharing sensitive health data can have good outcomes http://blog.ted.com/2013/06/14/why-i-opensourced-cures-for-my-cancer-sa lvatore-iaconesi-at-tedglobal-2013/
  • 32. Other exceptions to the open default Sensitive species conservation data e.g. exact geocoordinates of home range Certain species of wild orchids, cacti & carnivorous plants are highly endangered by illegal harvesting. Publishing the exact geolocation data of the remaining populations of commercially-desirable, endangered species is really dumb thing to do. Such data is typically held privately in databases (not publicly available).
  • 33. The 5 stars of open data Most research data would get ZERO (not available online) Or just ONE star http://5stardata.info/
  • 34. 3-star open research data is achievable This is where research data publication should be aiming for in the short term. Publishing .csv / non-proprietary open data is NOT actually that hard! http://5stardata.info/
  • 35. Open Access & Open Data have similar goals 80% of research is publicly funded 1 e.g. maximising the return on investment, provided to research by taxpayers and charities Source: “Academic Publishing: Survey of funders supports the benign Open Access outcome priced into shares, HSBC Global Research,” February 11, 2013: https://www.research.hsbc.com/midas/Res/RDV?ao=20&key=RxArFbnG1P&n=360010.PDF
  • 36. Further Reading 1.Editor’s Introduction - Samuel A. Moore 2.Open Content Mining - Peter Murray-Rust, Jennifer C. Molloy, Diane Cabell 3.The Need to Humanize Open Science - Eric C. Kansa 4.Data Sharing in a Humanitarian Organization: The Experience of Médecins Sans Frontières - Unni Karunakara 5.Why Open Drug Discovery Needs Four Simple Rules for Licensing Data and Models - Antony J. Williams, John Wilbanks, Sean Ekins 6.Open Data in the Earth and Climate Sciences - Sarah Callaghan 7.Open Minded Psychology - Wouter van den Bos, Mirjam Jenny, Dirk Wulff 8.Open Data in Health Sciences - Tom Pollard 9.Open Research Data in Economics - Velichka Dimitrova 10.Open Data and Palaeontology - Ross Mounce Out soon this November, 2014 Published by Ubiquity Press
  • 37. Further Reading ● The Open Data Handbook - http://opendatahandbook.org/ ● 5 star Open Data - http://5stardata.info/ ● Science as an open enterprise (2012) A Royal Society report ● Caetano, D. S. & Aisenberg, A. 2014 Forgotten treasures: the fate of data in animal behaviour studies Animal Behaviour Data sharing in phylogenetics ● Magee et al 2014 The Dawn of Open Access to Phylogenetic Data PLOS ONE ● Drew et al 2013 Lost Branches on the Tree of Life. PLOS Biology ● Stoltzfus et al 2012 Sharing and re-use of phylogenetic trees (and associated data) to facilitate synthesis. BMC Research Notes On licencing & legal issues with re-use ● Hagedorn et al 2011 Creative commons licenses and the non-commercial condition: Implications for the re-use of biodiversity information. ZooKeys ● Mounce 2012. Life as a palaeontologist: Academia, the internet and creative commons. Palaeontology Online ● Klimpel, P. 2012 Consequences, Risks, and side-effects of the license module Non-Commercial – NC [PDF]
  • 38. Further Reading ● Murray-Rust, P. Open data in science. Serials Review 34, 52-64 (2008). URL http://dx.doi.org/10.1016/j.serrev.2008.01.001 ● Leonelli, S., Smirnoff, N., Moore, J., Cook, C. & Bastow, R. Making open data work for plant scientists. Journal of Experimental Botany 64, 4109-4117 (2013). URL http://dx.doi.org/10.1093/jxb/ert273 ● Hrynaszkiewicz, I. & Cockerill, M. Open by default: a proposed copyright license and waiver agreement for open access research and data in peer-reviewed journals. BMC Research Notes 5, 494+ (2012). URL http://dx.doi.org/10.1186/1756-0500-5-494 ● Boulton, G., Rawlins, M., Vallance, P. & Walport, M. Science as a public enterprise: the case for open data. The Lancet 377, 1633-1635 (2011). URL http://dx.doi.org/10.1016/s0140-6736(11)60647-8 ● Parr, C. S. Open sourcing ecological data. BioScience 57, 309-310 (2007). URL http://dx.doi.org/10.1641/b570402 ● Poisot, T., Mounce, R. & Gravel, D. Moving toward a sustainable ecological science: don't let data go to waste! Ideas in Ecology and Evolution 6 (2013). URL http://dx.doi.org/10.4033/iee.2013.6b.14.f
  • 39. November 15-17, 2014 Washington, D.C. & Online www.opencon2014.org @open_con #opencon2014 See you there if you're lucky enough to be going!
  • 40. Thank you! Happy to answer all questions ross@righttoresearch.org @RMounce www.righttoresearch.org www.sparc.arl.org