Accessing small molecule data using ChEBI

Janna Hastings, Duncan Hull and Nico Adams. Programmatic Access to Biological Databases (Perl), 22-26 February 2010 @ EBI

Overview: ChEBI – Chemical Entities of Biological Interest (25.02.10)

Small Molecules within Bioinformatics: literature; nucleotide sequences; genomes; expressions; protein sequences; protein domains, families; 3D structures; enzymes; small molecules; pathways; systems

Small molecules participate in all the processes of life

Signaling: γ-aminobutyric acid

Metabolism: adenosine 5'-triphosphate

Enzymes: clavulanic acid (ChEBI:48947) acts as a suicide inhibitor of bacterial β-lactamase enzymes

Pathways http://www.genome.jp/kegg-bin/highlight_pathway?scale=1.0&map=map00231&keyword=tryptophan

Systems biology: D-enantiomer, sweet; L-enantiomer, bitter

Drug design

Drug types 2003 - 2009: 'small molecules' in various shades of blue (http://chembl.blogspot.com/)

Getting the chemistry right: http://www.drugbank.ca/drugs/DB01041

Small molecule data sources: PubChem (http://pubchem.ncbi.nlm.nih.gov/), a deposition-driven, publicly available compound repository containing more than 25 million unique structures; ChemSpider (http://www.chemspider.com/), an automatic aggregation of publicly available chemistry data with crowdsourced annotation; ChEBI (http://www.ebi.ac.uk/chebi/), a manually annotated database and ontology

Small molecule annotations

Chemicals - ChEBI (example entry: caffeine)
Visualisation: structure image
Nomenclature: caffeine; 1,3,7-trimethylxanthine; methyltheobromine
Chemical data: Formula C8H10N4O2; Charge 0; Mass 194.19
Ontology: metabolite; CNS stimulant; trimethylxanthines
Database Xrefs: MSDchem: CFF; KEGG DRUG: D00528
Chemical Informatics: InChI=1/C8H10N4O2/c1-10-4-9-6-5(10)7(13)12(3)8(14)11(6)2/h4H,1-3H3; SMILES CN1C(=O)N(C)c2ncn(C)c2C1=O
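As a small worked check of the chemical data on this slide, the mass of 194.19 can be reproduced from the formula C8H10N4O2. The sketch below is illustrative only: the function names and the rounded atomic weights are assumptions for this example, not part of ChEBI, and the parser handles only simple formulae without brackets or charges.

```python
import re

# Rounded standard atomic weights, limited to the elements used here (assumed values).
ATOMIC_MASS = {"C": 12.011, "H": 1.008, "N": 14.007, "O": 15.999}


def parse_formula(formula: str) -> dict:
    """Count atoms in a simple formula such as 'C8H10N4O2'."""
    counts = {}
    for symbol, number in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        counts[symbol] = counts.get(symbol, 0) + (int(number) if number else 1)
    return counts


def average_mass(formula: str) -> float:
    """Sum atomic weights over all atoms in the formula."""
    return sum(ATOMIC_MASS[el] * n for el, n in parse_formula(formula).items())


print(round(average_mass("C8H10N4O2"), 2))  # 194.19
```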

What is ChEBI?

ChEBI home page

How is ChEBI maintained?

ChEBI entries contain

ChEBI entry view

Automatic Cross-references

Chemical Structures

Molfile format

Block 2: Searching and browsing ChEBI

Simple text search: enter any text; wildcard: *

Advanced text search: narrow to category; combine terms with AND, OR and BUT NOT

Structure search: search options; structure drawing tools

Search Results: click to go to entry page; hover over for search menu

Fingerprints

Fingerprints [2]: C8H9NO2 cannot be a substructure of an entity which does not have at least 8 carbon atoms, 9 hydrogen atoms…
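The atom-count argument on this slide can be sketched as a cheap prescreen before any real substructure matching. This is a minimal illustration, not ChEBI's implementation; `atom_counts` and `may_contain` are hypothetical names, and the parser assumes simple formulae without brackets or charges.

```python
import re
from collections import Counter


def atom_counts(formula: str) -> Counter:
    """Count atoms per element in a simple formula such as 'C8H9NO2'."""
    counts = Counter()
    for symbol, number in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        counts[symbol] += int(number) if number else 1
    return counts


def may_contain(target_formula: str, query_formula: str) -> bool:
    """Return False when the target has too few atoms of any element to contain the query."""
    target = atom_counts(target_formula)
    return all(target[el] >= n for el, n in atom_counts(query_formula).items())


print(may_contain("C8H10N4O2", "C8H9NO2"))  # True
print(may_contain("H2O", "C8H9NO2"))        # False
```

A `True` result only means the target survives the prescreen; it still has to be checked by an actual substructure match.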

Fingerprints [3]: water (HOH)
0-bond paths: H, O, H
1-bond paths: HO, OH
2-bond paths: HOH
Pattern | Hashed bitmap
H | 0000010000
O | 0010000000
HO | 1010000000
OH | 0000100010
HOH | 0000000101
Result: 1010110111
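The result bitmap for water is the bitwise OR of the hashed bitmaps of its paths. The sketch below reproduces that step using the bitmaps from the slide; the hash function that produced them is not shown in the source, so the bitmaps are taken as given, and `combine` is a hypothetical helper name.

```python
from functools import reduce

# Hashed bitmaps copied from the water (HOH) example.
PATTERN_BITS = {
    "H": "0000010000",
    "O": "0010000000",
    "HO": "1010000000",
    "OH": "0000100010",
    "HOH": "0000000101",
}


def combine(bitmaps, width=10):
    """OR all bitmaps together and return the result as a fixed-width bit string."""
    value = reduce(lambda acc, bits: acc | int(bits, 2), bitmaps, 0)
    return format(value, f"0{width}b")


print(combine(PATTERN_BITS.values()))  # 1010110111
```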

Types of structure search
InChI=1/H2O/h1H2
Fingerprints compared: 1010110111 and 0010110010
Tanimoto(a,b) = c / (a + b - c) = 4 / (4 + 7 - 4) = 0.57, where a and b are the numbers of bits set in the two fingerprints and c is the number of bits set in both
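The Tanimoto calculation on this slide can be checked with a few lines of code. This is a sketch assuming fingerprints are given as bit strings of equal length; `tanimoto` is a hypothetical helper, not a ChEBI function.

```python
def tanimoto(fp_a: str, fp_b: str) -> float:
    """Tanimoto coefficient c / (a + b - c) for two fingerprints given as bit strings."""
    x, y = int(fp_a, 2), int(fp_b, 2)
    a = bin(x).count("1")      # bits set in the first fingerprint
    b = bin(y).count("1")      # bits set in the second fingerprint
    c = bin(x & y).count("1")  # bits set in both
    # Note: two all-zero fingerprints would divide by zero; not handled in this sketch.
    return c / (a + b - c)


print(round(tanimoto("0010110010", "1010110111"), 2))  # 0.57
```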

Browse via Periodic Table: Molecular entities / Elements

Navigate via links in ontology: click to follow links

Block 3: Understanding the ChEBI ontology

Annotation of bioinformatics data

The ChEBI ontology: (R)-adrenaline

Molecular structure ontology

Role ontology

ChEBI ontology relationships

Viewing ChEBI ontology

Viewing ChEBI ontology [2]: tree view

Browsing ChEBI ontology (OLS): browse the ontology with the Ontology Lookup Service (OLS), http://www.ebi.ac.uk/ontology-lookup/

Ontology Lookup Service

OBO Foundry: “The OBO Foundry is a collaborative experiment involving developers of science-based ontologies who are establishing a set of principles for ontology development with the goal of creating a suite of orthogonal interoperable reference ontologies in the biomedical domain.”

Block 4: Download and programmatic access

ChEBI domain model: self-referencing - merging

Compound IDs and Merging: only the main accession of a merged group is displayed. Navigated accession: CHEBI:5585. Main accession: CHEBI:15377

Compound IDs and Merging [2]
Slide annotations: additional acc; Parent ID; this compound ID = additional acc
First table:
ID | STATUS | CHEBI_ACCN | SOURCE | PARENT_ID | NAME | DEFINITION
15377 | C | CHEBI:15377 | ChEBI | null | water | null
5585 | C | CHEBI:5585 | KEGG | 15377 | null | null
Second table:
ID | COMPOUND | ACCN_NUMBER | TYPE | STATUS | SOURCE | URL_ABBR
16213 | 5585 | C00001 | KEGG accn | C | KEGG | KEGG
17314 | 5585 | 7732-18-5 | CAS Registry | C | KEGG | null
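Resolving a merged entry to its main accession amounts to following PARENT_ID until it is null. The sketch below uses the two rows shown on this slide held in a plain dictionary; the data structure and the `main_accession` helper are assumptions for illustration, not the actual ChEBI schema access code.

```python
# Rows from the slide, keyed by ID: (CHEBI_ACCN, PARENT_ID)
COMPOUNDS = {
    15377: ("CHEBI:15377", None),
    5585: ("CHEBI:5585", 15377),
}


def main_accession(compound_id, compounds=COMPOUNDS):
    """Follow PARENT_ID links until an entry without a parent is reached."""
    seen = set()
    while compounds[compound_id][1] is not None:
        if compound_id in seen:  # guard against accidental cycles in the data
            raise ValueError(f"cycle detected at ID {compound_id}")
        seen.add(compound_id)
        compound_id = compounds[compound_id][1]
    return compounds[compound_id][0]


print(main_accession(5585))   # CHEBI:15377
print(main_accession(15377))  # CHEBI:15377
```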

Downloading ChEBI flavours

Downloading ChEBI

OBO File Format: general header information; synonym types used in terms; root terms; relationships to other terms
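To make the layout concrete (header tag-value pairs, then `[Term]` stanzas carrying `is_a` and `relationship` lines), here is a minimal parsing sketch. The sample text and its `EX:` identifiers are invented placeholders, not real ChEBI content, and `parse_obo` is a hypothetical helper that ignores most of the OBO specification (escapes, trailing modifiers, non-Term stanzas).

```python
# Invented sample in OBO style; the EX: identifiers are placeholders.
SAMPLE = """\
format-version: 1.2

[Term]
id: EX:0000001
name: example parent

[Term]
id: EX:0000002
name: example child
is_a: EX:0000001 ! example parent
relationship: has_role EX:0000003
"""


def parse_obo(text):
    """Return (header, terms); every tag maps to a list of values."""
    header, terms = {}, []
    target = header  # tag-value pairs go to the header until the first stanza
    for raw in text.splitlines():
        line = raw.split(" ! ")[0].strip()  # drop trailing '! comment'
        if not line or line.startswith("!"):
            continue
        if line.startswith("[") and line.endswith("]"):
            target = {}
            if line == "[Term]":  # other stanza types are parsed but discarded
                terms.append(target)
            continue
        tag, sep, value = line.partition(":")
        if sep:
            target.setdefault(tag.strip(), []).append(value.strip())
    return header, terms


header, terms = parse_obo(SAMPLE)
print(header["format-version"])
print([t["id"][0] for t in terms])
print(terms[1]["is_a"], terms[1]["relationship"])
```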

SDF File Lite format: entries separated by $$$$

SDF File complete format: entries separated by $$$$
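Because entries in both SDF flavours are separated by `$$$$`, a file can be split into records with very little code. The sketch below is illustrative: the sample records, the `ID` and `NAME` field names and both helper functions are assumptions for this example, and the molfile block is not parsed.

```python
import re

# Invented sample: molfile blocks are stubbed out, field names are placeholders.
SAMPLE = """\
example-1
M  END
> <ID>
EX:1

> <NAME>
first example

$$$$
example-2
M  END
> <ID>
EX:2

$$$$
"""


def split_sdf(text):
    """Split SDF text into records on lines consisting of '$$$$'."""
    records, current = [], []
    for line in text.splitlines():
        if line.strip() == "$$$$":
            records.append("\n".join(current))
            current = []
        else:
            current.append(line)
    if any(part.strip() for part in current):  # tolerate a missing final separator
        records.append("\n".join(current))
    return records


def data_items(record):
    """Collect '> <FIELD>' data items; each value runs until the next blank line."""
    items, field = {}, None
    for line in record.splitlines():
        header = re.match(r">\s*<([^>]+)>", line)
        if header:
            field = header.group(1)
            items[field] = []
        elif field is not None:
            if line.strip():
                items[field].append(line)
            else:
                field = None
    return {name: "\n".join(values) for name, values in items.items()}


for record in split_sdf(SAMPLE):
    print(data_items(record))
```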

Flat-file tab and comma delimited

Table dumps

Web services: user application

The ChEBI web service

Web service client object model: getLiteEntity; getCompleteEntity; getOntology (Parents and Children)

Methods and parameters (1)

For more information

Acknowledgements
