The Confidence Information Ontology: a step towards a standard for asserting confidence in annotations

Table 1.

Example homology annotation from Bgee

Entity name	Qualifier	Taxon name	Line type	Evidence term name	Confidence term name	Reference ID
Autopod	—	Vertebrata	RAW	Traceable author statement	Medium confidence from single evidence	PMID:23598338
Autopod	NOT	Vertebrata	RAW	Traceable author statement	Medium confidence from single evidence	PMID:23598338
Autopod	—	Vertebrata	SUMMARY		Confidence statement from strongly conflicting evidence lines of same type	—

This table shows columns 4, 5, 7, 8, 10, 12 and 13 of the Bgee homology annotation file. The first two rows represent conflicting annotations from single evidence, about the homology of the autopod among Vertebrata. The third is an auto-generated row, summarizing the status of this homology hypothesis, from all evidence lines available.

Finally, the terms from the CIO that are the most informative and likely to be used are described in Table 2.

Table 2.

List of most informative terms from the CIO

Interpretation	Main branch	Term label
Assertion should be trusted	Single evidence	High confidence from single evidence
	Multiple evidence lines, same type	Confidence statement from congruent evidence lines of same type, overall confidence high
	Multiple evidence lines, same type	Confidence statement from congruent evidence lines of same type, overall confidence medium
	Multiple evidence lines, multiple types	Confidence statement from congruent evidence lines of multiple types, overall confidence high
	Multiple evidence lines, multiple types	Confidence statement from congruent evidence lines of multiple types, overall confidence medium

Assertion needs additional support	Single evidence	Low confidence from single evidence
	Multiple evidence lines, same type	Confidence statement from strongly conflicting evidence lines of same type
	Multiple evidence lines, same type	Confidence statement from weakly conflicting evidence lines of same type, overall confidence low
	Multiple evidence lines, multiple types	Confidence statement from strongly conflicting evidence lines of multiple types
	Multiple evidence lines, multiple types	Confidence statement from weakly conflicting evidence lines of multiple types, overall confidence low

Discussion

The aim of this work is to show how confidence in annotations can be provided in a standard way; it is not to impose one specific practice in assessing confidence. One purpose of the draft CIO described in this manuscript is to invite feedback and comments from the community. Whatever solution is eventually adopted, the problem of assessing confidence in annotations must be addressed. We hope that this project will provide a home for discussing this major issue (ideally through its associated tracker, available at https://github.com/BgeeDB/confidence-information-ontology), as well as a practical solution for those who wish to implement it rapidly. Once the principle and design of the CIO are approved by the community, the formalization of this draft ontology could be improved by properly defining the semantics of the terms created, using, e.g. the Ontology for Biomedical Investigations (20) or the Information Artifact Ontology (https://code.google.com/p/information-artifact-ontology/).

The main additional practical task described here is annotating single-evidence assertions with a confidence statement using a basic rating system, a solution akin to what is already adopted by several resources (see Introduction). Summary confidence annotations could then be automatically generated and assigned a confidence term from the ‘confidence statement from multiple evidence lines’ branch, as is the case, e.g. in the Bgee homology annotations, as long as individual assertions are provided with confidence information and ECO terms. Alternatively, annotation teams with limited resources could choose to provide annotations only at the global summary level. However, the latter solution has the disadvantage of masking the evaluation of confidence at the level of each evidence line, which limits the transparency of annotations.

Even when it is impossible to provide confidence in each statement, whether because of a lack of manpower or because of methodological limitations (e.g. in the case of many electronic annotation methods), it is still very relevant to record whether one evidence line or several supported an assertion, whether they were of the same type or not, and whether they were contradictory or consistent. Such an approach can also be used to integrate the CIO with legacy annotations, by assigning an arbitrary confidence statement to all single evidence lines (e.g. medium confidence from single evidence), and then automatically generating terms from the multiple evidence lines branch, thus providing confidence information at least according to the multiplicity and consistency of evidence lines.
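As an illustration of the legacy-annotation approach just described, the following Python sketch assigns a default confidence term to every single evidence line and then describes the set of lines along the three axes mentioned above: multiplicity, evidence type, and consistency. The class and function names are hypothetical, and the sketch deliberately stops short of choosing between ‘weakly’ and ‘strongly’ conflicting terms, since the rule for that distinction is not stated here.

```python
from dataclasses import dataclass

# Assumed default for legacy annotations that carry no confidence information.
DEFAULT_CONFIDENCE = "medium confidence from single evidence"


@dataclass(frozen=True)
class EvidenceLine:
    """One single-evidence annotation about an assertion (hypothetical model)."""
    evidence_term: str                      # ECO term name
    negated: bool = False                   # True when the NOT qualifier is present
    confidence: str = DEFAULT_CONFIDENCE    # confidence term for this line


def describe_summary(lines):
    """Describe evidence lines by multiplicity, evidence type and consistency."""
    if not lines:
        raise ValueError("at least one evidence line is required")
    if len(lines) == 1:
        return {"multiplicity": "single evidence", "types": None, "consistency": None}
    types = {line.evidence_term for line in lines}
    polarities = {line.negated for line in lines}
    return {
        "multiplicity": "multiple evidence lines",
        "types": "same type" if len(types) == 1 else "multiple types",
        "consistency": "congruent" if len(polarities) == 1 else "conflicting",
    }


# The two RAW rows of Table 1: same evidence type, opposite qualifiers.
table1 = [
    EvidenceLine("traceable author statement"),
    EvidenceLine("traceable author statement", negated=True),
]
summary = describe_summary(table1)
print(summary)
```

Applied to the two RAW rows of Table 1, the sketch reports multiple evidence lines of the same type that conflict, which is consistent with the branch of the term shown in the SUMMARY row.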

Assessing the quality of the data being captured is one of the most difficult, yet essential, tasks of biocurators. Until now, this has not been done systematically, or even explicitly. There are several reasons for this. First, all papers go through a process of peer review, and published data are usually assumed to be trustworthy. Second, due to the scale of the task, the resources available are not sufficient to capture the data from all published papers; a selection must be made as to which papers provide the most relevant information for the users of the resource being developed. As the usage of curated databases increases, biocurators have an editorial role that effectively filters published articles into biological databases. A careful selection of the data is thus essential.

This raises the question of what constitutes ‘high confidence’ evidence. Many biocurators are accustomed to estimating the confidence in the evidence sources that they use, yet it can be difficult to transform such subjective estimates into standardized levels. This issue is akin to that of inter-curator agreement for GO annotations (21), which, despite being high, highlights the inevitable subjectivity of the process of assigning an ontology term to an assertion.

Indeed, biocuration is a translation problem: the language of biologists must be translated into a structured vocabulary suitable for computational analyses. Ideally, this would be done without losing any of the original meaning, but that is hardly possible. An important aspect that is often missing from annotations is their biological context. For example, a protein may be found in the nucleus in one article by some immunocytochemistry approach, but may be known to have a function that is more consistent with a mitochondrial localization. Ideally, at some point, one should be able to integrate all information and try to reconstruct the biological meaning of the annotations. Having confidence information for the different annotations that describe a protein’s role will certainly help to resolve some of the apparent discrepancies in the annotations (and in the literature).

One important feature we are proposing is to systematically provide a summary annotation when several evidence sources are available. We believe that a summary annotation would be of great benefit, by giving a clear overview of an assertion that takes all evidence lines into account. Obtaining such an overview can often be difficult when many sources are available, especially when they are contradictory.

Another advantage of the guidelines proposed here is the ability to retain erroneous assertions, in order to inform users about retracted results, while discarding them when producing summary interpretations. Indeed, while resources providing a global overview of annotations about an entry, such as UniProtKB/Swiss-Prot, can remove erroneous information and provide comments to warn about incorrect information, this is hardly possible for resources presenting data based on individual assertions. For instance, for GO annotations, while the NOT qualifier allows conflicting information to be tracked, it is not sufficient, since information from dubious publications remains. Moreover, when an annotation is removed, e.g. following a paper retraction, no trace of this annotation is maintained. A user coming across the original article, unaware of the retraction, might conclude that the publication has simply not yet been annotated.

An example of this issue was described by Poux et al. (22), who showed how erroneous statements about the SIRT5 protein, based on incomplete in vitro studies, are still repeatedly published today. The approach proposed here would make it possible to identify assertions that were later shown to be based on misleading conclusions, through the use of the ‘rejected’ term. Users would then be aware of the retraction, while retracted results would not affect summary annotations, which would present the correct interpretation. Also, as summary annotations can be generated automatically (as long as individual assertions are provided with confidence information and ECO terms), the re-evaluation of an incorrect statement could easily be propagated to the summary annotation.
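The handling of rejected assertions described above can be sketched as follows. This is a minimal Python illustration under stated assumptions: the record layout, the function name and the placeholder references are hypothetical, and the string used for the ‘rejected’ term stands in for whatever label the CIO actually defines.

```python
# Stand-in for the CIO 'rejected' term; the exact label is an assumption.
REJECTED = "rejected"


def partition_rejected(lines):
    """Split evidence lines into those used for the summary and those that
    are kept for display only because they have been rejected."""
    kept = [line for line in lines if line["confidence"] != REJECTED]
    rejected = [line for line in lines if line["confidence"] == REJECTED]
    return kept, rejected


# Hypothetical evidence lines about one assertion; references are placeholders.
lines = [
    {"reference": "REF:A", "negated": False, "confidence": REJECTED},
    {"reference": "REF:B", "negated": True,
     "confidence": "high confidence from single evidence"},
    {"reference": "REF:C", "negated": True,
     "confidence": "medium confidence from single evidence"},
]

kept, rejected = partition_rejected(lines)

# Only the non-rejected lines decide whether the remaining evidence agrees.
congruent = len({line["negated"] for line in kept}) == 1

print("still displayed as rejected:", [line["reference"] for line in rejected])
print("remaining evidence congruent:", congruent)
```

The rejected line stays in the data and can be shown to users, but it no longer creates a conflict in the summary; re-running the same function after a line is re-evaluated propagates the change.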

An aspect that is not addressed by the guidelines provided here is the different levels at which confidence in assertions can be estimated: at the level of the experimental procedure; at the level of the author interpretation, as authors might have selected results in a way that does not allow an unbiased interpretation; and at the level of the annotator interpretation. For now, the basic terms from the ‘confidence statement from single evidence’ branch should be used to take all of these layers into account together. The CIO could be expanded if that turned out to be a need of the community. Possible solutions could be to capture the confidence at these different levels independently, or to modify the branch ‘confidence statement from single evidence’ for this purpose. We hope that the current work will promote discussions towards this aim, possibly through the associated tracker.

This issue is related to the definition of the provenance of an annotation. Data provenance aims at documenting the origin of data, but also annotation steps or task workflows (see http://www.w3.org/TR/prov-overview/). However, in the current state, documenting the full provenance of annotations might be overwhelming for most curation teams. The CIO is designed to be easily used in daily curation work, and to be as close as possible to the rating systems already adopted by several resources. Capturing the provenance of an annotation in accordance with W3C and other standards (e.g. linked data frameworks) should be a long-term goal; our current work represents a first step towards such structured capture.

We believe that the CIO, as well as the guidelines presented here, will allow end-users to better evaluate the pertinence of curated data. This is expected to enhance data dissemination across resources, as well as analyses based on curated data, thanks to the improved possibilities of filtering data, and of evaluating their trustworthiness.
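To make the filtering use case concrete, the sketch below groups the term labels listed in Table 2 by their interpretation and uses them to select annotations. It is only an illustration: the set and function names are hypothetical, labels are compared as lower-cased strings rather than by ontology identifiers, and terms that do not appear in Table 2 are simply left unclassified.

```python
# Term labels from Table 2, built from the two multiple-evidence branches.
BRANCHES = ("same type", "multiple types")

TRUSTED = {"high confidence from single evidence"} | {
    f"confidence statement from congruent evidence lines of {branch}, "
    f"overall confidence {level}"
    for branch in BRANCHES
    for level in ("high", "medium")
}

NEEDS_SUPPORT = (
    {"low confidence from single evidence"}
    | {f"confidence statement from strongly conflicting evidence lines of {branch}"
       for branch in BRANCHES}
    | {f"confidence statement from weakly conflicting evidence lines of {branch}, "
       f"overall confidence low"
       for branch in BRANCHES}
)


def interpretation(term_label):
    """Return the Table 2 interpretation of a term label, or None if unlisted."""
    label = term_label.strip().lower()
    if label in TRUSTED:
        return "assertion should be trusted"
    if label in NEEDS_SUPPORT:
        return "assertion needs additional support"
    return None


# Hypothetical annotations: (name, confidence term label).
annotations = [
    ("assertion A", "High confidence from single evidence"),
    ("assertion B",
     "Confidence statement from strongly conflicting evidence lines of same type"),
    ("assertion C", "Medium confidence from single evidence"),
]

trusted = [name for name, term in annotations
           if interpretation(term) == "assertion should be trusted"]
print(trusted)
```

In a real resource, matching on term identifiers and on the ontology hierarchy would likely be more robust than matching on labels; label matching is used here only to stay close to Table 2.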

Conclusion

With the growth in the number of available annotations, it has become essential to assess confidence in these annotations. This article is an attempt at defining guidelines for standardizing the exchange of confidence information, and at showing the feasibility of this approach.

We propose three basic principles: (i) while it is difficult to standardize parameters to define confidence in annotations across resources, it is possible to use a common ontology language to provide this information; (ii) in the same way that the GO guidelines recommend providing annotations at the level of each individual evidence line, a confidence statement might also be assigned to each single evidence line, using a basic rating system; (iii) when several evidence lines are available for an assertion, it is desirable to provide a global summary assertion, taking all evidence into account.

We created the CIO in order to illustrate these principles. We hope it might be a trigger for the biocuration community to address this need for standardizing confidence information. Whether this ontology undergoes major changes in the near future, following feedback from the community, or whether it is used ‘as is’ by several resources, we hope that annotation confidence will be increasingly available in biocuration efforts.

Acknowledgements

We thank Amos Bairoch for insightful comments provided in preparation of the workshop held at the Biocuration 2012 conference.

Funding

G.L.H. is funded by NSF DBI-1356193. S.P. is funded by the Swiss Federal Government through the State Secretariat for Education, Research and Innovation (SERI), and by the National Institutes of Health (NIH) grant 1 U41 HG006104. F.B.B., M.R.R., and A.N. are funded by the Swiss Federal Government through the State Secretariat for Education, Research and Innovation (SERI), the Swiss National Science Foundation (grant number 31003A_153341), SystemsX.ch (project AgingX), and Etat de Vaud. S.O. is funded by the BBSRC MIDAS grant (BB/L024179/1) and the European Commission grant Affinomics (FP7-241481). M.C.C. and M.G. are funded by the National Institutes of Health (R01 GM089636). Funding for open access charge: Etat de Vaud funding to M.R.R.

Conflict of interest. None declared.

References

1. Skunca, Altenhoff, Dessimoz (2012) Quality of computationally inferred gene ontology annotations. PLoS Comput. Biol., e1002533.
2. Schnoes A.M., Ream D.C., Thorman A.W., et al. (2013) Biases in the experimental annotations of protein function and their effect on our understanding of protein function space. PLoS Comput. Biol., e1003063.
3. Du Plessis, Skunca, Dessimoz (2011) The what, where, how and why of gene ontology–a primer for bioinformaticians. Brief. Bioinform., 723–735.
4. Chibucos M.C., Mungall C.J., Balakrishnan, et al. (2014) Standardized description of scientific evidence using the Evidence Ontology (ECO). Database, 2014, bau075. http://database.oxfordjournals.org/content/2014/bau075.long
5. Willighagen E.L., Waagmeester, Spjuth, et al. (2013) The ChEMBL database as linked open data. J. Cheminform.
6. Niknejad, Comte, Parmentier, et al. (2012) vHOG, a multispecies vertebrate ontology of homologous organs groups. Bioinformatics, 1017–1020.
7. Lane, Argoud-Puy, Britan, et al. (2012) neXtProt: a knowledge platform for human proteins. Nucleic Acids Res., D76–
8. The UniProt Consortium (2014) Activities at the Universal Protein Resource (UniProt). Nucleic Acids Res., D191–D198.
9. Furnham, Holliday G.L., de Beer T.A.P., et al. (2014) The Catalytic Site Atlas 2.0: cataloging catalytic sites and residues identified in enzymes. Nucleic Acids Res., D485–
10. Holliday G.L., Andreini, Fischer J.D., et al. (2012) MACiE: exploring the diversity of biochemical reactions. Nucleic Acids Res., D783–D789.
11. Pruitt K.D., Brown G.R., Hiatt S.M., et al. (2014) RefSeq: an update on mammalian reference sequences. Nucleic Acids Res., D756–D763.
12. Richardson E.J., Watson (2013) The automatic annotation of bacterial genomes. Brief. Bioinform.
13. Altman (2012) Mitigating threats to data quality throughout the curation lifecycle. In: Marciano, Lee, Bowden (eds). Curating For Quality: Ensuring Data Quality to Enable New Science. National Science Foundation, Arlington County, pp. –119.
14. Gaudet, Arighi, Bastian, et al. (2012) Recent advances in biocuration: Meeting Report from the fifth International Biocuration Conference. Database, 2012, bas036.
15. Kardong K.V. (2006) Vertebrates: comparative anatomy, function, evolution. McGraw-Hill, NY.
16. Bentley P.J. (1979) The vertebrate urinary bladder: osmoregulatory and other uses. Yale J. Biol. Med., 563–568.
17. Orchard, Kerrien, Abbani, et al. (2012) Protein interaction data curation: the International Molecular Exchange (IMEx) consortium. Nat. Methods, 345–350.
18. Nakayama, Izuta, Nakayama, et al. (2000) Depletion of the squalene synthase (ERG9) gene does not impair growth of Candida glabrata in mice. Antimicrob. Agents Chemother., 2411–2418.
19. Amemiya C.T., Alföldi, Lee A.P., et al. (2013) The African coelacanth genome provides insights into tetrapod evolution. Nature, 496, 311–316.
20. Brinkman, Courtot, Derom, et al. (2010) Modeling biomedical experimental processes with OBI. J. Biomed. Semant.
21. Camon E.B., Barrell D.G., Dimmer E.C., et al. (2005) An evaluation of GO annotation retrieval for BioCreAtIvE and GOA. BMC Bioinformatics, (Suppl 1), S17.
22. Poux, Magrane, Arighi C.N., et al. (2014) Expert curation in UniProtKB: a case study on dealing with conflicting and erroneous data. Database, 2014, bau016.