Abstract

Units are basic scientific tools that render meaning to numerical data. Their standardization and formalization caters for the report, exchange, process, reproducibility and integration of quantitative measurements. Ontologies are means that facilitate the integration of data and knowledge allowing interoperability and semantic information processing between diverse biomedical resources and domains. Here, we present the Units Ontology (UO), an ontology currently being used in many scientific resources for the standardized description of units of measurements.

Introduction

Scientific research crucially relies on quantitative measurements. Scientific findings, even if they exclusively include qualitative information about specific observations, have to rely in some form on quantitative measurements that enable the inference of the reported qualitative information. Quantitative measurements would be meaningless without specifying the units that were measured. For example, it would make little sense to a biologist to talk about the value of the weight of a mouse without specifying the units of this measurement, nor for a chemist to talk about the value of the ionization energy or the electron affinity of an atom without specifying their units.

Units are basic scientific tools that render meaning to numerical data. The value of a quantity is generally expressed as the product of a number and its associated unit. This unit then represents a reference of a particular example of that quantity that it is associated with, whereas the number is the ratio of the value of the quantity to the unit. It is arbitrary, which particular example of the reference quantity a unit would be, and as a result there are many different units that correspond to particular quantities.

Indeed, throughout our scientific endeavors, different types of units have been proposed and used. Even today, different countries or even regions use different kinds of unit systems. The standardization and formalization of units is vital for our ability to exchange, process and integrate quantitative data (1). In scientific research, standardized concepts cater for the ability of scientists to formulate theories, report their results and allow for the reproducibility of them. As a result, various efforts have been initiated to achieve the standardization of units. The prime example is the International System of Units or Système Internationale (SI), which was adopted by the Eleventh General Conference of Weights and Measures (Conferérence Générale des Poids et Mesures) in 1960 as a universal measuring system used in all areas of science (2). However, the adoption of a standard for units, such as the SI, is not sufficient to ensure the integration of quantitative information (3). Instead, a consistent method is required that enables both humans and machines to interpret the units occurring in a data set (4–6).

Within the biomedical community, one of the most successful strategies for achieving standardization and integration of biomedical knowledge, data and associated experiments was proposed more than a decade ago with the advent of the Gene Ontology (7). Since then, the biomedical community has invested a considerable amount of effort, research and resources in the development of ontologies that are now becoming and increasingly successful as information management and integration tools.

Here, we present the Units Ontology (UO), a comprehensive ontology for the standardization of units of measurement in the biomedical domain. The development of UO was initiated in 2005, as a part of the Phenotype and Trait Ontology (PATO) framework for describing qualitative and quantitative observations in biology (8) and aims to provide stable identifier for all units that are required by biomedical research projects. UO is continually updated and extended based on specific requests. The ontology is freely available in several formats and has been adopted by a wide range of research initiatives for the description of measurements, observations and hypotheses.

Materials and methods

Manual curation

The initial version of the UO was developed manually using the Open Biomedical Ontology (OBO)-Edit ontology editor (9). UO was refined and populated through a combination of literature research on units, based on existing annotations of measurements, as well as assays, personal communications with users of UO, as well as the domain knowledge of the ontology developers. The UO contains textual definitions for all its terms. Where possible, we provide links to the source of the definition.

Maintenance, release and availability

UO is maintained in a subversion repository and is made available through the OBO registry and our project website at http://unit-ontology.googlecode.com. Additionally, a term request tracker (http://code.google.com/p/unit-ontology/issues/list) and a discussion list (https://lists.sourceforge.net/lists/listinfo/obo-unit) allow users to suggest changes and request new features. UO is available in both the OBO Flatfile Format (10) and the Web Ontology Language (OWL) (11).

Although the UO is directly developed in the OBO Flatfile Format, a software tool generates several different OWL versions that are suitable for different application scenarios. The conversion tool is freely available on UO’s website. It is implemented in Groovy and uses the OWL Application Programming Interface (API) (12) to perform the conversion.

The distinctions between the different versions are based on the OWL treatment of units (i.e. whether they are classes or instances) and whether the PATO ontology of qualities is included or not. In particular, some applications only require identifiers for units but no links to qualities, and for these applications we generate an OWL version without these links. In particular, the link to PATO uses the OWL construct of ‘disjunction’ (logical ‘or’) when a unit may be the unit of more than one quality. The use of disjunction introduces non-determinism and commonly increases the computational complexity of key reasoning tasks (11, 13). As a result, such a formal representation may not be suitable for applications that rely on fast query times (14). The file uo-without-pato-references.owl contains a unit ontology without any references to qualities. Although this still permits inferences over units and their hierarchy, it is no longer possible to answer queries that return the qualities to which a unit belongs.

The second distinction in UO is whether to treat units as classes or as instances. In OWL, a class is a collection of things determined either by a set of constraints that the members of the class have to satisfy or by explicitly enumerating the class’ members. The members of a class are called its ‘instances’. There is some debate about whether units, such as ‘meter’, should be modelled as classes or instances. If a ‘meter’ is represented as a class, the question arises what the instances of ‘meter’ are. Instances of a class ‘meter’ could, e.g. be considered to be individual qualities (i.e. particular ‘length’ qualities). If ‘meter’ is an instance, only one ‘meter’ would exist and the question arises where it exists. For example, ‘meter’ as an instance could be considered an abstract entity.

The choice of representation is not only dependent on philosophical considerations, but also depends on the type of application in which an ontology is used. For example, some ontology browsers, particularly for biomedical ontologies, are only able to display classes but not individuals. Therefore, we generate several further OWL versions of UO: one version (uo-without-instances.owl) in which units are ‘subclasses’ of grouping classes, another (uo-without-units-as-classes.owl) in which units are ‘instances’ of grouping classes, and yet another (uo.owl) in which they are both and the classes are defined as ‘singleton’ classes. For example, ‘degree Celsius’ (UO:0000027) belongs to the ‘Temperature unit’ (UO:0000005) category, and we declare the following axioms:

  • in uo-without-instances.owl, we declare UO:0000027 SubClassOf: UO:0000005, in uo-without-units-as-classes.owl, we declare UO:0000027 InstanceOf: UO:0000005 and in uo.owl, we declare three axioms:

    • UO:0000027 SubClassOf: UO:0000005 and

    • UO:0000027 EquivalentTo: {UO:0000027}

    • UO:0000027 InstanceOf: UO:0000005,

In the file uo.owl, we use the identifier for ‘degree Celsius’ (UO:0000027) ‘both’ as an instance and as a class. In OWL 2, this feature has been introduced as ‘punning’ The use of punning allows the use of the same identifier for an instance and a class, in it enables us to treat units both as instances and classes. The axiom UO:0000027 EquivalentTo: {UO:0000027} then declares the class ‘degree Celsius’ to be equivalent to the class that can only have a single instance—‘degree Celsius’ (treated as an instance). When using an OWL reasoner capable of reasoning over instances and enumeration axioms, the third axiom is a consequence of the first two.

This tight integration between a class-view and an instance-view ensures that the two semantic representations can be converted into each other if desired. For example, if ‘temperature’ qualities are measured in ‘degree Celsius’ within some application, an axiom could be declared:

MyTemperature SubClassOf has-unit some UO:0000027

Every instance of the class ‘MyTemperature’ will then not only be an instance of has-unit some UO:0000027, but will also directly stand in a ‘has-unit’ relation to UO:0000027 (treated as an instance).

Results

UO

We provide the UO in several formats (OWL and OBO), and using different axioms in the OWL versions. However, the core terms of UO are common across all versions. Currently, UO includes 304 terms for units, types of units and prefixes. All terms have textual definitions. These definitions are consistent with those of the Unified Code for Units of Measure (UCUM) (3). Wherever possible, we use definitions from the National Institute of Standards and Technology (NIST) (2). Each term in the UO is uniquely identified by an Internationalized Resource Identifier (IRI) of the form: http://purl.obolibrary.org/OBO/UO\_nnnnnnn.

UO has two top-level classes, ‘unit’ (UO:0000000) and ‘prefix’ (UO:0000046). The ‘prefix’ class has 20 descendant classes that characterize unit prefixes such as ‘kilo’, ‘pico’ or ‘mega’. The subclasses of ‘unit’ distinguish between the qualities that are characterized by the units. For example, ‘length unit’ (UO:0000001) is a class that has, either as subclasses or instances, units measuring ‘length’.

Several units, such as ‘micrometer’ and ‘centimeter’, are based on the same unit (‘meter’) and distinguished by their prefix. To group units such as these together, we generate another grouping class, ‘meter-based unit’, that has as subclasses all units that are based on ‘meter’. These units are explicitly defined as having a prefix using the ‘has-prefix’ relation. For example, ‘centimeter’ (UO:0000015) is defined as a ‘meter-based unit’ (UO:1000008) that has as prefix ‘centi’ (UO:0000298). Based on the ‘has-prefix’ relation, the UO also provides some capabilities for defining new units by combing existing units with a prefix.

Alignment with PATO

The PATO was envisaged and designed to provide a platform for facilitating mutual understanding and interoperability of phenotype information across species and domains of knowledge among scientists and machines (8). PATO’s prime purpose is to integrate phenotype-related data and knowledge from literature, curated resources and representation methods. To achieve this goal, PATO provides a set of qualities, the basic entities that we can perceive and measure, such as weights, sizes or shapes, and combines them with the entities that are being observed in a phenotypic manifestation(8).

PATO distinguishes between the qualities that form the traits (e.g. colour, shape) and their values, which can be either qualitative (e.g. red, square) or quantitative (e.g. 650 nm, or 4 cm × 4 cm). UO is capable of providing a uniform representation of the units that are combined with the scalar PATO qualities and thereby, provide quantitative description of measurements associated with phenotype observations. For this purpose, PATO qualities are associated with appropriate units from UO via the unit_of relationship. For example, the PATO qualities ‘conductivity’ (PATO:0001585), which has two subclasses ‘electrical conductivity’ (PATO:0001757) and ‘heat conductivity’ (PATO:0001756) and ‘energy’ (PATO:0001021) are associated with the UO terms ‘electrical conduction unit’ (UO:0000262), ‘heat conduction unit’ (UO:0000263) and ‘energy unit’ (UO:0000111), respectively. The term ‘electrical conduction unit’ (UO:0000262) has children such as ‘siemens’ (UO:0000264), ‘heat conduction unit’ (UO:0000263) has children such as ‘watt per meter kelvin’ (UO:0000265) and ‘energy unit’ (UO:0000111) has children such as ‘joule’ (UO:0000112). These associations allow for the quantitative description of measurements. For example, it is now possible to describe, using the PATO framework, a measurement of an entity that has a particular ‘electrical conductivity’ measured in ‘siemens’. This mapping is demonstrated in Figure 1. The mapping between PATO scalar qualities and UO units makes it also possible, for some cases, to automatically infer, based on the unit ascribed to a particular measurement, the type of quality that the measurement refers to. This feature can be particularly useful, e.g. in the case of parsing mathematical models to extract metadata related to the model (15).

Schematic representation of an example of the mappings between PATO qualities and UO units. The figure is based on the OBO representation of UO in which units are treated as classes. Boxes in the figure represent classes and blue arrows represent subclass axioms between classes. If a grey arrow (labelled unit_of) connects the class A (from UO) and B (from PATO), then A SubClassOf: unit_of only B.
Figure 1

Schematic representation of an example of the mappings between PATO qualities and UO units. The figure is based on the OBO representation of UO in which units are treated as classes. Boxes in the figure represent classes and blue arrows represent subclass axioms between classes. If a grey arrow (labelled unit_of) connects the class A (from UO) and B (from PATO), then A SubClassOf: unit_of only B.

Application of UO

UO has been adopted, either directly or indirectly, by a large number of ontologies, markup languages, databases, standards initiatives, research project and applications. Here, we provide some examples that fall into different categories of application.

Association with other ontologies

UO is used by several ontologies allowing them to refer to units in a standardized manner. These ontologies either import the UO directly, such as the Ontology of Biomedical Investigations (OBI) (16), or select to include only the units applicable for their domain of interest. For example, the BioAssay Ontology (BAO) that serves as a foundation for the standardization of high-throughput screening assays (HTS) assays imports only the concentration unit and time unit terms from UO (17). Table 1 provides a list of examples of such ontologies.

Table 1

A list of examples of ontologies that directly or indirectly utilize UO

DomainOntology
Clinical and research investigationsOntology for Biomedical Investigations (OBI) (16)
Microarray experimentsMicroarray Gene Expression Data (MGED) (18)
BioassayBioAssay Ontology (BAO) (17)
Skeletal dysplasiaBone Dysplasia ontology (BDO) (19)
MeasurementUnits of Measurement Expressions (UOME)
ElectrophysiologyElectrophysiology Ontology
Cancer nanotechnologyNanoParticle Ontology (NPO) (20)
AgricultureCGIAR agricultural measurement unit ontology (21)
Adverse eventsAdverse Event Reporting ontology (AERO)
Mass spectrometryImaging Mass Spectrometry Ontology
Upper ontologyYAMATO—yet another more advances top-level ontology
ChemistryChemical Information Ontology (CHEMINF) (22)
Biological samplesexperimental factor ontology (EFO) (23)
Event-related potential (ERP)Neural ElectroMagnetic Ontologies (NEMO) (24)
BehaviourCognitive Paradigm Ontology (CogPO) (25)
Sleep medicineSleep Domain Ontology (SDO) (26)
DomainOntology
Clinical and research investigationsOntology for Biomedical Investigations (OBI) (16)
Microarray experimentsMicroarray Gene Expression Data (MGED) (18)
BioassayBioAssay Ontology (BAO) (17)
Skeletal dysplasiaBone Dysplasia ontology (BDO) (19)
MeasurementUnits of Measurement Expressions (UOME)
ElectrophysiologyElectrophysiology Ontology
Cancer nanotechnologyNanoParticle Ontology (NPO) (20)
AgricultureCGIAR agricultural measurement unit ontology (21)
Adverse eventsAdverse Event Reporting ontology (AERO)
Mass spectrometryImaging Mass Spectrometry Ontology
Upper ontologyYAMATO—yet another more advances top-level ontology
ChemistryChemical Information Ontology (CHEMINF) (22)
Biological samplesexperimental factor ontology (EFO) (23)
Event-related potential (ERP)Neural ElectroMagnetic Ontologies (NEMO) (24)
BehaviourCognitive Paradigm Ontology (CogPO) (25)
Sleep medicineSleep Domain Ontology (SDO) (26)
Table 1

A list of examples of ontologies that directly or indirectly utilize UO

DomainOntology
Clinical and research investigationsOntology for Biomedical Investigations (OBI) (16)
Microarray experimentsMicroarray Gene Expression Data (MGED) (18)
BioassayBioAssay Ontology (BAO) (17)
Skeletal dysplasiaBone Dysplasia ontology (BDO) (19)
MeasurementUnits of Measurement Expressions (UOME)
ElectrophysiologyElectrophysiology Ontology
Cancer nanotechnologyNanoParticle Ontology (NPO) (20)
AgricultureCGIAR agricultural measurement unit ontology (21)
Adverse eventsAdverse Event Reporting ontology (AERO)
Mass spectrometryImaging Mass Spectrometry Ontology
Upper ontologyYAMATO—yet another more advances top-level ontology
ChemistryChemical Information Ontology (CHEMINF) (22)
Biological samplesexperimental factor ontology (EFO) (23)
Event-related potential (ERP)Neural ElectroMagnetic Ontologies (NEMO) (24)
BehaviourCognitive Paradigm Ontology (CogPO) (25)
Sleep medicineSleep Domain Ontology (SDO) (26)
DomainOntology
Clinical and research investigationsOntology for Biomedical Investigations (OBI) (16)
Microarray experimentsMicroarray Gene Expression Data (MGED) (18)
BioassayBioAssay Ontology (BAO) (17)
Skeletal dysplasiaBone Dysplasia ontology (BDO) (19)
MeasurementUnits of Measurement Expressions (UOME)
ElectrophysiologyElectrophysiology Ontology
Cancer nanotechnologyNanoParticle Ontology (NPO) (20)
AgricultureCGIAR agricultural measurement unit ontology (21)
Adverse eventsAdverse Event Reporting ontology (AERO)
Mass spectrometryImaging Mass Spectrometry Ontology
Upper ontologyYAMATO—yet another more advances top-level ontology
ChemistryChemical Information Ontology (CHEMINF) (22)
Biological samplesexperimental factor ontology (EFO) (23)
Event-related potential (ERP)Neural ElectroMagnetic Ontologies (NEMO) (24)
BehaviourCognitive Paradigm Ontology (CogPO) (25)
Sleep medicineSleep Domain Ontology (SDO) (26)

Association with international projects

Several projects incorporate either directly or indirectly UO. One such example is the RICORDO project (27) that utilizes UO for the annotation of units in computational models. Table 2 provides some examples of such projects.

Table 2

A list of examples of international projects that directly or indirectly incorporate UO

DomainProject
EvolutionPhenoscape project (28)
PhysiologyCore Reference Datasets and Ontologies for the Virtual Physiological Human (RICORDO) (27)
Cardiac medicineThe CardioVascular Research Grid (CVRG) Project (29)
Personalized medicinep-medicine (30)
CancerCancer Biomedical Informatics Grid (CaBIG) (31)
DomainProject
EvolutionPhenoscape project (28)
PhysiologyCore Reference Datasets and Ontologies for the Virtual Physiological Human (RICORDO) (27)
Cardiac medicineThe CardioVascular Research Grid (CVRG) Project (29)
Personalized medicinep-medicine (30)
CancerCancer Biomedical Informatics Grid (CaBIG) (31)
Table 2

A list of examples of international projects that directly or indirectly incorporate UO

DomainProject
EvolutionPhenoscape project (28)
PhysiologyCore Reference Datasets and Ontologies for the Virtual Physiological Human (RICORDO) (27)
Cardiac medicineThe CardioVascular Research Grid (CVRG) Project (29)
Personalized medicinep-medicine (30)
CancerCancer Biomedical Informatics Grid (CaBIG) (31)
DomainProject
EvolutionPhenoscape project (28)
PhysiologyCore Reference Datasets and Ontologies for the Virtual Physiological Human (RICORDO) (27)
Cardiac medicineThe CardioVascular Research Grid (CVRG) Project (29)
Personalized medicinep-medicine (30)
CancerCancer Biomedical Informatics Grid (CaBIG) (31)

Association with standards initiatives

UO is included in many standardization efforts that refer to units. For example, the HUPO Proteomics Standards Initiative (PSI) (32) recommends ‘to use and contribute’ to the UO. Table 3 presents some examples of such Standards Initiatives.

Table 3

A list of examples of standards initiatives that utilize UO, either directly or indirectly

DomainStandard
Mass spectrometryHUPO Proteomics Standards Initiative Mass Spectrometry (33)
ChemistryChemical Entity Semantic Specification (CHESS) (34)
ProteomicsmzIdentML—standard format for proteomics spectrum identification algorithms results (32)
DomainStandard
Mass spectrometryHUPO Proteomics Standards Initiative Mass Spectrometry (33)
ChemistryChemical Entity Semantic Specification (CHESS) (34)
ProteomicsmzIdentML—standard format for proteomics spectrum identification algorithms results (32)
Table 3

A list of examples of standards initiatives that utilize UO, either directly or indirectly

DomainStandard
Mass spectrometryHUPO Proteomics Standards Initiative Mass Spectrometry (33)
ChemistryChemical Entity Semantic Specification (CHESS) (34)
ProteomicsmzIdentML—standard format for proteomics spectrum identification algorithms results (32)
DomainStandard
Mass spectrometryHUPO Proteomics Standards Initiative Mass Spectrometry (33)
ChemistryChemical Entity Semantic Specification (CHESS) (34)
ProteomicsmzIdentML—standard format for proteomics spectrum identification algorithms results (32)

Association with markup languages

A number of standardized markup languages use UO. One such example is the Systems Biology Pathway Exchange (SBPAX) data format. SBPAX is designed to store and organize quantitative modelling data (35). SBPAX uses the Units of Measurement Expressions (UOME) that references UO (35). GelML forms a data exchange format for representing gel electrophoresis experiments performed in proteomics investigations (36). GelML adopts sepCV (the controlled vocabulary developed by the PSI-Gel workgroup) and recommend that GelML should be used in conjunction with UO so as to standardize the naming of units (36). Table 4 depicts some examples of such markup languages.

Table 4

A list of examples of languages that directly or indirectly employ UO

DomainLanguage
Gel electrophoresisGel Electrophoresis Markup Language (GelML) (36)
ProteomicsTraML—standard exchange format for encoding transition lists (37)
SpectrometrymzML—standard exchange format for mass spectrometry data (38)
Biological pathwaysSystems Biology Pathway Exchange (SBPAX) (35)
DomainLanguage
Gel electrophoresisGel Electrophoresis Markup Language (GelML) (36)
ProteomicsTraML—standard exchange format for encoding transition lists (37)
SpectrometrymzML—standard exchange format for mass spectrometry data (38)
Biological pathwaysSystems Biology Pathway Exchange (SBPAX) (35)
Table 4

A list of examples of languages that directly or indirectly employ UO

DomainLanguage
Gel electrophoresisGel Electrophoresis Markup Language (GelML) (36)
ProteomicsTraML—standard exchange format for encoding transition lists (37)
SpectrometrymzML—standard exchange format for mass spectrometry data (38)
Biological pathwaysSystems Biology Pathway Exchange (SBPAX) (35)
DomainLanguage
Gel electrophoresisGel Electrophoresis Markup Language (GelML) (36)
ProteomicsTraML—standard exchange format for encoding transition lists (37)
SpectrometrymzML—standard exchange format for mass spectrometry data (38)
Biological pathwaysSystems Biology Pathway Exchange (SBPAX) (35)

Association with databases

UO has been incorporated by a variety of databases and their schemata. For example, Chado (39) is one of the most widely used database schema within the biomedical community. It is used to store information associated with genome sequence data and has recently been extended with the module called Natural Diversity module designed for storing phenotype data (39). Chado utilized UO for the descriptions of units. Table 5 presents some examples of such databases.

Table 5

A list of examples of databases that incorporate UO either directly or indirectly

DomainDatabase
PhenomesBiological Linked Open Data (BioLOD) (40)
ProteomicsGlobal Proteome Machine database (GPMDB) (41)
GenomicsChado—a database schema for genome sequence data (39)
Mammalian geneticsRIKEN integrated database of mammals (42)
PhenomicsPhenomeNet—a phenotype integration resource (43)
DomainDatabase
PhenomesBiological Linked Open Data (BioLOD) (40)
ProteomicsGlobal Proteome Machine database (GPMDB) (41)
GenomicsChado—a database schema for genome sequence data (39)
Mammalian geneticsRIKEN integrated database of mammals (42)
PhenomicsPhenomeNet—a phenotype integration resource (43)
Table 5

A list of examples of databases that incorporate UO either directly or indirectly

DomainDatabase
PhenomesBiological Linked Open Data (BioLOD) (40)
ProteomicsGlobal Proteome Machine database (GPMDB) (41)
GenomicsChado—a database schema for genome sequence data (39)
Mammalian geneticsRIKEN integrated database of mammals (42)
PhenomicsPhenomeNet—a phenotype integration resource (43)
DomainDatabase
PhenomesBiological Linked Open Data (BioLOD) (40)
ProteomicsGlobal Proteome Machine database (GPMDB) (41)
GenomicsChado—a database schema for genome sequence data (39)
Mammalian geneticsRIKEN integrated database of mammals (42)
PhenomicsPhenomeNet—a phenotype integration resource (43)

Association with applications

There are also several biomedical applications that utilize UO. For example, Phenex, a platform-independent desktop application designed to facilitate efficient and consistent annotation of phenotypic similarities and differences using the PATO framework, employs UO for the description of units assigned to the quantitative characters it records. Table 6 provides some examples of such applications.

Table 6

A list of examples of applications that directly or indirectly use UO

DomainApplication
Phenotype measurementsOntological Annotation of Phenotypic Diversity (Phenex) (44)
Systems biologyJava library for SBML (JSBML) (45)
DomainApplication
Phenotype measurementsOntological Annotation of Phenotypic Diversity (Phenex) (44)
Systems biologyJava library for SBML (JSBML) (45)
Table 6

A list of examples of applications that directly or indirectly use UO

DomainApplication
Phenotype measurementsOntological Annotation of Phenotypic Diversity (Phenex) (44)
Systems biologyJava library for SBML (JSBML) (45)
DomainApplication
Phenotype measurementsOntological Annotation of Phenotypic Diversity (Phenex) (44)
Systems biologyJava library for SBML (JSBML) (45)

Availability

The main ontology is available in both the OBO Flatfile Format (10) and the Web Ontology Language (OWL) (11) on our project website which can be reached at: http://unit-ontology.googlecode.com. Several OWL flavours of the UO ontology are available from our project website. The main ontology is also available from the OBO foundry (46), the BioPortal (47), the Ontology Lookup Service (OLS) (48) and the OntoBee (49).

Discussion and conclusion

UO was developed according to the OBO foundry principles (46) and it is part of the OBO ontologies suite. It has been widely adopted within the biomedical community by a large number of ontologies, markup languages, databases, standards initiatives, research project and applications and therefore, plays a central role in providing standardized access to biomedical data: it forms a framework that facilitates the standardization and formalization of units and is crucial for the exchange, processing and integration of quantitative data. UO is tightly integrated within the PATO framework (8) and facilitates the representation of quantitative phenotype measurements, whereas PATO is used to characterize the qualities that are being measured.

In the future, we will continue our effort to provide stable identifiers for units of measurement that are used in biomedical research, based on requests of UO’s user community. Furthermore, we plan to incorporate other unit systems such as the ‘Imperial System’, which is a system of units first defined in the British Weights and Measures Act (50), as well as the ‘United States customary units’, a system of measurements that contains similar units to the ‘Imperial System’ and is adopted in the USA (51). We also plan to provide a facility, such as webservice, that automatically converts between different units.

Funding

The European Commission’s 7th Framework Programme (RICORDO project—grant number 248502); National Institutes of Health (grant number R01 HG004838-02). Funding for open access charge: National Institutes of Health (grant number R01 HG004838-02).

Conflict of interest. none declared.

References

1
Rijgersberg
H
Wigham
M
Top
JL
How semantics can improve engineering processes: A case of units of measure and quantities
Adv. Eng Informatics
2010
, vol. 
25
 (pg. 
276
-
287
)
2
International Bureau of Weights and Measures and Taylor,B
The International System of Units (SI)
1991
 
Nist Special Publication (U.S. Department of Commerce, National Institute of Standards and Technology), Gaithersburg
3
Schadow
G
McDonald
CJ
Suico
JG
et al. 
Units of measure in clinical information systems
J. Am. Med. Inform. Assoc.
1999
, vol. 
6
 (pg. 
151
-
162
)
4
Gruber
TR
Olsen
G
Doyle
J
Torasso
P
Sandewall
E
An ontology for engineering mathematics
1994
Principles of Knowledge Representation and Reasoning: Proceedings of the 4th International Conference (KR ’94): Bonn, Germany
Morgan Kaufmann Publishers
(pg. 
258
-
269
Burlington, Massachusetts
5
Collins
JB
Openmath content dictionaries for si quantities and units
2009
Proceedings of the 16th Symposium, 8th International Conference. Held as Part of CICM ’09 on Intelligent Computer Mathematics
Springer, Berlin
(pg. 
247–
-
262
)
6
Harder
DW
Devitt
S
Units in mathml
2003
 
World Wide Web Consortium, W3C Working Group Note 10, http://www.w3.org/TR/2003/NOTE-mathml-units-20031110/ (5 August 2012, date last accessed)
7
Ashburner
M
Ball
CA
Blake
JA
et al. 
Gene ontology: tool for the unification of biology
Nat. Genet.
2000
, vol. 
25
 (pg. 
25
-
29
)
8
Gkoutos
GV
Green
EC
Mallon
A-MM
et al. 
Using ontologies to describe mouse phenotypes
Genome Biol.
2005
, vol. 
6
 pg. 
R8
 
9
Richter
JD
Harris
MAA
Haendel
M
et al. 
Obo-edit - an ontology editor for biologists
Bioinformatics
2007
, vol. 
23
 (pg. 
2198
-
2200
)
10
Horrocks
I
OBO flat file format syntax and semantics and mapping to OWL Web Ontology Language
Technical Report, University of Manchester
2007
 
http://www.cs.man.ac.uk/∼horrocks/obo/ (5 August 2012, date last accessed)
11
Grau
B
Horrocks
I
Borris
M
et al. 
OWL 2: The next step for OWL
Web Semant.
2008
, vol. 
6
 (pg. 
309
-
322
)
12
Horridge
M
Bechhofer
S
Noppens
O
Igniting the OWL 1.1 touch paper: The OWL API
2007
Proceedings of OWLED 2007: Third International Workshop on OWL Experiences and Directions
 
CEUR, vol. 258
13
Motik
B
Harrocks
I
Grau
BC
et al. 
OWL 2 Web Ontology Language: Profiles, World Wide Web Consortium (W3C) Recommendation
2009
 
http://www.w3.org/TR/2009/REC-owl2-profiles-20091027/ (5 August 2012, date last accessed)
14
Hoehndorf
R
Dumontier
M
Oellrich
A
et al. 
A common layer of interoperability for biomedical ontologies based on OWL EL
Bioinformatics
2011
, vol. 
27
 (pg. 
1001
-
1008
)
15
Hoehndorf
R
Dumontier
M
Gennari
JH
et al. 
Integrating systems biology models and biomedical ontologies
BMC Syst. Biol.
2011
, vol. 
5
 pg. 
124
 
16
Brinkman
RR
Courtot
M
Derom
D
et al. 
Modeling biomedical experimental processes with obi
J. Biomed. Semantics
2010
, vol. 
1
 
Suppl. 1
pg. 
S7
 
17
Visser
U
Abeyruwan
S
Vempati
U
et al. 
Bioassay ontology (bao): a semantic description of bioassays and high-throughput screening results
BMC Bioinformatics
2011
, vol. 
12
 pg. 
257
 
18
Whetzel
PL
Parkinson
H
Causton
HC
et al. 
The mged ontology: a resource for semantics-based description of microarray experiments
Bioinformatics
2006
, vol. 
22
 (pg. 
866
-
873
)
19
Groza
T
Hunter
J
Zankl
A
The bone dysplasia ontology: integrating genotype and phenotype information in the skeletal dysplasia domain
BMC Bioinformatics
2012
, vol. 
13
 pg. 
50
 
20
Thomas
DG
Pappu
RV
Baker
NA
Nanoparticle ontology for cancer nanotechnology research
J. Biomed. Inform.
2011
, vol. 
44
 (pg. 
59
-
74
)
21
Clark
WC
Tomich
TP
van Noordwijk
M
et al. 
Boundary work for sustainable development: Natural resource management at the consultative group on international agricultural research (cgiar)
Proc. Natl Acad. Sci. USA
2011
 
(5 August 2012, date last accessed)
22
Hastings
J
Chepelev
L
Willighagen
E
et al. 
The chemical information ontology: provenance and disambiguation for chemical data on the biological semantic web
PLoS One
2011
, vol. 
6
 pg. 
e25513
 
23
Malone
J
Holloway
E
Adamusiak
T
et al. 
Modeling sample variables with an experimental factor ontology
Bioinformatics
2010
, vol. 
26
 (pg. 
1112
-
1118
)
24
Frishkoff
G
Sydes
J
Mueller
K
et al. 
Minimal information for neural electromagnetic ontologies (minemo): A standards-compliant method for analysis and integration of event-related potentials (erp) data
Stand. Genomic Sci.
2011
, vol. 
5
 (pg. 
211
-
223
)
25
Turner
JA
Laird
AR
The cognitive paradigm ontology: design and application
Neuroinformatics
2012
, vol. 
10
 (pg. 
57
-
66
)
26
Sahoo
SS
Oqbuji
C
Luo
L
et al. 
Midas: automatic extraction of a common domain of discourse in sleep medicine for multi-center data integration
AMIA Annu. Symp. Proc.
2011
, vol. 
2011
 (pg. 
1196
-
1205
)
27
de Bono
B
Hoehndorf
R
Wimalaratne
S
et al. 
The ricordo approach to semantic interoperability for biomedical data and models: strategy, standards and solutions
BMC Res. Notes
2011
, vol. 
4
 pg. 
313
 
28
Dahdul
WM
Balhoff
JP
Engeman
J
et al. 
Evolutionary characters, phenotypes and ontologies: curating data from the systematic biology literature
PLoS One
2010
, vol. 
5
 pg. 
e10708
 
29
Winslow
RL
Saltz
J
Foster
I
et al. 
The cardiovascular research grid (cvrg) project
2011
Proceedings of the AMIA Summit on Translational Bioinformatics
(pg. 
77
-
81
AMIA, San Francisco
30
Rossi
S
Christ-Neumann
M
Ruping
S
et al. 
p-medicine: From data sharing and integration via vph models to personalized medicine
Ecancermedicalscience
2011
, vol. 
5
 pg. 
218
 
31
von Eschenbach
AC
Buetow
K
Cancer informatics vision: cabig
Cancer Inform.
2006
, vol. 
2
 (pg. 
22
-
24
)
32
Eisenacher
M
mzidentml: an open community-built standard format for the results of proteomics spectrum identification algorithms
Methods Mol. Biol.
2011
, vol. 
696
 (pg. 
161
-
177
)
33
Orchard
S
Hermjakob
H
Data standardization by the hupo-psi: how has the community benefitted?
Methods Mol. Biol.
2011
, vol. 
696
 (pg. 
149
-
160
)
34
Chepelev
LL
Dumontier
M
Chemical entity semantic specification: Knowledge representation for efficient semantic cheminformatics and facile data integration
J. Cheminform.
2011
, vol. 
3
 pg. 
20
 
35
Ruebenacker
O
Moraru
I
Schaff
J
et al. 
Integrating biopax pathway knowledge with sbml models
Syst. Biol.
2009
, vol. 
3
 (pg. 
317
-
328
)
36
Gibson
F
Hoogland
C
Martinez-Bartolome
S
et al. 
The gel electrophoresis markup language (gelml) from the proteomics standards initiative
Proteomics
2010
, vol. 
10
 (pg. 
3073
-
3081
)
37
Deutsch
EW
Chambers
M
Neumann
S
et al. 
Traml–a standard format for exchange of selected reaction monitoring transition lists
Mol. Cell. Proteomics
2012
, vol. 
11
 pg. 
R111.015040
 
38
Martens
L
Chambers
M
Sturm
M
et al. 
mzml–a community standard for mass spectrometry data
Mol. Cell. Proteomics
2011
, vol. 
10
 pg. 
R110.000133
 
39
Jung
S
Menda
N
Redmond
S
et al. 
The chado natural diversity module: a new generic database schema for large-scale phenotyping and genotyping data
Database
2011
, vol. 
2011
 pg. 
bar051
 
40
Paschke
A
Burger
A
Romano
P
et al. 
2011
 
Proceedings of the 4th International Workshop on Semantic Web Applications and Tools for the Life Sciences, SWAT4LS 2011, London, United Kingdom, December 07–09, 2011. ACM, Washington
41
Craig
R
Cortens
JP
Beavis
RC
Open source system for analyzing, validating, and storing protein identification data
J. Proteome Res.
2004
, vol. 
3
 (pg. 
1234
-
1242
)
42
Masuya
H
Makita
Y
Kobayashi
N
et al. 
The RIKEN integrated database of mammals
Nucleic Acids Res.
2011
, vol. 
39
 (pg. 
D861
-
D870
)
43
Hoehndorf
R
Schofield
PN
Gkoutos
GV
Phenomenet: a whole-phenome approach to disease gene discovery
Nucleic Acids Res.
2011
, vol. 
39
 pg. 
e119
 
44
Balhoff
JP
Dahdul
WM
Kothari
CR
et al. 
Phenex: ontological annotation of phenotypic diversity
PLoS One
2010
, vol. 
5
 pg. 
e10500
 
45
Drager
A
Rodriguez
N
Dumousseau
M
et al. 
Jsbml: a flexible java library for working with sbml
Bioinformatics
2011
, vol. 
27
 (pg. 
2167
-
2168
)
46
Smith
B
Ashburner
M
Rosse
C
et al. 
The OBO Foundry: coordinated evolution of ontologies to support biomedical data integration
Nat. Biotech.
2007
, vol. 
25
 (pg. 
1251
-
1255
)
47
Noy
NF
Shah
NH
Whetzel
PL
et al. 
Bioportal: ontologies and integrated data resources at the click of a mouse
Nucleic Acids Res.
2009
, vol. 
37
 (pg. 
W170
-
W173
)
48
Cote
R
Jones
P
Apweiler
R
et al. 
The ontology lookup service, a lightweight cross-platform tool for controlled vocabulary queries
BMC Bioinformatics
2006
, vol. 
7
 pg. 
97
 
49
Xiang
Z
Mungall
C
Ruttenberg
A
et al. 
Ontobee: A linked data server and browser for ontology terms
2011
Proceedings of International Conference on Biomedical Ontology
(pg. 
279–
-
281
CEUR-WS, Buffalo, NY, USA
50
Britannica Educational Publishing
Hosch
WL
The Britannica Guide to Numbers and Measurement
2010
 
The Rosen Publishing Group, New York
51
Barbrow
LE
Judson
LE
Weights and Measures Standards of the United States: A Brief History
1976
Gaithersburg
National Bureau of Standards Publications (NBS)
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.