On Friday, May 3, 2013 - version 1.0.0 of the OntologySummit2013_Communique was formally adopted by the community at the OntologySummit2013_Symposium (3S25)
- See the full text of the Communique below. (3S26)
- download a pdf version of the adopted communique - at: http://ontolog.cim3.net/file/work/OntologySummit2013/OntologySummit2013_Communique/OntologySummit2013_Communique_v1-0-0_20130503.pdf (3S28)
- we solicited the endorsement of this Communique by members of the broader community of ontologists, system and enterprise architects, software engineers and stakeholders of ontology, ontological analysis and ontology applications. (3S29)
- note that your endorsement will be made by you as an individual, and not as a representative of the organization(s) you are affiliated with. (3S2A)
- the solicitation for endorsement is now closed (it was open between May 5 and Jun 15, 2013). (3S2B)
- your endorsement shall apply to OntologySummit2013_Communique version 1.0.0 (as adopted at 10:30 EDT on 3-May-2013) plus any non-substantive edits which the co-lead editors have been empowered to make subsequently. (3S2D)
Ontology Summit 2013 Communique (3RUL)
Towards Ontology Evaluation across the Life Cycle (3RTB)
- Version 1.0.0 was adopted and released on 3-May-2013 10:30 am EDT / Gaithersburg, Maryland, USA. (3RTC)
- Current Version is: 1.0.5 (3RTD)
- v1.0.1 edits were made by FabianNeuhaus & AmandaVizedom / 2013.05.04 (3RTE)
- v1.0.2 edits were made by MatthewWest & PeterYim / 2013.05.08 (3S82)
- v1.0.3 edits were made by FabianNeuhaus & AmandaVizedom / 2013.05.29 (3TPI)
- v1.0.4 edits were made by FabianNeuhaus / 2013.05.31 (3TPJ)
- v1.0.5 edits were made by FabianNeuhaus / 2013.06.18 (3UDZ)
Lead Editors: FabianNeuhaus & AmandaVizedom (3RTF)
Co-Editors: KenBaclawski, MikeBennett, MikeDean, MichaelDenny, MichaelGruninger, AliHashemi, TerryLongstreth, LeoObrst, SteveRay, RamSriram, ToddSchneider, MarcelaVegetti, MatthewWest, PeterYim (3RTG)
Executive Summary (3RTH)
Problem (3RTI)
Currently, there is no agreed-upon methodology for the development of ontologies, and there is no consensus on how ontologies should be evaluated. Consequently, evaluation techniques and tools are not widely utilized in the development of ontologies. This can lead to ontologies of poor quality and is an obstacle to the successful deployment of ontologies as a technology. (3RUM)
Approach (3RTJ)
The goal of the Ontology Summit 2013 was to create guidance for ontology developers and users on how to evaluate ontologies. Over a period of four months a variety of approaches were discussed by participants, who represented a broad spectrum of ontology, software, and system developers and users. We explored how established best practices in systems engineering and in software engineering can be utilized in ontology development. (3RUN)
Results (3RTK)
This document focuses on the evaluation of five aspects of the quality of ontologies: intelligibility, fidelity, craftsmanship, fitness, and deployability. A model for the ontology life cycle is presented, and evaluation criteria are discussed in the context of the phases of the life cycle. We then discuss the availability of tools, and the document ends with observations and recommendations. Given the current level of maturity of ontology as an engineering discipline, any results on how best to build and evaluate ontologies have to be considered preliminary. However, the results achieved a broad consensus across the range of backgrounds, application foci, specialties, and experience found in the Ontology Summit community. (3RUO)
Recommendations (3RTL)
For more reliable success in ontology development and use, ontology evaluation should be incorporated across all phases of the ontology life cycle. Evaluation should be conducted against carefully identified requirements; these requirements depend on the intended use of the ontology and its operational environment. For this reason, we recommend the development of integrated ontology development and management environments that support the tracking of requirements for, and the evaluation of, ontologies across all phases of their development and use. (3RUP)
1. Purpose of this Document (3RTM)
The purpose of this document is to advance the understanding and adoption of ontology evaluation practices. Our focus is on the critical relationships between usage requirements, the life cycle of an ontology, evaluation, and the quality of the result. (3RUQ)
This document is rooted in the 2013 Ontology Summit. Over four months, Summit participants prepared and presented materials, shared references, suggested resources, discussed issues and materials by email list, and met virtually each week for presentations and discussions. This Summit had the focal topic "Ontology Evaluation across the Ontology Lifecycle." This document represents a synthesis of a subset of ideas presented, discussed, and developed over the course of these four months, and reflects the contributions of the Summit's participants and the consensus of the Summit community. (3RUR)
The intended audience for this document is, first and foremost, anyone who is developing or using ontologies currently, or who is on the cusp of doing so. We believe that the adoption of ontology evaluation as presented here has the potential to greatly improve the effectiveness of ontology development and use, and to make these activities more successful. Thus, our primary audience is the developers of ontologies and ontology-based systems. A secondary audience for this document is the community of software, systems, and quality assurance engineers. When ontologies are used in information systems, success depends in part on incorporation of ontology evaluation (and related activities) into the engineering practices applied to those systems and their components. (3RUS)
2. Introduction (3RTN)
Ontologies are human-intelligible and machine-interpretable representations of some portions and aspects of a domain. Since an ontology contains terms and their definitions, it enables the standardization of a terminology across a community or enterprise; thus, ontologies can be used as a type of glossary. Since ontologies can capture key concepts and their relationships in a machine-interpretable form, they are similar to domain models in systems and software engineering. And since ontologies can be populated with or linked to instance data to create knowledge bases, and deployed as parts of information systems for query answering, ontologies resemble databases from an operational perspective. (3RUT)
This flexibility of ontologies is a major advantage of the technology. However, flexibility also contributes to the challenge of evaluating ontologies. Ontology evaluation consists of gathering information about some properties of an ontology, comparing the results with a set of requirements, and assessing the ontology's suitability for some specified purpose. Some properties of an ontology can be measured independently of usage; others involve relationships between an ontology and its intended domain, environment, or usage-specific activity, and thus can only be measured with reference to some usage context. The variety of the potential uses of ontologies means that there is no single list of relevant properties of ontologies and no single list of requirements. Therefore, there is no single, universally-applicable approach to evaluating ontologies. (3RUU)
However, we can identify some kinds of evaluation that are generally needed. To determine the quality of an ontology, we need to evaluate the ontology as a domain model for human consumption, the ontology as a domain model for machine consumption, and the ontology as deployed software that is part of a larger system. In this document, we focus on five high-level characteristics:[1] (3RUV)
1. Can humans understand the ontology correctly? (Intelligibility) (3RUW)
2. Does the ontology accurately represent its domain? (Fidelity) (3RUX)
3. Is the ontology well-built and are design decisions followed consistently? (Craftsmanship) (3RUY)
4. Does the representation of the domain fit the requirements for its intended use? (Fitness) (3RUZ)
5. Does the deployed ontology meet the requirements of the information system of which it is part? (Deployability) (3RV0)
For intelligibility, it is not sufficient that ontologists can understand the content of the ontology; all intended users need to be able to understand the intended interpretation of the ontology elements (e.g., individuals, classes, relationships) that are relevant to their use-case. Intelligibility does not require that users view and understand the ontology directly. To enable intelligibility, the documentation of the ontology needs to be tailored to the different kinds of users. This may require multiple annotations of an element of the ontology suitable for different audiences (e.g., to accommodate language localization and polysemous use of terms across domains). Intelligibility is particularly important for ontologies that are used directly by humans as a controlled vocabulary. But it is also desirable for ontologies that are intended to be used "under the hood" of an information system, because these ontologies need to be maintained and reviewed by people other than the original ontology developers. Fidelity is about whether the ontology represents the domain correctly, both in the axioms and in the annotations that document the ontology for humans. Craftsmanship is concerned with whether the ontology is built carefully; this covers aspects ranging from the syntactic correctness of the ontology to the question of whether a philosophical choice (e.g., four-dimensionalism) has been implemented consistently. Both fitness and deployability depend on requirements for the intended usage. These requirements might be derived from the operational environment, in the case of an ontology that is deployed as part of an information system; alternatively, they may be derived from the goals of the knowledge representation project, if the ontology is deployed as a standalone reference ontology. While both characteristics are about meeting operational requirements, they are concerned with different aspects of the ontology: fitness is about the ontology as a domain model, deployability is about the ontology as a piece of software.[2] (3RV1)
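To make the point about audience-tailored annotation concrete, the sketch below attaches localized labels and a plain-language definition to a single class. This is a minimal illustration, assuming the Python rdflib library; the namespace and the class ex:Order are hypothetical.

```python
# Minimal sketch of audience-tailored annotation (hypothetical vocabulary).
from rdflib import Graph, Literal, Namespace, RDFS
from rdflib.namespace import SKOS

EX = Namespace("http://example.org/onto#")
g = Graph()
g.bind("ex", EX)

# Localized labels for end users in different language communities,
# plus a plain-language definition for reviewers and maintainers.
g.add((EX.Order, RDFS.label, Literal("order", lang="en")))
g.add((EX.Order, RDFS.label, Literal("Bestellung", lang="de")))
g.add((EX.Order, SKOS.definition,
       Literal("A buyer's request to purchase one or more products.", lang="en")))

print(g.serialize(format="turtle"))
```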
Since fitness and deployability are evaluated with respect to requirements for the intended use-case, a comprehensive look at ontology evaluation needs to consider how the requirements for the ontology derive from the requirements of the system that the ontology is a part of. Furthermore, although "ontology evaluation" can be understood as the evaluation of a finished product, we consider ontology evaluation to be an ongoing process during the life of an ontology. For these reasons, we embrace a broad view of ontology evaluation and discuss it in the context of expected usage and the various activities during ontology development and maintenance. In the next section we present a high-level breakdown of these activities, organized as goal-oriented phases in the ontology life cycle. Afterward we identify, for each phase, some of the activities that occur during that phase, its outputs, and what should be evaluated at that stage. The document concludes with some observations about current tool support for ontology evaluation, and recommendations for future work. (3RV2)
3. An Ontology Life Cycle Model (3RTO)
The life of any given ontology consists of various types of activities in which the ontology is being conceived, specified, developed, adapted, deployed, used, and maintained. Whether these activities occur in a sequence or in parallel, and whether certain kinds of activities (e.g., requirements development) only happen once during the life of an ontology or are cycled through repeatedly, depends partially on how the development process is managed. Furthermore, as discussed above, ontologies are used for diverse purposes; thus, there are certain kinds of activities (e.g., the adaptation of the ontology to improve computational performance of automatic reasoning) that are part of the development of some ontologies and not of others. For these reasons, there is no single ontology life cycle with a fixed sequence of phases. Any ontology life cycle model presents a simplified view that abstracts from the differences between the ways ontologies are developed, deployed, and used. (3RV3)
In spite of presenting a simplified view, an ontology life cycle model is useful because it highlights recognizable, recurring patterns that are common across ontologies. The identification of phases allows the clustering of activities around goals, inputs, and outputs of recognizable types. Furthermore, a life cycle model emphasizes that some of the phases are interdependent, because the effectiveness of certain activities depends on the outputs of others. For example, effective ontology development depends on the existence of identified ontology requirements; if requirements identification has been omitted or done poorly, ontology development is very unlikely to result in useful outputs. While the development process may vary considerably between two ontologies, there are invariant dependencies between phases. (3RV4)
Figure 1 presents the phases of an ontology life cycle model. Typically, an ontology will go through each of these phases more than once during its life. As the figure illustrates, evaluation should happen throughout the ontology life cycle, varying in focus, process, and intensity according to phase-appropriate requirements. In the following sections, each of the phases is discussed in more detail, including its inputs, outputs, relationships to the other phases, and phase-appropriate evaluation activities. The evaluation provides information about the degree to which the requirements of each life cycle phase are being met. Requirements identification is discussed in greater detail in the next section. (3RV5)
This model applies to ontologies regardless of whether their use involves significant machine processing of ontology content, because the systems into which ontologies are incorporated are information systems in a broad sense: systems of people, processes, hardware, software, and data that process information and make decisions.[3] For example, consider an ontology which is used by humans to curate documents. In this case the information system includes the ontology, the curators, and the tools they use to browse the ontology and to annotate the documents. The success of the whole system depends on the interaction of the ontology with both the other software components and the curators. For example, the whole system will be impaired if the ontology contains information that the browser cannot display properly, or if the definitions in the ontology are so ambiguous that different curators are not able to use the terms consistently in the curation process. Thus, the ontology needs to be evaluated for both deployability and intelligibility. This example illustrates that even if an ontology is not used for machine reasoning as part of a complex piece of software, its evaluation still depends on its intended use within a larger system. (3RV6)
4. Requirements Development Phase (3RTP)
The purpose of this phase is to establish understanding, context, scope, and initial requirements. Development of adequate requirements is critical to the success of any ontology development and usage. Most evaluation activities that are presented in the next sections depend on the results of this phase. (3RV7)
During the requirements development phase, expected or intended usages and interpretations are elicited and examined, and initial requirements are derived. Typically, an intended usage is initially understood from a business[4] perspective. The intended usage may be specified as use-cases or scenarios; at early stages, requirements may be captured only as brief statements of one or more business needs and constraints. In many cases only some aspects of usage are addressed, and requirements development may include gathering information about other aspects that are significant for ontology analysis and design.[5] (3RV8)
One important way to specify requirements is by using competency questions: questions that the ontology must be able to answer.[6] These questions are formulated in a natural language, often as kinds of queries that the ontology should support in given scenarios. (3RV9)
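The sketch below pairs a natural-language competency question with the kind of query it might eventually be formalized as (formalization proper happens later in the life cycle; see section 8.2). It is a minimal illustration, assuming the Python rdflib library; the file name and the vocabulary (ex:Order, ex:shippingMethod) are hypothetical.

```python
from rdflib import Graph

g = Graph()
g.parse("orders.ttl", format="turtle")  # hypothetical ontology + instance data

# Competency question (natural language):
#   "Which orders are shipped by air?"
# One possible formalization as a SPARQL query:
results = g.query("""
    PREFIX ex: <http://example.org/onto#>
    SELECT ?order WHERE {
        ?order a ex:Order ;
               ex:shippingMethod ex:Air .
    }
""")
for row in results:
    print(row.order)
```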
The output of the requirements development phase is a document that should answer the following questions: (3RVA)
- Why is this ontology needed? (What is the rationale? What are the expected benefits?) (3RVB)
- What is the expected or intended usage (e.g., specified as use-cases, scenarios)? (3RVC)
- Which groups of users need to understand which parts of the ontology? (3RVD)
- What is the scope of the ontology? (3RVE)
- Are there existing ontologies or standards that need to be reused or adopted? (3RVF)
- What are the competency questions? (3RVG)
- Are the competency questions representative of all expected or intended usages? (3RVH)
- What are the requirements from the operational environment? (3RVI)
- What resources need to be considered during the ontology and system design phases (e.g., legacy databases, test corpora, data models, glossaries, vocabularies, schemas, taxonomies, ontologies, standards, access to domain experts)? (3RVJ)
5. Ontological Analysis Phase (3RTQ)
The purpose of the ontological analysis phase is to identify the key entities of the ontology (individuals, classes, and the relationships between them), as well as to link them to the terminology that is used in the domain. This usually involves the resolution of ambiguity and the identification of entities that are denoted by different terms across different resources and communities. This activity requires close cooperation between domain experts and ontologists, because it requires both knowledge about the domain and knowledge about important ontological distinctions and patterns. (3RVK)
The results are usually captured in some informal way, understandable to both ontologists and domain experts. One way of specifying the output of ontological analysis is by a set of sentences in a natural language, which are interpreted in the same way by the involved subject matter experts and ontologists. The ontologists apply their knowledge of important ontological distinctions and relationships to elicit such sentences that capture the information needed to guide the ontology design.[7] Ontological analysis outputs can also be captured in diagrams (e.g., concept maps, UML diagrams, trees, freehand drawings). (3RVL)
The output of the ontological analysis phase, whatever the method of capture, should include specification of: (3RVM)
- significant entities within the scope of the intended usage. (3RVN)
- important characteristics of the entities, including relationships between them, disambiguating characteristics, and properties important to the domain and activities within the scope of the intended usage (3RVO)
- the terminology used to denote those entities, with enough contextual information to disambiguate polysemous terms. (3RVP)
These results provide input to ontology design and development. In addition, these results provide detail with which high-level requirements for ontology design and development phases can be turned into specific, evaluable requirements. (3RVQ)
Evaluating Ontological Analysis Results: Questions to be Answered (3RTR)
The output of an ontological analysis phase should be evaluated according to the following high-level criteria, assisted in detail by the outputs of requirements development: (3RVR)
- Are all relevant terms from the use cases documented? (3RVS)
- Are all entities within the scope of the ontology captured? (3RVT)
- Do the domain experts agree with the ontological analysis? (3RVU)
- Is the documentation sufficiently unambiguous to enable a consistent use of the terminology? (3RVV)
6. Ontology Design Phase (3RTS)
In the ontology design phase, a design[8] is developed, based on the outputs from the requirements development and the ontological analysis. In particular, representation languages are chosen for the ontology and for queries (these may be identical). Further, the structure of the ontology is determined. Structural choices include whether and how the ontology is separated into modules and how the modules are integrated. As part of the structural design, it may be decided that some existing ontologies are reused as modules. The intended behavior of the modules may be captured by competency questions. These module-specific competency questions are often derived from the ontology-wide competency questions. (3RVW)
Design phase activities include the determination of design principles and of top-level classes in the ontology. The top-level classes are the classes in the ontology that are at the highest level of the subsumption hierarchy. (In the case of OWL ontologies, these are the direct children of owl:Thing.) These classes determine the basic ontological categories of the ontology. Together, the top-level categories and the design principles determine whether and how some fundamental aspects of reality are represented (e.g., change over time). The design principles may also restrict representation choices by the developers (e.g., by enforcing single inheritance for subsumption). (3RVX)
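The sketch below lists the asserted top-level classes of an OWL ontology, i.e., the named classes with no asserted named superclass other than owl:Thing. It is a minimal illustration, assuming the Python rdflib library; the file name is hypothetical, and only the asserted hierarchy is examined (a reasoner would give the inferred answer).

```python
from rdflib import BNode, Graph
from rdflib.namespace import OWL, RDF, RDFS

g = Graph()
g.parse("myontology.owl")  # hypothetical file name

top_level = set()
for cls in g.subjects(RDF.type, OWL.Class):
    if isinstance(cls, BNode):
        continue  # skip anonymous class expressions
    # Named superclasses other than owl:Thing; unnamed (restriction)
    # superclasses are ignored in this asserted-hierarchy approximation.
    supers = {s for s in g.objects(cls, RDFS.subClassOf)
              if s != OWL.Thing and not isinstance(s, BNode)}
    if not supers:
        top_level.add(cls)

for cls in sorted(top_level):
    print(cls)
```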
One way to make these design decisions is to use an existing upper ontology. Upper or foundational ontologies (e.g., DOLCE, BFO, or SUMO) are reusable, varyingly comprehensive ontology artifacts that specify the basic ontological categories, relationships between them, and some methodological decisions about how to represent reality. Other approaches (e.g., OntoClean) rely on the systematic representation of logical and philosophical properties of classes and relationships. There are efforts (e.g., in the NeOn project) to capture design decisions in the form of design patterns, and share them with the community.[9] (3RVY)
The results of the design decisions in this phase lead to additional requirements for the ontology. Some of these requirements concern characteristics entirely internal to the ontology itself (e.g., single inheritance for subsumption or distinction between rigid, anti-rigid, and non-rigid classes). Many of these requirements can be understood and evaluated using technical, ontological understanding, without further input of usage-specific or domain-specific information. (3RVZ)
Note that there might be conflicting requirements for the expressivity of the ontology language and its performance (see system design phase). Such tension can be addressed by distinguishing between, and developing, separate reference and operational ontologies. A reference ontology is one which captures the domain faithfully, to the extent required by the intended or expected usage(s), and in a language that is expressive enough for that purpose. An operational ontology is one that is adapted from a reference ontology, potentially incorporating compromises in representation for the sake of performance. The two types of ontologies will be discussed further in the ontology development and reuse section. (3RW0)
Evaluating Ontology Design Results: Questions to be Answered (3RTT)
- Is the chosen ontology language expressive enough to capture the knowledge in sufficient detail to meet the ontology requirements? (3RW1)
- Is the chosen query language expressive enough to formalize the competency questions? (3RW2)
- Does the chosen language support all required ontology capabilities? (For example, if the ontology is to support probabilistic reasoning, does the language enable the representation of probabilistic information?) (3RW3)
- Is every individual or class that has been identified in the ontological analysis phase either an instance or a subclass of some top-level class? (3RW4)
- Are naming conventions specified and followed? (A checking sketch follows this list.) (3RW5)
- Does the design call for multiple, distinct ontology modules? If so, do the ontology modules together cover the whole scope of the ontology? (3UDT)
- Are all modules of the ontology associated with (informal) competency questions? (3UDU)
- Does the design avoid addition of features or content not relevant to satisfaction of the requirements? (3UDV)
- For each module, is it specified what type of entities are represented in the module (the intended domain of quantification)? (3UE0)
- For each module, is it specified how it will be evaluated and who will be responsible? (3UE1)
- Does the design specify whether and how existing ontologies will be reused? (3UE2)
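The naming-convention question above is one of the easiest to automate. The sketch below is a minimal illustration, assuming the Python rdflib library; the CamelCase/camelCase convention, the regular expressions, and the file name are illustrative assumptions, not prescriptions.

```python
import re
from rdflib import BNode, Graph
from rdflib.namespace import OWL, RDF

# Example convention: CamelCase class names, camelCase property names.
CLASS_RE = re.compile(r"^[A-Z][A-Za-z0-9]*$")      # e.g., ShippingMethod
PROPERTY_RE = re.compile(r"^[a-z][A-Za-z0-9]*$")   # e.g., shippingMethod

def local_name(iri):
    """Return the fragment or last path segment of an IRI."""
    return re.split(r"[#/]", str(iri))[-1]

g = Graph()
g.parse("myontology.owl")  # hypothetical file name

for cls in g.subjects(RDF.type, OWL.Class):
    if not isinstance(cls, BNode) and not CLASS_RE.match(local_name(cls)):
        print("class violates naming convention:", cls)

for prop in g.subjects(RDF.type, OWL.ObjectProperty):
    if not PROPERTY_RE.match(local_name(prop)):
        print("property violates naming convention:", prop)
```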
7. System Design Phase (3RTU)
Information system design as a general activity is its own field of practice, and there is no need to re-invent or summarize it here. There is, however, a need to emphasize the interdependence of ontology design and system design for ontologies that are intended to be used as components of an information system. During system design, decisions are made that lead to requirements for the capabilities and implementation of the ontology and its integration within the larger information system. This interdependency is often underestimated, which leads to poor alignment between the ontology and the larger system it is part of, and thus, to greater risk of failure in ontology and system use. (3RWC)
The output of the system design phase should answer such questions as: (3RWD)
- What operations will be performed, using the ontology, by other system components? What components will perform those operations? How do the business requirements identified in the requirements development phase apply to those specific operations and components? (3RWE)
- What, if any, inputs or changes to the ontology will there be, once the system is deployed? (3RWF)
- What interfaces (between machines or between humans and machines) will enable those inputs? How will these interfaces be tested with respect to the resulting, modified ontology? What requirements will need to be met? (3RWG)
- What data sources will the ontology be used with? How will the ontology be connected to the data sources? What separate interfaces, if any, are needed to enable access to those connections? (3RWH)
- How will the ontology be built, evaluated, and maintained? What tools are needed to enable the development, evaluation, configuration management, and maintenance of the ontology? (3RWI)
- If modularity and/or collaborative development of the ontology are indicated, how will they be supported? (3RWJ)
Evaluating System Design Results: Questions to be Answered (3RTV)
The bulk of system design requirements will derive from systems design principles and methodologies in general, and are thus out of the scope of this document. We emphasize here the often unmet need to explicitly recognize the ontology as a component of the system and to evaluate the system design accordingly: (3RWK)
- Does the system design answer the questions listed just above? (3RWL)
8. Ontology Development Phase (3RTW)
The ontology development phase consists of four major activities: informal modeling, formalization of competency questions, formal modeling, and operational adaptation (each of which is described below). These activities are typically cycled through repeatedly, both for individual modules and for the ontology as a whole. In practice, these activities are often performed without obvious transitions between them. Nevertheless, it is important to separate them conceptually, since they have different prerequisites, depend on different types of expertise, and lead to different outputs, which are evaluated in different ways. (3RWM)
The ontology development phase covers both new ontology development and ontology reuse, despite differences between these activities. We do not consider new development and reuse to be part of different phases, for the following reasons: the successful development, or selection and adaptation, of an ontology into an information system is possible only to the extent that the ontology meets the requirements of the expected or intended usage. Thus, whether an ontology is developed entirely from scratch, re-used from existing ontologies, or a combination of the two, good results depend on identification of ontology requirements, an ontological analysis, and the identification of ontology design requirements. Furthermore, the integration of the ontology into the broader information system, its deployment and its usage are not altered in substance by the ontology's status as new or reused. The ontology is evaluated against the same set of requirements, regardless of whether it is reused or newly developed. Therefore, from a high-level perspective, both newly-developed and reused ontologies play the same role within the ontology life cycle. (3RWN)
8.1 Informal Modeling (3RTX)
During informal modeling, the result of the ontological analysis is refined. Thus, for each module, the relevant entities (individuals, classes, and their relationships) are identified and the terminology used in the domain is mapped to them. Important characteristics of the entities might be documented (e.g., the transitivity of a relationship, or a subsumption between two classes). The results are usually captured in some informal way (e.g., concept maps, UML diagrams, natural language text). (3RWO)
Evaluating Informal Modeling Results: Questions to be Answered (3RTY)
All evaluation criteria from the ontological analysis phase apply to informal modeling, with the addition of the following: (3RWP)
- Does the model capture only entities within the specified scope of the ontology? (3RWQ)
- Are the defined classes and relationships well-defined? (e.g., no formal definition of a term should use the term to define itself; a checking sketch follows this list) (3RWR)
- Is the intended interpretation of the undefined individuals, classes, and relationships well-documented? (3RWS)
- Are the individuals, classes, and relationships documented in a way that is easily reviewable by domain experts? (3RWT)
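The circularity check above can be partially automated even at this informal stage. The sketch below is a minimal, heuristic illustration in Python; the term/definition pairs are hypothetical stand-ins for definitions extracted from the informal model.

```python
import re

# Illustrative {term: definition} pairs.
definitions = {
    "order": "An order is a request a buyer submits when placing an order.",
    "pick report": "A report listing the items to be picked for shipment.",
}

for term, definition in definitions.items():
    # Heuristic: the first mention may be the definiendum itself
    # ("An order is ..."); any later use of the term is suspicious.
    occurrences = re.findall(rf"\b{re.escape(term)}\b", definition,
                             flags=re.IGNORECASE)
    if len(occurrences) > 1:
        print(f"possible circular definition: {term!r}")
# prints: possible circular definition: 'order'
```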
8.2 Formalization of Competency Questions (3RTZ)
Based on the results of the informal modeling, the scenarios and competency questions are formalized. This formalization of competency questions might involve revising the old competency questions and adding new ones. (3RWU)
Evaluating Formal Competency Questions: Questions to be Answered (3RU0)
- Are the competency questions representative of all intended usages? (3RWV)
- Does the formalization capture the intent of the competency question appropriately? (3RWW)
8.3 Formal Modeling (3RU1)
During formal modeling, the content of the informal model is captured in some ontology language (e.g., Common Logic, OWL 2 DL), and then fleshed out with axioms. The resulting reference ontology represents the domain appropriately (fidelity), adheres to the design decisions made in the ontology design phase (craftsmanship), and is supposed to meet the requirements for domain representation (fitness). This is either achieved by creating a new ontology module from scratch or by reusing an existing ontology and, if necessary, adapting it. (3RWX)
Evaluating Formal Modeling Results: Questions to be Answered (3RU2)
The ontology that is developed by the formal modeling activity or is considered for reuse is evaluated in three respects: whether the domain is represented appropriately (fidelity); whether the ontology is well-built and follows the decisions from the ontology design phase (craftsmanship); and whether the representation meets the requirements for its intended use (fitness). (3RWY)
Evaluating Fidelity (3RU3)
Whether the domain is represented accurately in an ontology depends on three questions: Are the annotations of ontology elements (e.g., classes, properties, axioms) that document their intended interpretation for humans (e.g., definitions, explanations, examples, figures) correct? Are all axioms within the ontology true with respect to the intended level of granularity and frame of reference (universe of quantification)? Are the documentation and the axioms in agreement? (3RWZ)
Since the evaluation of fidelity depends on some understanding of the domain, it ultimately requires review of the content of the ontology by domain experts.[10] However, there are some automated techniques that support the evaluation of fidelity. For example, one can evaluate the ontology for logical consistency, check whether automatically generated models of the ontology meet the intended interpretations,[11] or compare the intrinsic structure of the ontology to that of other ontologies (or different versions of the same ontology) that overlap in scope. (3RX0)
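The first of these automated techniques, a logical consistency check, is sketched below, assuming the Python owlready2 library and its bundled HermiT reasoner; the file name is hypothetical. Note that an inconsistency proves a defect, but consistency alone does not establish fidelity.

```python
from owlready2 import (OwlReadyInconsistentOntologyError, get_ontology,
                       sync_reasoner)

onto = get_ontology("file://myontology.owl").load()  # hypothetical file

try:
    with onto:
        sync_reasoner()  # runs the bundled HermiT reasoner
    print("ontology is logically consistent")
except OwlReadyInconsistentOntologyError:
    print("ontology is inconsistent: the axioms cannot all be true together")
```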
Evaluating Craftsmanship (3RU4)
In any engineering discipline, craftsmanship covers two separate, but related aspects. The first is whether a product is well-built in a way that adheres to established best practices. The second is whether design decisions that were made are followed in the development process. Typically, the design decisions are intended to lead to a well-built product, so the second aspect feeds into the first. Since ontology engineering is a relatively young discipline, there are relatively few examples of universally accepted criteria for a well-built ontology (e.g., syntactic well-formedness, logical consistency and the existence of documentation). Thus, the craftsmanship of an ontology needs to be evaluated largely in light of the ontological commitments, design decisions, and methodological choices that have been embraced within the ontology design phase. (3RX1)
One approach to evaluating craftsmanship relies on an established upper ontology or ontological meta-properties (such as rigidity, identity, unity, etc.), which are used to gauge the axioms in the ontology. Tools that support the evaluation of craftsmanship often examine the intrinsic structure of an ontology. This kind of evaluation technique draws upon mathematical and logical properties such as logical consistency, graph-theoretic connectivity, model-theoretic interpretation issues, inter-modularity mappings and preservations, etc. Structural metrics include branching factor, density, counts of ontology constructs, averages, and the like.[12] (3RX2)
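Two of the structural metrics mentioned above, construct counts and the average branching factor of the asserted subsumption hierarchy, are sketched below, assuming the Python rdflib library; the file name is hypothetical.

```python
from collections import Counter
from rdflib import BNode, Graph
from rdflib.namespace import OWL, RDF, RDFS

g = Graph()
g.parse("myontology.owl")  # hypothetical file name

n_classes = len(set(g.subjects(RDF.type, OWL.Class)))
n_properties = len(set(g.subjects(RDF.type, OWL.ObjectProperty)))

# Direct-subclass counts per named superclass (asserted hierarchy only).
children = Counter(sup for sub, sup in g.subject_objects(RDFS.subClassOf)
                   if not isinstance(sup, BNode))
avg_branching = (sum(children.values()) / len(children)) if children else 0.0

print("classes:", n_classes)
print("object properties:", n_properties)
print("average branching factor:", round(avg_branching, 2))
```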
Evaluating Fitness (3RU5)
The formalized competency questions and scenarios are one source of evidence regarding fitness. These competency questions are used to query corresponding ontology modules and the whole ontology. Successful answers to competency questions provide evidence that the ontology meets the model requirements that derive from query-answering based functionalities of the ontology. The ability to successfully answer competency question queries is not the same as fitness, but, depending on the expected usage, it may be a large component of it. (3RX3)
Fitness can also be evaluated by performing a sample or approximation of system operations, using the ontology in a test environment and/or over a test corpus. For example, if the ontology is required to support automated indexing of documents with ontology terms, then fitness may be evaluated by running an approximation of the document analysis and indexing system, using the ontology in question, over a test corpus. There are various ways of assessing the results, for example, by comparison to a gold standard or by review of results by domain experts, with performance measured by some suitably defined notions of recall and precision. The extent to which the results are attributable to the ontology, versus other aspects of the system, can be identified to a certain extent by comparing results obtained with the same indexing system but different ontologies. (3RX4)
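The scoring step can be made precise with the usual definitions: precision is the fraction of system annotations that appear in the gold standard, and recall is the fraction of gold-standard annotations that the system found. The sketch below is a minimal Python illustration; the annotation sets are hypothetical stand-ins for real indexing output and expert judgments.

```python
def precision_recall(system: set, gold: set):
    """precision = |system & gold| / |system|; recall = |system & gold| / |gold|"""
    hits = system & gold
    precision = len(hits) / len(system) if system else 0.0
    recall = len(hits) / len(gold) if gold else 0.0
    return precision, recall

# Hypothetical (document, ontology-term) annotations.
system_index = {("doc1", "Order"), ("doc1", "ShippingMethod"), ("doc2", "Order")}
gold_index = {("doc1", "Order"), ("doc2", "Order"), ("doc2", "Invoice")}

p, r = precision_recall(system_index, gold_index)
print(f"precision={p:.2f} recall={r:.2f}")  # precision=0.67 recall=0.67
```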
8.4 Operational Adaptation (3RU6)
During operational adaptation, the reference ontology is adapted to the operational requirements, resulting in an operational ontology. One particular concern is whether the deployed ontology will be able to respond in a time-frame that meets its performance requirements. This may require a paring-down of the ontology and other optimization steps (e.g., restructuring of the ontology to improve performance). For example, it might be necessary to trim an OWL DL ontology to its OWL EL fragment to meet performance requirements. (3RX5)
In some cases the operational ontology uses a different ontology language with a different semantics (e.g., if the application-specific reasoning does not observe the full first-order logic or description logic Open World Assumption, but instead interprets the negations in the ontology under a Closed World assumption). (3RX6)
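The semantic shift can be seen in miniature below: SPARQL's FILTER NOT EXISTS behaves in a closed-world fashion, reporting the order with no asserted shipping method, whereas a description-logic reasoner operating under the Open World Assumption would draw no such negative conclusion from the same data. This is a minimal illustration, assuming the Python rdflib library; the vocabulary is hypothetical.

```python
from rdflib import Graph

g = Graph()
g.parse(data="""
    @prefix ex: <http://example.org/onto#> .
    ex:order1 a ex:Order ; ex:shippingMethod ex:Air .
    ex:order2 a ex:Order .
""", format="turtle")

results = g.query("""
    PREFIX ex: <http://example.org/onto#>
    SELECT ?order WHERE {
        ?order a ex:Order .
        FILTER NOT EXISTS { ?order ex:shippingMethod ?m }
    }
""")
for row in results:
    print("no shipping method asserted:", row.order)  # only ex:order2
```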
Evaluating Operational Adaptation Results: Questions to be Answered (3RU7)
- Does the model support operational requirements (e.g., performance, precision, recall)? (3RX7)
9. System Development and Integration Phase (3RU8)
In this phase the system is built according to the design specified in the design phase. If system components other than the ontology need to be built or otherwise acquired, processes for doing so can occur more or less in parallel to the ontology development phase. Of course, tools and components necessary to the activities in the ontology development phase should be in place as ontology development begins; e.g., ontology development environments, version control systems, collaboration and workflow tools. The system development and integration phase concerns the integration of the ontology and other components into subsystems as called for and into a system as specified in the system design phase. (3RX8)
The system development and integration phase is discussed as part of the ontology life cycle because in a typical application, the functionalities supported by the ontology are realizable not by interaction with the ontology alone, but by processes carried out by some combination of the ontology and other components and/or subsystems. Thus, whether the ontology meets the full range of requirements can only be accurately evaluated once such interaction can be performed and results produced.[13] (3RX9)
Evaluating System Development Results: Questions to be Answered (3RU9)
The bulk of system development requirements will derive from systems development principles and methodologies in general, and are thus out of the scope of this document. We emphasize here the often unmet need to explicitly recognize the ontology as a component of the system and to evaluate the system development results accordingly. Specifically: (3RXA)
- Does the system achieve successful integration of the ontology, as specified in the system design? (3RXB)
- Does the system meet all requirements that specifically relate to the integrated functioning of the ontology within the system? (3RXC)
10. Deployment Phase (3RUA)
In this phase, the ontology goes from the development and integration environment to an operational, live use environment. Deployment usually occurs after some development cycle(s) in which an initial ontology, or a version with some targeted improvement or extension, has been specified, designed, and developed. As described above, the ontology will have undergone evaluation repeatedly and throughout the process to this point. Nevertheless, there may be an additional round of testing once an ontology has passed through the development and integration phases and been deemed ready for deployment by developers, integrators, and others responsible for those phases. This additional, deployment-phase evaluation may or may not differ in nature from evaluation performed across other life cycle stages; it may be performed by independent parties (i.e., not involved in prior phases), or with more resources, or in a more complete testing environment (one that is as complete a copy or simulation of the operational environment as possible, but still isolated from that operational environment). The focus of such evaluation, however, is on establishing whether the ontology will function properly in the operational environment and will not interrupt or degrade operations in that environment. This deployment-phase testing typically iterates until results indicate that it is safe to deploy the ontology without disrupting business activities. In cases featuring ongoing system usage and iterative ontology development and deployment cycles, this phase is often especially rigorous and protective of existing functionality in the deployed, in-use system. If and when such evaluation criteria have been satisfied, the ontology and/or system version is incorporated into the operational environment, released, and becomes available for live use. (3RXD)
Evaluating Deployment: Questions to be Answered (3RUB)
- Does the ontology meet all requirements addressed and evaluated in the development phases? (3RXE)
- Are sufficient (new) capabilities provided to warrant deployment of the ontology? (3RXF)
- Are there outstanding problems that raise the risk of disruptions if the ontology is deployed? (3RXG)
- Have successfully answered competency questions been used to create regression tests? (A sketch follows this list.) (3RXH)
- Have regression tests been run to identify any existing capabilities that may be degraded if the ontology is deployed? If some regression is expected, is it acceptable in light of the expected benefits of deployment? (3RXI)
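One way to implement the regression tests mentioned above is to store the formalized competency questions and re-run them automatically against each candidate ontology. The sketch below is a minimal illustration, assuming the Python rdflib and pytest libraries; the file name and query are hypothetical.

```python
import pytest
from rdflib import Graph

# Stored competency-question queries; each must keep returning answers.
COMPETENCY_QUERIES = {
    "orders_have_shipping_method": """
        PREFIX ex: <http://example.org/onto#>
        SELECT ?order ?m WHERE { ?order a ex:Order ; ex:shippingMethod ?m }
    """,
}

@pytest.fixture(scope="module")
def graph():
    g = Graph()
    g.parse("candidate_ontology.ttl", format="turtle")  # hypothetical file
    return g

@pytest.mark.parametrize("name,query", COMPETENCY_QUERIES.items())
def test_competency_question(graph, name, query):
    assert list(graph.query(query)), f"regression: {name} returned no answers"
```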
11. Operation and Maintenance Phase (3RUC)
This phase focuses on the sustainment of deployed capabilities, rather than the development of new ones. A particular system may have operation and maintenance and new ontology development phases going on at the same time, but these activities should be distinguished as they have different goals (improvement vs sustainment) and they operate on at least different versions of an ontology, if not different ontologies or different modules of an ontology. When an ontology (or version thereof) is in an operation and maintenance phase, information is collected about the results of operational use of the ontology. Problems or sub-optimal results are identified and micro-scale development cycles may be conducted to correct those problems. Simultaneous identification of new use cases, desired improvements, and new requirements that may happen during the same period of use should not be regarded as part of maintenance activity; rather, they are inputs to, or part of, exploration and possibly requirements development for a future version, extension, new ontology or new module. A single set of tools may be used to collect information of both sorts (for maintenance and for forward-looking exploration and requirements development) while an ontology is in use, but the information belongs to different activities. This distinction is manifested, for example, in the distinction between "bug reports" (or "problem reports") and "feature requests" (or "requested improvements") made by bug-tracking tools. The maintenance activity consists of identifying and addressing bugs or problems. (3RXJ)
Evaluating Operation and Maintenance: Questions to be Answered (3RUD)
The evaluation should be both continuous (e.g., open problem reporting) and regular (e.g., nightly automated regression testing): (3RXK)
- Are any regression tests failing? If so, are they being addressed? (3RXL)
- Is any functionality claimed for the most recent deployment failing? If so, can the problem be tracked to the ontology, or is the problem elsewhere? (3RXM)
- If the problem is located in the ontology, can it be corrected before the next major development and deployment cycle? If so, is it being addressed? (3RXN)
- If a problem occurs and cannot be addressed without a large development cycle effort, is the problem severe enough to warrant backing out of the deployment in which it was introduced? (3RXO)
12. Tools for Ontology Evaluation (3RUE)
There are central aspects of ontology quality that may not be amenable to software control or assessment. For example, the need for clear, complete, and consistent lexical definitions of ontology terms is not presently subject to effective software checking beyond identifying where lexical definitions are missing entirely. Another aspect of quality that is difficult for software to determine is the fidelity of an ontology. (3RXP)
There are currently no tools that support ontology development and evaluation across the whole life cycle. Existing tools support different life cycle phases, and for any given characteristic, a tool that performs well in one phase may be less well suited to another, where a different tool serves better. However, significant new ontology evaluation tools are currently becoming available to users.[14] An overview is presented as part of the Ontology Quality Software Survey.[15] (3RXQ)
13. Observations and Recommendations (3RUF)
1. We still have a limited understanding of the ontology life cycle, ontology development methodologies, and how to make best use of evaluation practices. More research in these areas is needed. Thus, any recommendation in this area is provisional. (3RUG)
2. There is no single ontology life cycle with a fixed sequence of phases. However, there are recurring patterns of activities, with identifiable outcomes, which feed into each other. In order to ensure quality, these outcomes need to be evaluated. Thus, evaluation is not a singular event, but should happen across the whole life of an ontology. (3RUH)
3. The different outputs of the ontology life cycle phases need to be evaluated with respect to the appropriate criteria. In particular, different requirements apply to informal models, reference ontologies, and operational ontologies, even when implemented in the same language. (3RUI)
4. Ontologies are evaluated against requirements that derive both from design decisions and the intended use of the ontology. Thus, a comprehensive evaluation of an ontology needs to consider the system that the ontology is part of. (3RXR)
5. There is a shortage of tools that support the tracking of requirements for, and the evaluation of, ontologies across all stages of their development and use. These kinds of tools should be developed and integrated into ontology development environments and repositories. (3RXS)
6. We strongly encourage ontology developers to integrate existing evaluation methodologies and tools as part of the development process. (3RXT)
--- (3RXU)
[1] There are different approaches to clustering aspects of ontologies to be evaluated. For example, in the OQuaRE framework, characteristics are broken down into sub-characteristics, which are linked to metrics. For more on OQuaRE and other approaches, see the Ontology Characteristics and Groupings reference collection. (3RXV)
[2] Fidelity, craftsmanship, and fitness are discussed in more detail in the section on ontology development below. (3RXW)
[3] See the Broad view of Information Systems reference collection for more on this understanding. (3RXX)
[4] "Business" here is meant in the broad sense, incorporating the activities of the organization or user that need the ontology and/or ontology-based system, regardless of whether those activities are commercial, governmental, educational, or other in nature. (3RXY)
[5] See the Ontology Usage reference collection for more about analyzing ontology usage. (3RXZ)
[6] See the Competency Questions reference collection for more on competency questions. (3RY0)
[7] An example of such informal outputs is: (3RYZ)
- (phrases in italics indicate entities) (3RY1)
- Every pick report is also an order status report. (3RY2)
- Every order has a shipping method. (3RY3)
- Possible shipping methods include ground and air. (3RY4)
- The shipping method for an individual order is determined by the fulfillment software after the order is packed. (3RY5)
- Every order has a shipping speed. Possible shipping speeds include standard, two-day, and overnight. (3RY6)
- The shipping speed for a specific order is chosen by the buyer when the buyer places the order. (3RY7)
- For the thing people in the business usually call order, the fulfillment database uses the word "sale." (3RYM)
[8] No distinction is made here between design and architecture. The design phase should be understood to encompass both. (3RY9)
[9] See the Existing Methodologies and Upper Ontologies reference collection for some examples of upper ontologies and design methodologies. (3RYA)
[10] See the Expert Review and Validation reference collection for more on expert evaluation of ontologies. (3RYB)
[11] See the Evaluating Fidelity reference collection for more on this, including evaluation via simulation. (3RYC)
[12] For more details, see the OntologySummit2013_Intrinsic_Aspects_Of_Ontology_Evaluation_Synthesis and OntologySummit2013_Intrinsic_Aspects_Of_Ontology_Evaluation_CommunityInput (3RYD)
[13] For more details, see: OntologySummit2013_Extrinsic_Aspects_Of_Ontology_Evaluation_Synthesis (3RYE)
[14] See the Tool Support reference collection and OntologySummit2013_Software_Environments_For_Evaluating_Ontologies_Synthesis, for more information about available tools. (3RYF)
[15] For the survey and results, see the Software Support for Ontology Quality and Fitness Survey. (3RYG)
Endorsement (3RUJ)
The above Communique has been endorsed by the individuals listed below. Please note that these people made their endorsements as individuals and not as representatives of the organizations they are affiliated with. (3RUK)
- AmandaVizedom (Communique Co-Lead Editor) (3RZR)
- FabianNeuhaus (Communique Co-Lead Editor) (3RZS)
- MatthewWest (OntologySummit2013 General Co-chair) (3RZT)
- MichaelGruninger (OntologySummit2013 General Co-chair) (3RZU)
- MikeDean (Summit Symposium Co-chair) (3RZV)
- RamSriram (Summit Symposium Co-chair) (3RZW)
- AliHashemi (3S7Z)
- BobSchloss (3RZX)
- DennisWisnosky (3RZY)
- DougFoxvog (3RZZ)
- FouadRamia (3S01)
- GaryBergCross (3S02)
- GilbertoFragoso (3S03)
- HansPolzer (3S04)
- JerrySmith (3S05)
- JoanneLuciano (3STP)
- KenBaclawski (3S06)
- LeoObrst (3S07)
- LynnePlettenberg (3S00)
- MarcelaVegetti (3S0V)
- MarkStarnes (3S08)
- MichaelBarnett (3S09)
- MichaelDenny (3S80)
- MikeBennett (3S0A)
- PatCassidy (3S0B)
- PavithraKenjige (3S0C)
- PeterYim (3S0D)
- RaviSharma (3S0E)
- SteveRay (3S0G)
- TerryLongstreth (3S0H)
- ToddSchneider (3SEH)
- ... (3SJH)
- RexBrooks (3S0F)
- DeborahMacPherson (3S0K)
- EdDodds (3S0L)
- PeteNielsen (3S14)
- PradeepKumar (3S2G)
- ElizabethFlorescu (3S2H)
- JamesSchoening (3S2I)
- KingsleyIdehen (3S2M)
- BarrySmith (3S2N)
- ArturoSanchez (3S2O)
- JohnMylopoulos (3S2P)
- MeganKatsumi (3S2Q)
- MaryParmelee (3S2R)
- BruceBray (3S2S)
- KiyongLee (3S7Q)
- DosamHwang (3S7R)
- MarcHalpern (3S7V)
- MarkMusen (3S7W)
- HansTeijgeler (3S7X)
- JoelBender (3S7Y)
- JimDisbrow (3S81)
- TaraAthan (3SDM)
- HaroldBoley (3SDN)
- AntonioLieto (3SIX)
- DougHolmes (3SIY)
- BobSmith (3SJ8)
- ... (3SJI)
- AlanRector (3SJJ)
- AldoGangemi (3SJL)
- AnatolyLevenchuk (3SJM)
- AnthonyCohn (3SJN)
- BillMcCarthy (3SJP)
- ChrisChute (3SJQ)
- ChrisPartridge (3SJR)
- ChrisWelty (3SJS)
- CoryCasanave (3SJU)
- DagobertSoergel (3SJV)
- DavidLeal (3SJW)
- DavidPrice (3SJX)
- DeborahMcGuinness (3SJY)
- DeniseBedford (3SJZ)
- DougLenat (3SK0)
- DuaneNickull (3SK1)
- ElisaKendall (3SK2)
- EnricoMotta (3SK3)
- ErnieLucier (3SK4)
- EvanWallace (3SK5)
- FrankLoebe (3SK6)
- GeorgeStrawn (3SK7)
- GeorgeThomas (3SK8)
- GiancarloGuizzardi (3SK9)
- HensonGraves (3SKA)
- HowardMason (3SKB)
- IanHorrocks (3SKC)
- JamieClark (3SKE)
- JeanneHolm (3SKF)
- JimSpohrer (3SKG)
- JohnBateman (3SKH)
- JohnSowa (3SKI)
- KatherineGoodier (3SKJ)
- KathyLaskey (3SKK)
- KenLaskey (3SKL)
- LaurentLiscia (3SKM)
- LaureVieu (3SKN)
- MarciaZeng (3SKO)
- MarkGreaves (3SKP)
- MarkLuker (3SKQ)
- MartinHepp (3SKR)
- MichaelUschold (3SKT)
- MichaelRiben (3T8L)
- MillsDavis (3SKU)
- OliverKutz (3SKW)
- OlivierBodenreider (3SKX)
- PeterBenson (3SKY)
- PeterBrown (3SKZ)
- RalphHodgson (3SL0)
- RichardMarkSoley (3SL1)
- RiichiroMizoguchi (3SL3)
- RudiStuder (3SL4)
- StefanDecker (3SL5)
- StefanoBorgo (3SL6)
- TaniaTudorache (3SL7)
- TimFinin (3SL8)
- TimMcGrath (3SL9)
- TomGruber (3SLA)
- TrishWhetzel (3SLB)
- ... (3SLC)
- GaryGannon (3STF)
- NikolayBorgest (3STG)
- VinodPavangat (3STH)
- MelissaHaendel (3STI)
- BillMcCarthy (3STJ)
- VinayChaudhri (3STO)
- FrankOlken (3STR)
- BartGajderowicz (3STS)
- AndreasTolk (3STT)
- ChrisMenzel (3STU)
- BradleyShoebottom (3STV)
- ReginaMotz (3STW)
- TimWilson (3STX)
- JackRing (3STY)
- JamesWarren (3STZ)
- NicolauDePaula (3SU3)
- JesualdoTomasFernandezBreis (3SU4)
- TillMossakowski (3SU5)
- DavidNewman (3SU6)
- SamirTartir (3T8J)
- ChristopherSpottiswoode (3T8K)
- MariaPovedaVillalon (3T8M)
- MaryBalboni (3T8N)
- HensonGraves (3T8O)
- DicksonLukose (3T96)
- DiegoMagro (3T97)
- JohnMcClure (3T9D)
- ChristophLange (3T9E)
- MichaelUschold (3T9F)
- TorstenHahmann (3T9G)
- NathalieAussenacGilles (3T9H)
- MitchKokar (3T9I)
- RichardMartin (3T9J)
- KyoungsookKim (3TNH)
- Karen & DougEngelbart (3TP8)
- AstridDuqueRamos (3TPA)
- NancyWiegand (3TPB)
- MichelDumontier (3TPN)
- BillHogan (3TPT)
- FranciscoEdgarCastilloBarrera (3TPU)
- CarlosRueda (3TPV)
- MichaelFitzmaurice (3TQ7)
- ScottHills (3TQF)
- CeciliaZanniMerk (3TQM)
- MichelVandenBossche (3TQN)
- SameeraAbar (3TQO)
- NurhamizahHamka (3TQV)
- BrianHaugh (3TVL)
- MeikaUngricht (3TZ1)
- MariaKeet (3TZ2)
- MatthewHettinger (3U4F)
- JulitaBermejoAlonso (3U4G)
- AlessandroOltramari (3U4H)
- PatrickMaroney (3U4W)
- MathiasBrochhausen (3U50)
- DeborahNichols (3U52)
- JoelNatividad (3U53)
- SimonSpero (3U6Y)
Note that the solicitation for endorsements is now closed (as of end of day 15-Jun-2013). We thank everyone who was involved in this OntologySummit2013, as well as those in the broader community who have taken a position and provided the above endorsements. (3S27)
Further Comments, Suggestions and Dialog: (3T9O)
As suggested by the Summit Co-chairs towards the end of the OntologySummit2013_Symposium, we invite further comments and suggestions, and encourage continued dialog around the theme of this Summit, in particular around what is covered in this Communique, even though its content is now locked in as the snapshot position we have collectively developed and supported. (3T9P)
Please post any further comments, suggestions, and thoughts on this OntologySummit2013_Communique at: /CommentsSuggestions ... suggestions for future Summits can be posted there as well. (3T9Q)
For the record ... (3RZH)
Towards Ontology Evaluation across the Life Cycle (3RZI)
- 2013_05_03 - version 1.0.0 of the OntologySummit2013_Communique was formally adopted by the community at the OntologySummit2013_Symposium (3RZJ)
- The OntologySummit2013 Communique, in format prepared for The OntologySummit2013_Symposium, is here: https://docs.google.com/document/d/1gaQ8DIdg20AGJnlaCpQ7ebwJH_L4c4tK5RH1wyMHQig/edit?usp=sharing (3J2E)
- shortened url for the OntologySummit2013 Communique: (3RZE)
-- this page is maintained by the Co-lead Editors of the OntologySummit2013 Communique and the WikiAdmin ... please do not edit. (3RZQ)