oor-forum
[Top] [All Lists]

[oor-forum] FW: URIs and Unique IDs

To: "OpenOntologyRepository-discussion" <oor-forum@xxxxxxxxxxxxxxxx>
From: "Obrst, Leo J." <lobrst@xxxxxxxxx>
Date: Fri, 31 Oct 2008 14:32:18 -0400
Message-id: <9F771CF826DE9A42B548A08D90EDEA8003EA2C65@xxxxxxxxxxxxxxxxx>

I’m cross-posting this. Versioning is definitely an issue we need to address for OOR, and though we may derive our methods from others, we should certainly consider the ramifications now.

 

Leo

 

_____________________________________________
Dr. Leo Obrst       The MITRE Corporation, Information Semantics
lobrst@xxxxxxxxx    Information Discovery & Understanding, Command and Control Center

Voice: 703-983-6770 7515 Colshire Drive, M/S H305
Fax: 703-983-1379   McLean, VA 22102-7508, USA
 

From: semantic-web-request@xxxxxx [mailto:semantic-web-request@xxxxxx] On Behalf Of Michael F Uschold
Sent: Thursday, October 30, 2008 5:15 AM
To: semantic-web@xxxxxx
Cc: aldo.gangemi@xxxxxxxxx; Conor Shankey; Peter Mika; Ora Lassila; Pan, Dr Jeff Z.; Tim Berners-Lee; Frank van Harmelen; sean.bechhofer@xxxxxxxxxxxxxxxx
Subject: URIs and Unique IDs

 

I'm resending this message to the semantic web discussion group for the record.

On Wed, Oct 29, 2008 at 3:53 PM, Michael F Uschold <uschold@xxxxxxxxx> wrote:

Currently there is no accepted practice on how/whether to migrate to new URIs when a new version of an ontology is published. This is largely due to the fact that there is no good technology for managing versioning, and the W3C consciously (and probably sensibly) decided not to address the issue.  Versioning information is meant to be placed on a version annotation.

However the current situation is like the wild West, and everyone will be doing different things, resulting in a mess.

Wordnet published a new version and minted all new URIs even though many or most of the entries were semantically identical.
The SKOS working group is currently considering the pros and cons of various options. One is to adopt all new URIs in a new namespace, just like Wordnet. Another is to keep the exact same name space, and change the semantics of a small number of terms while keeping the same URI. A third is to keep the same URI for the unchanged terms, and mint new URIs for the terms with different semantics.

This is a problem because they have no guidelines, they are basically stumbling along in the dark.

I believe that this is an urgent matter that needs attention to prevent a nightmare from unfolding.

In the current state of semantic web use, it may not matter to much what choice the SKOS team chooses. This is mainly relatively few applications will be impacted, which may be due to the fact that the applications are not driven by the ontologies.

However, when usage of ontologies and ontology-driven applications becomes more mainstream, the differences could be profound. Given that this issue is intimately tied up with versioning, and that we have no good solutions yet, do we continue to throw our hands up and punt? Absolutely not, it is essential that a good precedent is set ASAP that is based on sound principles.

Here is how.

We should imagine a future where ontology versioning is handled properly and do things that are going to make things easy to migrate to that future. We don't know how the versioning black box will work, but we should be able to make some clear and definitive statements about WHAT it does.

For example, in the future, ontology-driven applications will be fairly mainstream. URIs are used as unique identifiers. When applications are driven from ontologies, then they will break if you change the semantics in mid-stream.  Imagine an application that relied on the semantics of broader as it was originally specified with transitivity.  They loaded data that was created using that semantics. Then the SKOS spec changes and broader is no longer transitive. New datasets are created according to this new meaning. The application loads more data. It needs to know which data is subject to transitive closure and which is not. This is impossible, if the same SKOS URI is used for versions with different semantics.  They are different beasts, and thus MUST have different URIs.

Similarly, if SKOS mints a whole new namespace and changes all the URIs, the application also has a problem. It has datasets with the old URI and datasets with the new URIs. This means that the datasets will not be linked like they should, they will treat the two different URIs for the same thing as being different.  If one wanted to go into OWL-Full, one can use owl:sameAs, but this is not very practical.  The only reasonable solution is to have the same URI for things with the same semantics.

Thus, any ontology versioning systemof the future will rely on these two principles:
1. If the semantics of a term changes, then it needs to have a new unique ID.
2. If the semantics of a term does NOT change, then it should maintain the same ID in any future versions.

If either of these two guidelines are broken, then so will the ontology-driven applications of the future.

These maxims hold without exception for any standards that are formally released as standards.
A question arises if we need to hold to the same standards for standards like SKOS which was never formally blessed.

The practical difficulties will be the same whether the standard is blessed or not. It only really depends on whether the standard is a de facto standard,or whether it is getting significant use. If users build things and ontology producers break things through carelessness, this will hinder semantic web technology adoption.

Another question is what to do if the original standard is belived to be incorrect, and the new one is the fixed one. Can one then keep the same URI?
Again, the answer should be informed by the impact on applications. The same problems will occur if you change the semantics and keep the same URI even if you are fixing a mistake.  The URI with the wrong semantics must keep its original unique ID.

Michael Uschold

 


_________________________________________________________________
Message Archives: http://ontolog.cim3.net/forum/oor-forum/  
Subscribe: mailto:oor-forum-join@xxxxxxxxxxxxxxxx 
Config/Unsubscribe: http://ontolog.cim3.net/mailman/listinfo/oor-forum/  
Shared Files: http://ontolog.cim3.net/file/work/OOR/ 
Wiki: http://ontolog.cim3.net/cgi-bin/wiki.pl?OpenOntologyRepository     (01)
<Prev in Thread] Current Thread [Next in Thread>
  • [oor-forum] FW: URIs and Unique IDs, Obrst, Leo J. <=