ontolog-forum
[Top] [All Lists]

Re: [ontolog-forum] What goes into a Lexicon?

To: ontolog-forum@xxxxxxxxxxxxxxxx
From: "John F. Sowa" <sowa@xxxxxxxxxxx>
Date: Mon, 27 Feb 2012 09:09:42 -0500
Message-id: <4F4B8EA6.4090605@xxxxxxxxxxx>
David E, Amanda, David P,    (01)

DE
> How ontologies could add value, I don't have a clue, since as far as
> I know it's a hard&  fast ontological rule that requires a term to
> have a single definition/meaning... an extremely unrealistic
> constraint for this sort of ugly real world challenge.  [If I've got
> this wrong, please set me straight.]    (02)

The only hard and fast rule is that anybody who thinks there is
a hard and fast rule is almost certainly wrong.    (03)

There are two issues here: (1) the use of URIs to point to unique
definitions of word senses in natural languages, and (2) their
use to define unique meanings in some formally defined theories.    (04)

For both of these issues, the hope that URIs will magically solve
all the problems of ambiguity and interoperability is, so to speak,
forlorn.  In other words, it's pie in the sky after you die.    (05)

DE
> Imposing an organized ontology on this disorganized language
> collection probably isn't going be of much help.    (06)

I agree.  That is issue #1 about URIs for NL word senses.
Some issues:    (07)

  1. Who is going to decide which unique URI to use in any particular
     case?  Dictionaries developed by professional lexicographers
     disagree on the number of senses for most words.  Even when
     expert linguists annotate texts from a fixed selection of word
     senses, a rate of agreement of 95% is rare.    (08)

  2. The error rates for statistical methods for choosing word senses
     are far worse than the rates by professionals, but they are often
     better than the choices by untrained native speakers.  Yet the
     error rates for both are so high that the illusion of precision
     is just that -- an illusion.    (09)

  3. I'm not saying that annotating NL documents is bad.  But I'm just
     saying that you can't assume that those annotations have the kind
     of precision that is necessary for complex deductions.  They might
     be useful for many purposes, but you can't assume that they will
     guarantee the kind of precision that logicians feel comfy with.    (010)

So let's move on to the use of URIs for interoperability among
different computer systems that have formally defined theories.    (011)

  1. Take a simple case, where we have a collection of terms defined
     in some version of logic, say predicates in FOL or classes in OWL.
     We'll start with just three terms -- A, B, and C -- each of which
     is a URI that points to a definition of each one in terms of the
     others.  I'll just use 3 for convenience, but you can have any
     number of terms.  And let's assume that those definitions will
     remain unchanged -- that's a big assumption, given the way the
     WWW works, but let's assume that nobody changes the definitions.    (012)

  2. Now suppose that we have two computer systems X and Y that
     import the definitions of A, B, and C and combine them with
     theories that add lots of further information, including many
     independently developed axioms and definitions of other terms
     that are private to each.  Those new axioms will assert new
     relationships among A, B, and C.  How can we be sure that
     the independently defined extensions in X and Y will remain
     consistent with one another?    (013)

  3. To be specific, let's suppose that those terms A, B, C...
     take common words like 'person', 'employee', 'name', 'address',
     'salary', 'manager', etc., and give them precise definitions.
     Then the systems X and Y each add different info that customize
     those definitions by adding new info, but without changing the
     base definitions specified by the URIs.  Can we be sure that
     those independently defined customizations will be consistent?    (014)

  4. Suppose that systems X and Y interchange messages that use
     only the common terms that are specified by the URIs.  But
     internally, each system adds different statements about them.
     In X, all the added info in X is consistent with the base
     definitions.  In Y, all the added info in Y is consistent
     with the base definitions.  But how can we be sure that the
     independently added info in X and Y will not create conflicts
     when the "same" message using the "same" definitions is
     interpreted in each system?    (015)

  5. These issues imply that meanings depend on more than generic
     definition specified by a URI.  In each context in which you
     use those terms, other statements will add more "meaning" --
     i.e., they will specialize the terms further.  If you use those
     same terms in different contexts, the meanings will be specialized
     in different ways.    (016)

  6. Does that mean you need a different URI for every use of a term
     in every possible context?  Do you have to change the URI each
     time you add one more axiom to the context in which it is used?
     If two different systems are going to share data that uses the
     same terms, but they use them in different ways internally,
     how could URIs resolve these issues?    (017)

These issues, by the way, plague every large system in the world.
Developers try to keep the definitions consistent when they go from
from version 1.0 to 2.x, to y.z -- but they can't really depend on
complete consistency.  Just look at the updates you keep getting
from your favorite vendors.    (018)

AV
> And, before somebody wonders how one could possibly work with and edit
> a massive ontology like that without readable names, the answer is,
> more easily than ever. You could view the concepts by ID only, or
> with the parenthetical inclusion of a preferred label in your
> language of choice.    (019)

I agree that the URI (Unreadable Resource Identifier) is the official
name.  But by itself, that doesn't solve either (1) the NL issues
about choosing word senses or (2) the formal logic issues about
using the "same" definitions in different formal contexts.    (020)

DP
> The meaning that's interesting to a user of any system (ontology or not)
> are in the algorithms, visualizations and analysis that depend completely
> on what the concepts of interest mean in the language of the discipline
> in which the application is used. That language is almost completely based
> on text read by a human - the software developer.    (021)

That is a good point:  even when all terms are defined and used in
formal languages (logic or procedures), the people who use those
formal languages read the informal NL text -- not the formal
definitions specified by the URI.  That mixes issues (1) and (2).    (022)

Leo    (023)


















_________________________________________________________________
Message Archives: http://ontolog.cim3.net/forum/ontolog-forum/  
Config Subscr: http://ontolog.cim3.net/mailman/listinfo/ontolog-forum/  
Unsubscribe: mailto:ontolog-forum-leave@xxxxxxxxxxxxxxxx
Shared Files: http://ontolog.cim3.net/file/
Community Wiki: http://ontolog.cim3.net/wiki/ 
To join: http://ontolog.cim3.net/cgi-bin/wiki.pl?WikiHomePage#nid1J    (024)

<Prev in Thread] Current Thread [Next in Thread>