Dear Paola, (01)
Proper names tend to start with capital letters, you can also infer
something from the grammatical structure. So there is a decent amount
of NLP here. (02)
But be careful, this is all a lot short of extracting ALL of the
information in a text, just certain types that it knows the patterns
for. I expect the Reuters stuff just has quite a lot of patterns.
Anything that doesn't fit a pattern gets ignored. (03)
Regards (04)
Matthew West
Reference Data Architecture and Standards Manager
Shell International Petroleum Company Limited
Registered in England and Wales
Registered number: 621148
Registered office: Shell Centre, London SE1 7NA, United Kingdom (05)
Tel: +44 20 7934 4490 Mobile: +44 7796 336538
Email: matthew.west@xxxxxxxxx
http://www.shell.com
http://www.matthew-west.org.uk/ (06)
> -----Original Message-----
> From: ontolog-forum-bounces@xxxxxxxxxxxxxxxx
> [mailto:ontolog-forum-bounces@xxxxxxxxxxxxxxxx]On Behalf Of
> paola.dimaio@xxxxxxxxx
> Sent: 14 February 2008 16:47
> To: [ontolog-forum]
> Subject: Re: [ontolog-forum] language ambiguity (was: Axiomatic
> ontology)
>
>
> Thanks Matthew
> sounds like simple rules then? how does the system know if a word is a
> proper name though, does it use an ontology or taxonomy as a
> reference, I ll look into it
>
> p
>
>
> On 2/14/08, matthew.west@xxxxxxxxx <matthew.west@xxxxxxxxx> wrote:
> > Dear Paola,
> >
> > > >if you consider this sort of thing to be a tool for computer
> > > > assisted ontology development, then it can be very helpful,
> > > particularly
> > > > where we are talking about extracting brute facts. However,
> > > if you are
> > > > talking about more general ontology extraction, then of
> > > course the ontolgoy
> > > > produced is going to be no better than that of the document
> > > considered, and
> > > > there are the usual issues with ambiguity that computers
> > > usually struggle
> > > > with, especially using words with different senses in close
> > > proximity.
> > > >
> > >
> > > I agree, and think that such functionality can be useful to aid
> > > concept extraction for further refinement. How does the system
> > > identify the classes, objects and relations I dont
> understand, do they
> > > have to be in RDF? Or how does it do it? (haven't read the
> > > documentation)
> >
> > MW: What I have seen is a kind of NLP parsing, so nouns are classes,
> > proper names are individuals, spotting patterns for names of people,
> > addresses, and company names, dates, and then understanding
> the patterns
> > around certain key words, so when you see something like:
> > "Shell bought XYZ co from ABC Corp on July 17th 2006"
> > it can create the appropriate records of the activity, when
> it happened
> > and who the parties were involved.
> >
> > Regards
> >
> > Matthew West
> > Reference Data Architecture and Standards Manager
> > Shell International Petroleum Company Limited
> > Registered in England and Wales
> > Registered number: 621148
> > Registered office: Shell Centre, London SE1 7NA, United Kingdom
> >
> > Tel: +44 20 7934 4490 Mobile: +44 7796 336538
> > Email: matthew.west@xxxxxxxxx
> > http://www.shell.com
> > http://www.matthew-west.org.uk/
> >
> > >
> > >
> > > Paola Di Maio
> > >
> > > _________________________________________________________________
> > > Message Archives: http://ontolog.cim3.net/forum/ontolog-forum/
> > > Subscribe/Config:
> > http://ontolog.cim3.net/mailman/listinfo/ontolog-forum/
> > Unsubscribe: mailto:ontolog-forum-leave@xxxxxxxxxxxxxxxx
> > Shared Files: http://ontolog.cim3.net/file/
> > Community Wiki: http://ontolog.cim3.net/wiki/
> > To Post: mailto:ontolog-forum@xxxxxxxxxxxxxxxx
> >
> >
> >
> >
> > _________________________________________________________________
> > Message Archives: http://ontolog.cim3.net/forum/ontolog-forum/
> > Subscribe/Config:
> http://ontolog.cim3.net/mailman/listinfo/ontolog-forum/
> > Unsubscribe: mailto:ontolog-forum-leave@xxxxxxxxxxxxxxxx
> > Shared Files: http://ontolog.cim3.net/file/
> > Community Wiki: http://ontolog.cim3.net/wiki/
> > To Post: mailto:ontolog-forum@xxxxxxxxxxxxxxxx
> >
> >
>
>
> --
> Paola Di Maio
> School of IT
> www.mfu.ac.th
> *********************************************
>
> _________________________________________________________________
> Message Archives: http://ontolog.cim3.net/forum/ontolog-forum/
> Subscribe/Config:
> http://ontolog.cim3.net/mailman/listinfo/ontolog-forum/
> Unsubscribe: mailto:ontolog-forum-leave@xxxxxxxxxxxxxxxx
> Shared Files: http://ontolog.cim3.net/file/
> Community Wiki: http://ontolog.cim3.net/wiki/
> To Post: mailto:ontolog-forum@xxxxxxxxxxxxxxxx
>
>
> (07)
_________________________________________________________________
Message Archives: http://ontolog.cim3.net/forum/ontolog-forum/
Subscribe/Config: http://ontolog.cim3.net/mailman/listinfo/ontolog-forum/
Unsubscribe: mailto:ontolog-forum-leave@xxxxxxxxxxxxxxxx
Shared Files: http://ontolog.cim3.net/file/
Community Wiki: http://ontolog.cim3.net/wiki/
To Post: mailto:ontolog-forum@xxxxxxxxxxxxxxxx (08)
|