To: | ontolog-forum@xxxxxxxxxxxxxxxx |
---|---|
From: | FERENC KOVACS <f.kovacs@xxxxxxxxxxxxxx> |
Date: | Sat, 25 Sep 2010 14:29:30 +0000 (GMT) |
Message-id: | <276660.75521.qm@xxxxxxxxxxxxxxxxxxxxxxxxxxx> |
John,
Thank you very much. You are wonderful again. What a fantastic resource!
Many thanks, Ferenc
OntoNotes was announced as a big project with large resources that are being produced and released under the LGPL license. See the first excerpt at the end of this note for the main URL and a paragraph that describes the project. Following are the components: * Treebank. Text annotated with syntactic information * PropBank. Verbs tagged with their semantic argument structure * Word Sense. Tagging all verbs and nouns with their word sense and linking them to the Omega ontology * Ontology. The Omega Ontology, a broad-coverage ontology containing word senses * Coreference. Marking multiple mentions of the same entity in text. The second excerpt below describes the Omega ontology. These resources are very useful, especially for NLP. However, the intent of the OntoNotes project is to annotate huge volumes of text for the purpose of training statistical tools. In my opinion, large-scale annotation by hand is obsolescent and unnecessary. In any case, the resources can be used for other projects as well. John Sowa ______________________________________________________________ From http://www.bbn.com/ontonotes/ The OntoNotes project is a collaborative effort between Raytheon BBN Technologies, the University of Colorado, the University of Pennsylvania, and the University of Southern California's Information Sciences Institute to produce such a resource. It aims to annotate a large corpus comprising various genres of text (news, conversational telephone speech, weblogs, use net, broadcast, talk shows) in three languages (English, Chinese, and Arabic) with structural information (syntax and predicate argument structure) and shallow semantics (word sense linked to an ontology and coreference). OntoNotes builds on two time-tested resources, following the Penn Treebank for syntax and the Penn PropBank for predicate-argument structure. Its semantic representation will include word sense disambiguation for nouns and verbs, with each word sense connected to an ontology, and coreference. Over the course of the five-year program, our current goals call for annotation of over a million words each of English and Chinese, and half a million words of Arabic. From http://www.isi.edu/~philpot/papers/ijcnlp05/ijcnlp-olr05.pdf Omega is a 120,000-node terminological ontology constructed at USC ISI as the reorganization and synthesis of WordNet 2.0 (Miller 1990; Fellbaum 1998), a lexically oriented network constructed on general cognitive principles, and Mikrokosmos (Mahesh 1996; O’Hara et al. 1998), a conceptual resource originally conceived to support translation, into a new upper model, created expressly in order to facilitate the merging of lower models into a functional whole. Omega, like its close predecessor SENSUS (Knight et al. 1994), can be characterized as a shallow, lexically oriented, term taxonomy. By far the majority of its concepts can be stated in English by a single word. Omega contains no formal concept definitions and only relatively few interconnections (semantic relations) between concepts. By making few commitments to any specific theories of semantics or particular representations, Omega enjoys a malleability that has allowed it to be used in a variety of applications, including question answering and information integration. _________________________________________________________________ Message Archives: http://ontolog.cim3.net/forum/ontolog-forum/ Config Subscr: http://ontolog.cim3.net/mailman/listinfo/ontolog-forum/ Unsubscribe: mailto:ontolog-forum-leave@xxxxxxxxxxxxxxxx Shared Files: http://ontolog.cim3.net/file/ Community Wiki: http://ontolog.cim3.net/wiki/ To join: http://ontolog.cim3.net/cgi-bin/wiki.pl?WikiHomePage#nid1J To Post: mailto:ontolog-forum@xxxxxxxxxxxxxxxx _________________________________________________________________ Message Archives: http://ontolog.cim3.net/forum/ontolog-forum/ Config Subscr: http://ontolog.cim3.net/mailman/listinfo/ontolog-forum/ Unsubscribe: mailto:ontolog-forum-leave@xxxxxxxxxxxxxxxx Shared Files: http://ontolog.cim3.net/file/ Community Wiki: http://ontolog.cim3.net/wiki/ To join: http://ontolog.cim3.net/cgi-bin/wiki.pl?WikiHomePage#nid1J To Post: mailto:ontolog-forum@xxxxxxxxxxxxxxxx (01) |
<Prev in Thread] | Current Thread | [Next in Thread> |
---|---|---|
|
Previous by Date: | [ontolog-forum] OntoNotes and the Omega ontology, John F. Sowa |
---|---|
Next by Date: | Re: [ontolog-forum] [Fwd: Re: More on patents], Rich Cooper |
Previous by Thread: | [ontolog-forum] OntoNotes and the Omega ontology, John F. Sowa |
Next by Thread: | Re: [ontolog-forum] OntoNotes was announced as a big project with large resources, Jack Park |
Indexes: | [Date] [Thread] [Top] [All Lists] |