[Top] [All Lists]

[ontolog-forum] What support should a corpus provide?

To: "'John F Sowa'" <sowa@xxxxxxxxxxx>, <corpora@xxxxxx>
Cc: "'[ontolog-forum] '" <ontolog-forum@xxxxxxxxxxxxxxxx>
From: "Rich Cooper" <rich@xxxxxxxxxxxxxxxxxxxxxx>
Date: Fri, 8 Aug 2014 11:12:00 -0700
Message-id: <02e801cfb334$43376990$c9a63cb0$@englishlogickernel.com>

Dear Corpus Analysts and Ontologists,


I have just made available a corpus of documents from the US Patent and Trademark Office which are available for corpus analysts.  The tools available now are sufficient for supporting attorneys, inventors, scientists, and other similar application legal and technology roles. 


What additional support should I provide in the software for supporting corpus analysis of selected patent document subsets?  I have a web site with extensive help and tutorial materials – I suggest starting at:




to see an index of capability descriptions.  I can make available the “frequent words” and the “rare words” lists as text files, along with the patent documents in whole or in sections for data, abstract, description and claims, which are already extracted from the selected document set.  The claim tree is parsed, and the claims are separated into claim elements, all of which can be provided. 


Is there anything else that corpus analysts would like to see in the software?


Suggestions highly appreciated,




Rich Cooper


Rich AT EnglishLogicKernel DOT com

9 4 9 \ 5 2 5 - 5 7 1 2

Message Archives: http://ontolog.cim3.net/forum/ontolog-forum/  
Config Subscr: http://ontolog.cim3.net/mailman/listinfo/ontolog-forum/  
Unsubscribe: mailto:ontolog-forum-leave@xxxxxxxxxxxxxxxx
Shared Files: http://ontolog.cim3.net/file/
Community Wiki: http://ontolog.cim3.net/wiki/ 
To join: http://ontolog.cim3.net/cgi-bin/wiki.pl?WikiHomePage#nid1J    (01)

<Prev in Thread] Current Thread [Next in Thread>
  • [ontolog-forum] What support should a corpus provide?, Rich Cooper <=