ontolog-forum

Re: [ontolog-forum] What words mean

To: "Rob Freeman" <lists@xxxxxxxxxxxxxxxxxxx>
Cc: "[ontolog-forum] " <ontolog-forum@xxxxxxxxxxxxxxxx>
From: "Barker, Sean (UK)" <Sean.Barker@xxxxxxxxxxxxxx>
Date: Tue, 19 Feb 2008 16:53:08 -0000
Message-id: <E18F7C3C090D5D40A854F1D080A84CA4B4E42D@xxxxxxxxxxxxxxxxxxxxxx>


This mail is publicly posted to a distribution list as part of a process
of public discussion, any automatically generated statements to the
contrary notwithstanding. It is the opinion of the author and does not
represent an official company view.

Rob,

        Thanks for the reference. My hypothesis is actually based on some
data mining issues rather than on NLP or AI, but having followed the
links, I suspect that the two problems are closely related, in the
sense that the same theories may apply to both. Conversely, I am going
to be wary of any "we can solve it by data mining" project.

Sean Barker
BAE SYSTEMS - Advanced Technology Centre
Bristol, UK
+44(0) 117 302 8184

BAE Systems (Operations) Limited
Registered Office: Warwick House, PO Box 87, Farnborough Aerospace
Centre, Farnborough, Hants, GU14 6YU, UK
Registered in England & Wales No: 1996687

> -----Original Message-----
> From: ontolog-forum-bounces@xxxxxxxxxxxxxxxx 
> [mailto:ontolog-forum-bounces@xxxxxxxxxxxxxxxx] On Behalf Of 
> Rob Freeman
> Sent: 16 February 2008 06:05
> To: [ontolog-forum]
> Subject: Re: [ontolog-forum] What words mean
> 
> 
> Hi Sean,
> 
> On Feb 16, 2008 12:58 AM, Barker, Sean (UK) 
> <Sean.Barker@xxxxxxxxxxxxxx> wrote:
> >
> > 5) As a working hypothesis, one might like to try the following:
> >
> > a) Web pages are generated by a finite set of random processes;
> > b) Each process has a set of probability distributions and
> > correlation functions that determine the probability of words
> > appearing on the page.
> > c) An investigation into the properties of the web from the words
> > contained on a web page is an attempt to infer from the
> > distributions what the set of generating processes is.
> 
> Have you heard of the "Hutter Prize" for the compression of 
> text by human knowledge?
> 
> "...in 1950, Claude Shannon estimated the entropy 
> (compression limit) of written English to be about 1 bit per 
> character [3]. To date, no compression program has achieved 
> this level."
> (http://cs.fit.edu/~mmahoney/compression/rationale.html)
> 
> There's an annual Euro 50,000 prize for the best effort.
> (http://prize.hutter1.net/)
> 
> The idea of the prize is the old one that we (can't predict, 
> and thus compress, text as much as we expect because we...) 
> need human knowledge to "disambiguate" natural language. 
> That's an old idea. I believe almost the opposite. But the
> prize, and the work of Marcus Hutter which motivated it, are
> interesting for what they say about the predictability of
> natural language, and in particular the "randomness" of meaning.
> By the "randomness of meaning" I mean that Hutter's work (like
> Schmidhuber's "New AI") assumes it is necessary to use a
> probabilistic model of intelligence.
> 
> Note that it is also a definition of intelligence dependent on goals
> (cf. W. J. Freeman). Hutter: "No Intelligence without Goals."
> 
> Hutter has written a book on this:
> 
> Universal Artificial Intelligence - Sequential Decisions 
> based on Algorithmic Probability
> 
> http://www.hutter1.net/ai/uaibook.htm
> 
> -Rob
>  
> _________________________________________________________________
> Message Archives: http://ontolog.cim3.net/forum/ontolog-forum/
> Subscribe/Config: http://ontolog.cim3.net/mailman/listinfo/ontolog-forum/
> Unsubscribe: mailto:ontolog-forum-leave@xxxxxxxxxxxxxxxx
> Shared Files: http://ontolog.cim3.net/file/
> Community Wiki: http://ontolog.cim3.net/wiki/
> To Post: mailto:ontolog-forum@xxxxxxxxxxxxxxxx

********************************************************************
This email and any attachments are confidential to the intended
recipient and may also be privileged. If you are not the intended
recipient please delete it from your system and notify the sender.
You should not copy it or use it for any purpose nor disclose or
distribute its contents to any other person.
********************************************************************


