ontolog-forum
[Top] [All Lists]

Re: [ontolog-forum] Natural Language based SPARQL Generator

To: "[ontolog-forum]" <ontolog-forum@xxxxxxxxxxxxxxxx>
From: Adrian Walker <adriandwalker@xxxxxxxxx>
Date: Fri, 1 Feb 2013 09:18:05 -0500
Message-id: <CABbsESdKr0sL6_2rNrc6njVs7Um2PXKuAzmQrnK3ZBJZPgA3Dg@xxxxxxxxxxxxxx>
Hi John,

You wrote...

    As for webifying a version of SQL, that would be fairly easy to do.

In one sense that's been in operation of a number of years now.

MySql and Oracle both support remote querying.  We use this in our system in "linking rules" [1,2] such as
url:reengineeringllc.com  dbms:mysql  dbname:mysql tablename:T2
 port:3306 id:mysql password:*** -------------------------------------------------------------- in this-month the refinery this-name has committed to schedule
this-amount gallons of this-product
So Yes, easy to do for small-to-medium data, but both SQL and SPARQL will have efficiency issues with multi-site distributed big data.

                          -- Adrian

[1]  www.reengineeringllc.com/demo_agents/Oil-IndustrySupplyChain1MySql1.agent

[2]  www.reengineeringllc.com/IBL_tutorial_part4.html

Internet Business Logic
A Wiki and SOA Endpoint for Executable Open Vocabulary English Q/A over SQL and RDF
Online at www.reengineeringllc.com  
Shared use is free, and there are no advertisements

Adrian Walker
Reengineering

On Thu, Jan 31, 2013 at 11:36 PM, John F Sowa <sowa@xxxxxxxxxxx> wrote:
Kingsley and Doug,

The Quepy developers use the NLTK toolkit, which is an open-source
set of Python-based software for NLP processing.  It's widely used
for teaching purposes.  But it is not state of the art NLP software.

KI
> it's only using DBpedia whereas if it used the LOD cloud cache
> there would be a much broader knowledgebase.

Google answered every one of my five questions, but Quepy could
only answer one of them.  I also tried Bing, which did just
as well as Google on all five.

In fact, Bing got a better answer for the question "When did
the Revolutionary War end?"  In addition to hits that were similar
to Google's, Bing gave the following answer above the list of hits:

Bing
> The American Revolutionary War began on Wednesday, April 19, 1775
> and ended on Wednesday, September 3, 1783.

DF
> I asked for the President of the UK, and since the SPARQL query
> was for a leader, not a president, the answers returned were
> David Cameron and Queen Elizabeth II.

I typed "Who is the president of the UK?" to Google and Bing.
Both of them found the following plus some other relevant hits:

Bing and Google
>     Who is the president of the United Kingdom - The Q&A wiki
>     wiki.answers.com › … › United Kingdom › UK Politics
>
>     The United Kingdom is a parliamentary constitutional monarchy
>     and has no president. HM Queen Elizabeth II is Head of State.
>     The Right Honourable David Cameron MP is ...

Of course, Google and Microsoft (Bing) are multi-billion dollar
corporations with huge R & D budgets.  Quepy is OK for homework
exercises in a course on NLP.

DF
> It seems to generate SPARQL without using any ontology.

I read some of the Quepy documentation, which indicates that
they do recognize "classes" and "subclasses".  But Google,
Bing, and many other commercial companies have much richer
resources.

KI
> To conclude, the key point I sought to make via this post is that
> natural language based SPARQL generation is an emerging frontier.

I doubt that.  Neither Google nor Bing use RDF, SPARQL, or OWL.
Instead, they do pattern matching directly to the raw, unannotated
natural language texts.

I'll admit that there is a large and growing corpus of tagged
documents, for which RDF processing can be useful.  But the raw NL
documents are growing at a much faster rate than the tagging.

KI
> that never happened on the SQL front, in any kind of webby way, of
> course, I would happily look at a live Web accessible SQL based system
> to see if it can match the most basic SPARQL functionality demonstrated
> by Quepy fronting SPARQL

SQL has a superset of the expressive power of RDF.  People had developed
very sophisticated NLP query systems for DB queries 30 years ago.  For
examples, see http://www.jfsowa.com/pubs/futures.pdf .  Most of those
systems never became profitable or they remained niche products.

But some of them have been connected to speech systems for those
annoying automated telephone systems.  Replacing SQL with SPARQL
will do nothing to make them less annoying.

As for webifying a version of SQL, that would be fairly easy to do.
In Fact, Tim B-L included SQL as one of the languages that had to be
supported.  (See his DAML proposal of 2000.)  Oracle and IBM do that
with their products.  But the clueless academics who jumped on the
DAML bandwagon refused to support SQL.

John


_________________________________________________________________
Message Archives: http://ontolog.cim3.net/forum/ontolog-forum/  
Config Subscr: http://ontolog.cim3.net/mailman/listinfo/ontolog-forum/  
Unsubscribe: mailto:ontolog-forum-leave@xxxxxxxxxxxxxxxx
Shared Files: http://ontolog.cim3.net/file/
Community Wiki: http://ontolog.cim3.net/wiki/ 
To join: http://ontolog.cim3.net/cgi-bin/wiki.pl?WikiHomePage#nid1J    (01)

<Prev in Thread] Current Thread [Next in Thread>