ontolog-forum
[Top] [All Lists]

Re: [ontolog-forum] Natural Language based SPARQL Generator

To: ontolog-forum@xxxxxxxxxxxxxxxx
From: Kingsley Idehen <kidehen@xxxxxxxxxxxxxx>
Date: Fri, 01 Feb 2013 11:21:11 -0500
Message-id: <510BEB77.9040405@xxxxxxxxxxxxxx>
On 2/1/13 9:18 AM, Adrian Walker wrote:
Hi John,

You wrote...

    As for webifying a version of SQL, that would be fairly easy to do.

In one sense that's been in operation of a number of years now.

MySql and Oracle both support remote querying.  We use this in our system in "linking rules" [1,2] such as

I've asked you before, and I'll ask you once again: can you share a hyperlink based super key from your system that resolves to an entity relationship graph? Even better, one endowed with machine and human comprehensible semantics. That's what an open data oriented system should deliver in this age of the World Wide Web. Anything less than that is just another data silo in disguise.

url:reengineeringllc.com  dbms:mysql  dbname:mysql tablename:T2
 port:3306  id:mysql  password:***
--------------------------------------------------------------
in this-month the refinery this-name has committed to schedule 
 this-amount gallons of this-product
So Yes, easy to do for small-to-medium data, but both SQL and SPARQL will have efficiency issues with multi-site distributed big data.
No, don't make generalizations like that for either. It all depends on the DBMS engine behind either query language. The problem that SQL has is that becomes exponentially expensive as data disparity increases. The schemas of conventional RDBMS engines are overbearing and their explicit joins are expensive.

SPARQL leverages the fact that schemas are very loosely coupled (they can be factored in after data creation) and implicit nature of entity relationships that constitute the relation tuples. In addition, an RDF store handles reference types naturally so you don't need inconsistently implemented reference (as part of DDL) and de-refreference functions (as part of query patterns) .

Kingsley

                          -- Adrian

[1]  www.reengineeringllc.com/demo_agents/Oil-IndustrySupplyChain1MySql1.agent

[2]  www.reengineeringllc.com/IBL_tutorial_part4.html

Internet Business Logic
A Wiki and SOA Endpoint for Executable Open Vocabulary English Q/A over SQL and RDF
Online at www.reengineeringllc.com  
Shared use is free, and there are no advertisements

Adrian Walker
Reengineering

On Thu, Jan 31, 2013 at 11:36 PM, John F Sowa <sowa@xxxxxxxxxxx> wrote:
Kingsley and Doug,

The Quepy developers use the NLTK toolkit, which is an open-source
set of Python-based software for NLP processing.  It's widely used
for teaching purposes.  But it is not state of the art NLP software.

KI
> it's only using DBpedia whereas if it used the LOD cloud cache
> there would be a much broader knowledgebase.

Google answered every one of my five questions, but Quepy could
only answer one of them.  I also tried Bing, which did just
as well as Google on all five.

In fact, Bing got a better answer for the question "When did
the Revolutionary War end?"  In addition to hits that were similar
to Google's, Bing gave the following answer above the list of hits:

Bing
> The American Revolutionary War began on Wednesday, April 19, 1775
> and ended on Wednesday, September 3, 1783.

DF
> I asked for the President of the UK, and since the SPARQL query
> was for a leader, not a president, the answers returned were
> David Cameron and Queen Elizabeth II.

I typed "Who is the president of the UK?" to Google and Bing.
Both of them found the following plus some other relevant hits:

Bing and Google
>     Who is the president of the United Kingdom - The Q&A wiki
>     wiki.answers.com › … › United Kingdom › UK Politics
>
>     The United Kingdom is a parliamentary constitutional monarchy
>     and has no president. HM Queen Elizabeth II is Head of State.
>     The Right Honourable David Cameron MP is ...

Of course, Google and Microsoft (Bing) are multi-billion dollar
corporations with huge R & D budgets.  Quepy is OK for homework
exercises in a course on NLP.

DF
> It seems to generate SPARQL without using any ontology.

I read some of the Quepy documentation, which indicates that
they do recognize "classes" and "subclasses".  But Google,
Bing, and many other commercial companies have much richer
resources.

KI
> To conclude, the key point I sought to make via this post is that
> natural language based SPARQL generation is an emerging frontier.

I doubt that.  Neither Google nor Bing use RDF, SPARQL, or OWL.
Instead, they do pattern matching directly to the raw, unannotated
natural language texts.

I'll admit that there is a large and growing corpus of tagged
documents, for which RDF processing can be useful.  But the raw NL
documents are growing at a much faster rate than the tagging.

KI
> that never happened on the SQL front, in any kind of webby way, of
> course, I would happily look at a live Web accessible SQL based system
> to see if it can match the most basic SPARQL functionality demonstrated
> by Quepy fronting SPARQL

SQL has a superset of the expressive power of RDF.  People had developed
very sophisticated NLP query systems for DB queries 30 years ago.  For
examples, see http://www.jfsowa.com/pubs/futures.pdf .  Most of those
systems never became profitable or they remained niche products.

But some of them have been connected to speech systems for those
annoying automated telephone systems.  Replacing SQL with SPARQL
will do nothing to make them less annoying.

As for webifying a version of SQL, that would be fairly easy to do.
In Fact, Tim B-L included SQL as one of the languages that had to be
supported.  (See his DAML proposal of 2000.)  Oracle and IBM do that
with their products.  But the clueless academics who jumped on the
DAML bandwagon refused to support SQL.

John



 
_________________________________________________________________
Message Archives: http://ontolog.cim3.net/forum/ontolog-forum/  
Config Subscr: http://ontolog.cim3.net/mailman/listinfo/ontolog-forum/  
Unsubscribe: mailto:ontolog-forum-leave@xxxxxxxxxxxxxxxx
Shared Files: http://ontolog.cim3.net/file/
Community Wiki: http://ontolog.cim3.net/wiki/ 
To join: http://ontolog.cim3.net/cgi-bin/wiki.pl?WikiHomePage#nid1J
 


-- 

Regards,

Kingsley Idehen	      
Founder & CEO 
OpenLink Software     
Company Web: http://www.openlinksw.com
Personal Weblog: http://www.openlinksw.com/blog/~kidehen
Twitter/Identi.ca handle: @kidehen
Google+ Profile: https://plus.google.com/112399767740508618350/about
LinkedIn Profile: http://www.linkedin.com/in/kidehen




Attachment: smime.p7s
Description: S/MIME Cryptographic Signature


_________________________________________________________________
Message Archives: http://ontolog.cim3.net/forum/ontolog-forum/  
Config Subscr: http://ontolog.cim3.net/mailman/listinfo/ontolog-forum/  
Unsubscribe: mailto:ontolog-forum-leave@xxxxxxxxxxxxxxxx
Shared Files: http://ontolog.cim3.net/file/
Community Wiki: http://ontolog.cim3.net/wiki/ 
To join: http://ontolog.cim3.net/cgi-bin/wiki.pl?WikiHomePage#nid1J    (01)

<Prev in Thread] Current Thread [Next in Thread>