ontology-summit
[Top] [All Lists]

Re: [ontology-summit] Big Data definition

To: Ontology Summit 2014 discussion <ontology-summit@xxxxxxxxxxxxxxxx>
From: David Price <dprice@xxxxxxxxxxxxxxx>
Date: Thu, 23 Jan 2014 10:49:21 +0000
Message-id: <10C83975-EA26-4A34-AA11-C160C001A1A9@xxxxxxxxxxxxxxx>
NIST created a working group on this topic.

From : NIST Big Data
Definitions and Taxonomies
Version 0.9
Definitions & Taxonomies Subgroup
NIST Big Data Working Group (NBD-WG)
November, 2013

Big data is used as a concept that refers to the inability of traditional data architectures to efficiently handle the new data sets. Characteristics that force a new architecture to achieve efficiencies are the dataset-at-rest characteristics volume, and variety of data from multiple domains or types; and from the data-in-motion characteristics of velocity, or rate of flow, and variability, as the change in velocity. Each of these characteristics result in different architectures or different data lifecycle process orderings to achieve needed efficiencies. A number of other terms (particularly anything that can be expressed using a term starting with the letter ‘V”) are also used, but a number of these refer to the analytics, not to new big data architectures.

Their docs are at :


Cheers,
David

UK +44 7788 561308
US +1 336 283 0606




On 23 Jan 2014, at 10:08, Ian Glendinning wrote:

I tend to use a working definition more like this:

Data is "big" when its size makes it large enough that patterns
discoverable within it are statistically more significant than any
patterns imposed by original schemas.

Ian

On Thu, Jan 23, 2014 at 12:32 AM, John McClure <jmcclure@xxxxxxxxxxxxxx> wrote:
Hello,
Is the following definition for "Big Data" from Wikipedia what we're working
with? Or, in the context of Semantic Web and Applied Ontologies, are we
presuming "Big Semantic Data"?
Thanks for your comments.

Big data[1][2] is the term for a collection of data sets so large and
complex that it becomes difficult to process using on-hand database
management tools or traditional data processing applications. The challenges
include capture, curation, storage,[3] search, sharing, transfer,
analysis,[4] and visualization. The trend to larger data sets is due to the
additional information derivable from analysis of a single large set of
related data, as compared to separate smaller sets with the same total
amount of data, allowing correlations to be found to "spot business trends,
determine quality of research, prevent diseases, link legal citations,
combat crime, and determine real-time roadway traffic conditions."[5][6][7]




_________________________________________________________________
Msg Archives: http://ontolog.cim3.net/forum/ontology-summit/
Subscribe/Config: http://ontolog.cim3.net/mailman/listinfo/ontology-summit/
Unsubscribe: mailto:ontology-summit-leave@xxxxxxxxxxxxxxxx
Community Files: http://ontolog.cim3.net/file/work/OntologySummit2014/
Community Wiki: http://ontolog.cim3.net/cgi-bin/wiki.pl?OntologySummit2014
Community Portal: http://ontolog.cim3.net/wiki/


_________________________________________________________________
Msg Archives: http://ontolog.cim3.net/forum/ontology-summit/   
Subscribe/Config: http://ontolog.cim3.net/mailman/listinfo/ontology-summit/  
Unsubscribe: mailto:ontology-summit-leave@xxxxxxxxxxxxxxxx
Community Files: http://ontolog.cim3.net/file/work/OntologySummit2014/
Community Wiki: http://ontolog.cim3.net/cgi-bin/wiki.pl?OntologySummit2014  
Community Portal: http://ontolog.cim3.net/wiki/


_________________________________________________________________
Msg Archives: http://ontolog.cim3.net/forum/ontology-summit/   
Subscribe/Config: http://ontolog.cim3.net/mailman/listinfo/ontology-summit/  
Unsubscribe: mailto:ontology-summit-leave@xxxxxxxxxxxxxxxx
Community Files: http://ontolog.cim3.net/file/work/OntologySummit2014/
Community Wiki: http://ontolog.cim3.net/cgi-bin/wiki.pl?OntologySummit2014  
Community Portal: http://ontolog.cim3.net/wiki/     (01)
<Prev in Thread] Current Thread [Next in Thread>