OntologySummit2014 session-05 Track-D: Tackling the Variety Problem in Big Data - I - Thu 2014-02-13    (466N)

Panelists / Briefings:    (466R)

Archives:    (466W)

Abstract    (4686)

Track D: "Tackling the Variety Problem in Big Data - I ... intro slides    (4687)

This is our 9th OntologySummit, a joint initiative by Ontolog, NIST, NCOR, NCBO, IAOA & NCO_NITRD with the support of our co-sponsors.    (4688)

Since the beginnings of the Semantic Web, ontologies have played key roles in the design and deployment of new semantic technologies. Yet over the years, the level of collaboration between the Semantic Web and Applied Ontology communities has been much less than expected. Within Big Data applications, ontologies appear to have had little impact.    (4689)

This year's Ontology Summit is an opportunity for building bridges between the Semantic Web, Linked Data, Big Data, and Applied Ontology communities. On the one hand, the Semantic Web, Linked Data, and Big Data communities can bring a wide array of real problems (such as performance and scalability challenges and the variety problem in Big Data) and technologies (automated reasoning tools) that can make use of ontologies. On the other hand, the Applied Ontology community can bring a large body of common reusable content (ontologies) and ontological analysis techniques. Identifying and overcoming ontology engineering bottlenecks is critical for all communities.    (468A)

OntologySummit2014 will pose and address the primary challenges in these areas of interaction among the different communities. The Summit activities will bring together insights and methods from these different communities, synthesize new insights, and disseminate knowledge across field boundaries.    (468B)

At the Launch Event on 16 Jan 2014, the organizing team has provided an overview of the program, and how we will be framing the discourse. Today's session (OntologySummit2014 session-05) is the first virtual panel session featured by Track-D, which focuses on "Tackling the Variety Problem in Big Data."    (468C)

This session will begin with an introduction to ontologies and Big Data and will then proceed to a series of three talks that discuss the integration of ontologies with Big Data. The focus will be on those aspects of Big Data that are most likely to benefit from the use of ontologies, primarily the problem of managing the complexity and variety of the data being processed and analyzed. The panelists will be discussing the use of semantic technologies for:    (46C9)

After the panelists briefings, there will be time for Q&A and an open discussion among the panel and all participants.    (468E)

For more information about Track D, see Track D page.    (46CE)

Please add your input to the discussion at: OntologySummit2014_Tackling_Variety_in_BigData_CommunityInput    (46CF)

See more details at: OntologySummit2014 (homepage for this summit)    (468F)

Briefings:    (468G)

Agenda:    (468P)

OntologySummit2014 session-05 Track-D: Tackling the Variety Problem in Big Data-I    (468Q)

Session Format: this is a virtual session conducted over an augmented conference call    (468R)

Proceedings    (468W)

Please refer to the above    (468X)

IM Chat Transcript captured during the session:    (468Y)

 see raw transcript here.    (468Z)
 (for better clarity, the version below is a re-organized and lightly edited chat-transcript.)
 Participants are welcome to make light edits to their own contributions as they see fit.    (4690)
 -- begin in-session chat-transcript --    (4691)
	------
	Chat transcript from room: summit_20140213
	2014-02-13 GMT-08:00 [PST]
	------    (46F4)
	[9:23] PeterYim: Welcome to the    (46F5)
	 = OntologySummit2014 session-05 Track-D: Tackling the Variety Problem in Big Data - I - Thu 2014-02-13 =    (46F6)
	Summit Theme: OntologySummit2014: "Big Data and Semantic Web Meet Applied Ontology"    (46F7)
	Session Topic: Track D: "Tackling the Variety Problem in Big Data - I"    (46F8)
	Session Co-chairs: 
	  Professor KenBaclawski (Northeastern University)
	  Professor AnneThessen (Arizona State University)    (46F9)
	Briefings:    (46FA)
	* Professor KenBaclawski (Northeastern University) - Tackling the Variety Problem in Big Data - An Introduction"    (46FB)
	* Mr. EricChan (Oracle) - "Enabling OODA Loop with Information Technology"    (46FC)
	* Mr. NathanWilson (Encyclopedia of Life) - "The Semantic Underpinnings of EOL TraitBank"    (46FD)
	* Ms. RuthDuerr (National Snow and Ice Data Center) - "Semantics and the SSIII Project"    (46FE)
	Logistics:    (46FF)
	* Refer to details on session page at: http://ontolog.cim3.net/cgi-bin/wiki.pl?ConferenceCall_2014_02_13    (46FG)
	* (if you haven't already done so) please click on "settings" (top center) and morph from "anonymous" to your RealName; also please enable "Show timestamps" while there.    (46FH)
	* Mute control (phone keypad): *7 to un-mute ... *6 to mute    (46FI)
	* Attn: Skype users ... see: http://ontolog.cim3.net/cgi-bin/wiki.pl?ConferenceCall_2014_02_13#nid467E
	** you may connect to (the skypeID) "joinconference" whether or not it indicates that it is online 
	   (i.e. even if it says it is "offline," you should still be able to connect to it.)
	** if you are using skype and the connection to "joinconference" is not holding up, try using (your favorite POTS or 
	   VoIP line, etc.) either your phone, skype-out or google-voice and call the US dial-in number: +1 (206) 402-0100 
	   ... when prompted enter Conference ID: 141184#
	** Can't find Skype Dial pad?
	*** for Windows Skype users: Can't find Skype Dial pad? ... it's under the "Call" dropdown menu as "Show Dial pad"
	*** for Linux Skype users: if the dialpad button is not shown in the call window you need to press the "d" hotkey to enable it    (46FJ)
	* when posting in this Chat-room, kindly observe the following ...
	** whenever a name is used, please use the full WikiWord name format (every time you don't, some volunteer will have to make an edit afterwards)
	** always provide context (like: "[ref. JaneDoe's slide#12], I think the point about context is great" ... rather than "that's great!" 
	   as the latter would mean very little in the archives.)
	** when responding to a specific individual's earlier remarks, please cite his/her full WikiWord names *and* 
	   the timestamp (in PST) of his/her post that you are responding to (e.g. "@JaneDoe [11:09] - I agree, but, ...")
	** use fully qualified url's (include http:// ) without symbols (like punctuations or parentheses, etc.) right before of after that URL    (46FK)
	Attendees: AbhayKashyap, AdelGhoneimy, AliHashemi, AmandaVizedom, AnatolyLevenchuk, 
	AndreaWesterinen, AnneThessen, BobMorris, BobbinTeegarden, BruceBray, CarmenChui, CarolBean, 
	ChristineKapp, ChristophLange, ConradBeaulieu, DennisWisnosky, DennisPierson, DieterGawlick, 
	DouglasDonahue, EdBernot, EricChan, FrancescaQuattri, GaryBergCross, GlennFreytag, HaroldBoley, 
	JacoDuPreez, KenBaclawski, LeoObrst, LesMorgan, LianaKiff, MatthewLange, MeganKatsumi, 
	MichaelGruninger, MikeBennett, NancyWiegand, NathanWilson, PavithraKenjige, PeterYim, RamSriram, 
	RexBrooks, RichardMartin, RuthDuerr, ShahanKhatchadourian, SimonSpero, SteveRay, SundayOjo, 
	TerryLongstreth, ToddSchneider, TorstenHahmann, UriShani, VictorAgroskin,    (46FL)
	 == Proceedings ==    (46FM)
	[8:32] anonymous morphed into NathanWilson    (46FN)
	[8:42] anonymous morphed into LesMorgan    (46FO)
	[8:49] anonymous morphed into NathanWilson    (46FP)
	[9:02] anonymous morphed into FrancescaQuattri    (46FQ)
	[9:16] anonymous morphed into FrancescaQuattri    (46FR)
	[9:21] ericChan morphed into EricChan    (46FS)
	[9:23] Anne Thessen morphed into AnneThessen    (46FT)
	[9:25] anonymous morphed into BruceBray    (46FU)
	[9:26] EdBernot: Good day to all.    (46FV)
	[9:27] anonymous morphed into BobMorris    (46FW)
	[9:31] anonymous morphed into TerryLongstreth    (46FX)
	[9:33] anonymous1 morphed into BobbinTeegarden    (46FY)
	[9:34] anonymous morphed into ShahanKhatchadourian    (46FZ)
	[9:37] anonymous morphed into DieterGawlick    (46G0)
	[9:38] PeterYim: == KenBaclawski starts the session on behalf of the session co-chairs ... see 
	slides under: http://ontolog.cim3.net/cgi-bin/wiki.pl?ConferenceCall_2014_02_13#nid466Z    (46G1)
	[9:42] anonymous morphed into TorstenHahmann    (46G2)
	[9:50] anonymous1 morphed into RamSriram    (46G3)
	[9:51] PeterYim: == EricChan presenting ...    (46G4)
	[9:51] AnneThessen: The really interesting thing about the diagram on EricChan slide #3 is that it 
	looks like the scientific process    (46G5)
	[9:52] anonymous1 morphed into FrancescaQuattri    (46G6)
	[9:54] anonymous morphed into AdelGhoneimy    (46G7)
	[9:55] BobMorris: @Anne: But it is unclear to me where "Evidence" should fit.    (46G8)
	[9:57] RuthDuerr: @Anne: Well your observation goes along with what I've been saying for years about 
	most people who use experience or other evidence to make decisions are actually doing science 
	whether they know it or not    (46G9)
	[9:59] AdelGhoneimy: I totally agree. KIDS is very much in line with the scientific method    (46GA)
	[10:02] AnneThessen: I also like the evolutionary applications on EricChan slide #8 can accommodate 
	changing data and knowledge. Again, this is like the scientific process.    (46GB)
	[10:03] AdelGhoneimy: Our intent in KIDS is to support knowledge evolution on personal/community level    (46GC)
	[10:03] AnneThessen: @ BobMorris: Maybe evidence would fit into FIHD? Or it would be another layer 
	that is used to build the "facts"    (46GD)
	[10:04] SimonSpero: @Anne: if it changes, was it knowledge?    (46GE)
	[10:06] AdelGhoneimy: Could you elaborate please    (46GF)
	[10:06] MatthewLange: Anyone, what is FSD?    (46GG)
	[10:07] AnneThessen: @SimonSpero: Perhaps...because systems change over time. However knowledge can 
	change because we got it wrong before. Maybe that's not really knowledge.    (46GH)
	[10:07] AdelGhoneimy: FSD = Flexible Schema Data    (46GI)
	[10:08] AdelGhoneimy: Actually system changes because of knowledge evolution    (46GJ)
	[10:08] MatthewLange: Thanks...would be good to have acronyms defined in slides    (46GK)
	[10:08] PeterYim: ... slide#17 now    (46GL)
	[10:08] PeterYim: ... slide#18 now    (46GM)
	[10:11] AnneThessen: @AdelGhoneimy: I was thinking more of natural systems. Like the oceans, climate 
	or an ecosystem. What we knew about how an ecosystem worked 200 years ago may not be valid now.    (46GN)
	[10:17] AdelGhoneimy: @AnneThessen: Yes, I agree and due to KIDS provenance you can retrospective 
	view old data in light of new knowledge. Especially for understanding trends    (46GO)
	[10:11] PeterYim: the Ontolog demo Eric mentioned in slide#16 ref. the capture of tacit knowledge 
	among Ontolog participants can be found at the talk - "Application of OASIS Integrated Collaboration 
	Object Model (ICOM) with Oracle Database 11g Semantic Technologies" by : EricChan, RameshVasudevan & 
	AlanWu (ICOM; Oracle); DeirdreLee & LauraDragan (ICOM; DERI) - see: 
	http://ontolog.cim3.net/cgi-bin/wiki.pl?ConferenceCall_2010_08_05#nid2FC6    (46GP)
	[10:13] PeterYim: == NathanWilson presenting ...    (46GQ)
	[10:18] KenBaclawski: The slides are at 
	http://ontolog.cim3.net/file/work/OntologySummit2014/2014-02-13_OntologySummit2014_Tackling-the-Vari 
	ety-Problem-in-Big-Data-1/OntologySummit2014_Semantic-Underpinnings-of-EOL-TraitBank_w-Demo-NathanWi 
	lson_20140213.pdf    (46GR)
	[10:21] DouglasDonahue: And, that is/will be unto the English language; only!    (46GS)
	[10:24] PeterYim: @DouglasDonahue & ALL: please include some context (like ref. NathanWilson's 
	slide#5 blah, blah, blah) when posting a message ... so the archived chat-transcript will make sense 
	to people reading it at a later time (say 3 years from now)    (46GT)
	[10:29] DouglasDonahue: Sorry. Yes, of course. In description of language problems associated 
	w/semantics associated with identifying terms. Homophony being a consequent, or perhaps an 
	underlying 'feature' of the language. Only trying to point out, or perhaps hint, that such problems 
	become exacerbated with the I18N as well.    (46GU)
	[10:31] PeterYim: @DouglasDonahue, thank you (no problem)    (46GV)
	[10:23] anonymous morphed into AbhayKashyap    (46GW)
	- [10:24] LianaKiff: Large portions of this presentation don't appear in the PDF.    (46GX)
	- [10:25] AliHashemi: @Liana, try downloading the slides. It wasn't rendering properly in the 
	browser, but loads fine in a dedicated pdf viewer    (46GY)
	- [10:32] UriShani: no text on slide 20?    (46GZ)
	- [10:33] PeterYim: @Uri - ref. above [10:25] AliHashemi: try running the slides from a pdf viewer 
	on your desktop after downloading it    (46H0)
	[10:33] MatthewLange: has EOL open-sourced the PHPO backend code, and Ruby front-end code?    (46H1)
	[10:36] AnneThessen: @ MatthewLange: All EOL code is open source as far as I know    (46H2)
	[10:36] MatthewLange: @AnneThessen thanks, do you know where it is available, GitHub perhaps?    (46H3)
	[10:39] MatthewLange: @AnneThessen hosting in a place like GitHub would enable more rapid (and free) 
	feature enhancements from the community    (46H4)
	[10:40] AnneThessen: @MatthewLange: I do not know where the EOL code is hosted. Perhaps you should 
	ask Nathan at the end.    (46H5)
	[10:42] MatthewLange: @NathanWilson, are you hosting the front-end Ruby, and back-end PHP code in a 
	public repository somewhere, GitHub perhaps?    (46H6)
	[10:35] DouglasDonahue: How about stated, or even inferred, scope of contents elements. I'm 
	supposing Humans are not included, or is this incorrect?    (46H7)
	[10:37] AliHashemi: @Douglas: http://eol.org/pages/42268/overview    (46H8)
	[10:37] AliHashemi: (no predators - hurray!)    (46H9)
	[10:37] DouglasDonahue: Woops, did it again. Thank you for the EOL reference URI.    (46HA)
	[10:38] AliHashemi: and rather: http://eol.org/pages/327955/overview    (46HB)
	[10:38] AliHashemi: (2 predators)    (46HC)
	- [10:38] PeterYim: ... on slide#24 now    (46HD)
	[10:39] PeterYim: == RuthDuerr presenting ...    (46HE)
	- [10:46] PeterYim: ... on slide#5 now    (46HF)
	- [10:47] PeterYim: ... on slide#6 now    (46HG)
	[10:58] GaryBergCross: Ruth's discussion of the series of sea-ice ontologies from standard body 
	terminologies (and models) touches on a modular approach as well as reuse of existing work to good advantage.    (46HH)
	[11:05] PeterYim: ref. RuthDuerr's SSIII sea ice ontology (slide#10) ... time and again, these real 
	application cases demonstrate that starting by "ontologizing" existing standards would make an 
	efficient and effective improvement approach    (46HI)
	[11:07] PeterYim: ... continuing the pursuit of "OntologyBasedStandards" should promise to be a good 
	use of this community's time - ref. http://ontolog.cim3.net/cgi-bin/wiki.pl?OntologyBasedStandards    (46HJ)
	[11:02] anonymous morphed into PavithraKenjige    (46HK)
	- [11:03] PeterYim: ... on slide#13 now    (46HL)
	[11:11] DouglasDonahue: Kudos Semantics-and-the-SSIII-Project, as it ties together the science of 
	the subject itself, in conjunction with a bonfide understanding, and leveraging of the associated IT 
	methodology to bring it to life in real world. Thinking ahead, and provisioning for incorporation, 
	and use of other languages, from the get-go is extremely cool to see up close!    (46HM)
	[11:12] AnneThessen: Just to play devil's advocate. I'm looking at RuthDuerr Slide #23. If these 
	communities have successfully used oral traditions to pass on knowledge for millennia, what do they 
	think technology has to offer? In many ways digital data are far more fragile.    (46HN)
	[11:14] RuthDuerr: @Anne: Well... the elders may like the oral tradition; but their kids all have 
	cell phones and iPads...    (46HO)
	[11:14] GaryBergCross: @Anne Since Digital forms may be more widely shared and they are long lasting 
	they may allow a new path for slow accumulation and consolidation.    (46HP)
	[11:17] BobbinTeegarden: @Ruth What were you thinking about for visualization (and animation over time)?    (46HQ)
	[11:17] MikeBennett: @Anne @Gary @Ruth maybe this is the transformation of the knowledge contained 
	in this planet, from traditional communities to shared digital resources.    (46HR)
	[11:09] PeterYim: == AnatolyLevenchuk making a call for Hackathon Project Proposals ...    (46HS)
	[11:10] GaryBergCross: I note in passing that Apps for Schema.org has been a discussion topic on the 
	Forum. For example, http://ontolog.cim3.net/forum/ontology-summit/2014-02/msg00185.html    (46HT)
	[11:17] GaryBergCross: Peter is there a link to AnatolyLevenchuk's slides???    (46HU)
	[11:18] PeterYim: @Gary, Anatoly's slides is the deck labeled "[4-Hackathon]" under: 
	http://ontolog.cim3.net/cgi-bin/wiki.pl?ConferenceCall_2014_02_13#nid4670    (46HV)
	[11:19] DouglasDonahue: Should the Semantics-and-the-SSIII-Project, not be held as a positive, for 
	oral traditions only data exchange, moving forward then? Such data necessarily is then committed to 
	the Cloud, upon 'discovery', capture, and not only existent in the environment, (where it's 
	likelihood for loss is greater) at some point.    (46HW)
	[11:17] PeterYim: == Q & A and Open Discussion ...    (46HX)
	- [11:17] MatthewLange: how do i unmute again? ... [ *7 to un-mute ; *6 to go back to mute again ]    (46HY)
	[11:20] PeterYim: ref. MatthewLange question about availability of the codes for the various 
	projects, like EOL - please post url's so we can access them    (46HZ)
	[11:19] RuthDuerr: http://code.google.com/p/ssiii/    (46I0)
	[11:21] RuthDuerr: If you go to http://nsidc.org/ssiii there are purl's for each of the ontologies    (46I1)
	[11:21] anonymous morphed into GlennFreytag    (46I2)
	[11:22] NathanWilson: EOL website code is available at: https://github.com/EOL/eol    (46I3)
	[11:23] NathanWilson: Harvesting code is available here: https://github.com/EOL/eol_php_code    (46I4)
	[11:23] NathanWilson: Other EOL related code is available through the GitHub organization: 
	https://github.com/EOL    (46I5)
	[11:25] MatthewLange: Thanks @RuthDuerr and @NathanWilson kudos for open sourcing, and providing the 
	links to the community. Great marketing.    (46I6)
	[11:23] LeoObrst: @EricChan: I'm still not sure what "directive" is. It seems like it's a 
	quasi-belief, and/or an action based on that belief. Since the next step is "fact", I am not sure of 
	the relationships among these. But that's not quite right.    (46I7)
	[11:17] (repeating) BobbinTeegarden: @Ruth What were you thinking about for visualization (and 
	animation over time)?    (46I8)
	[11:27] RuthDuerr: @BobbinTeegarden: Sorry - just saw your post now. We have systems that take a 
	date range, data set name, and a polygon and provide a number of analysis services (reprojection, 
	subsetting, aggregation, etc.). I want to take the polygon's that come out of the semantic search to 
	feed into these other services to produce the analyses based on the semantic input. A full answer 
	might be rather lengthy...    (46I9)
	[11:28] BobbinTeegarden: @Ruth Thank you, would love to hear more if there's a url. Great work!    (46IA)
	[11:38] RuthDuerr: @BobbinTeegarden: Sorry no URL at this point...    (46IB)
	[11:39] BobbinTeegarden: @Ruth look at ICES Foundation, something I'm working with. They're not 
	there yet, but you're not alone ;0)    (46IC)
	[11:29] NathanWilson: Regarding the comments about humans, yes we have a page: 
	http://eol.org/pages/327955/overview    (46ID)
	[11:30] NathanWilson: Also we do have some predators listed: Tiger Shark and surprisingly California Poppy.    (46IE)
	[11:30] NathanWilson: The later is clearly an example of erroneous data :-)    (46IF)
	[11:31] AnneThessen: @NathanWilson: referring to California Poppy listed as human predator, there 
	are carnivorous plants, but that is strange. Because of heroin deaths?    (46IG)
	[11:40] NathanWilson: I'm sure the California Poppy issue is simply a data error probably from our data providers.    (46IH)
	[11:30] NathanWilson: We'll get that cleaned up shortly.    (46II)
	[11:36] AnneThessen: @NathanWilson: Polar bears should certainly be listed as a human predator. It 
	is the only species that actively hunts Homo sapiens.    (46IJ)
	[11:32] DouglasDonahue: In the event that the level of visioning, and provenance, with Ontology 
	development, and instantiation; then shouldn't it be about that time? Such functions are inherent 
	within development. Discussion should probably be warranted among this committee; if it isn't 
	already going on, now?    (46IK)
	[11:38] PeterYim: Please mark you calendars and reserve this time, every Thursday, for the 
	OntologySummit2014 virtual panel session series. In particular ... next Track D session ill be on 
	Thu 2014.03.27 - session-11 Session-06 will be up next Thursday - Thu 2014.02.20 (same time) - 
	OntologySummit2014: Synthesis-I & Communique Table-of-Content Discussion - see developing details 
	at: http://ontolog.cim3.net/cgi-bin/wiki.pl?ConferenceCall_2014_02_20    (46IL)
	[11:38] PeterYim: @ALL: if you are not subscribed to the [ontology-summit] mailing list yet, please 
	do so (to allow yourself to participate in the ongoing asynchronous discourse) - 
	http://ontolog.cim3.net/mailman/listinfo/ontology-summit (or drop me a line - peter.yim [at] cim3.com)    (46IM)
	[11:39] DouglasDonahue: Very cool; cheers    (46IN)
	[11:39] GaryBergCross: Bye    (46IO)
	[11:39] LeoObrst: Great talks and discussion: thanks, all!    (46IP)
	[11:39] PeterYim: -- session ended: 11:36am PST --    (46IQ)
 -- end of in-session chat-transcript --    (4692)

Additional Resources:    (4699)


For the record ...    (469H)

How To Join (while the session is in progress)    (469I)

Conference Call Details    (4675)

Attendees    (4682)