ppy/EarthScienceOntolog-s01_chat-transcript_unedited_20120823a.txt --------- Chat transcript from room: ontolog_20120823 2012-08-23 GMT-08:00 --------- [09:23] Welcome to the = EarthScienceOntolog: Panel Session-01 - Thu 2012-08-23 = Mini-Series Theme: An Earth Science Ontology Dialog ("EarthScienceOntolog") Session Topic: Value Proposition of Ontology and Semantic Technology for the Earth Science Community Session Co-chairs: Dr. LeoObrst and Professor KrishnaSinha Panel-Briefings: * Professor KrishnaSinha - "How Can Semantics Change Data Practices of the EarthCube Geoscience Community?" * Professor KrzysztofJanowicz - "The Value Proposition of Semantic Technologies and Ontologies for the Earth Sciences" * Dr. DaliaVaranka - "Rethinking the Meaning of Data for Integrated Science Problem Solving" * Professor PascalHitzler - "On the uptake of Semantic Web Technologies" * Mr. MikeDean - "Semantic Web for Earth Science" Logistics: * Refer to details on session page at: http://ontolog.cim3.net/cgi-bin/wiki.pl?ConferenceCall_2012_08_23 * (if you haven't already done so) please click on "settings" (top center) and morph from "anonymous" to your RealName * Mute control: *7 to un-mute ... *6 to mute * Can't find Skype Dial pad? ** for Windows Skype users: Can't find Skype Dial pad? ... it's under the "Call" dropdown menu as "Show Dial pad" ** for Linux Skype users: please stay with (or downgrade to) Skype version 2.x for now (as a Dial pad seems to be missing on Linux-based Skype v4.x for skype-calls.) Attendees: Amit Deokar, Anne Thessen, BobSmith, BobbinTeegarden, Chuck Ward, Cybershare - Deana Pennington, David Valentine, Deborah L Nichols, Deborah McGuinness, DicksonLukose, DougFoxvog, Frank Chum, Gary Berg-Cross, Genhan Chen, Joanne Luciano, JoelCarbonera, JohnGraybeal, KenBaclawski, Krzysztof Janowicz, LeoObrst, Mara Abel, Marcio Faerman, Marshall X Ma, MikeBennett, MikeDean, Naicong Li, NancyWiegand, Pascal Hitzler, PatriceSeyed, PatrickVirden, PavithraKenjige, PeterYim, Scott Hills, Scott Peckham, SiriJodha Khalsa, ToddSchneider, TomTinsley, Uma, Yannis Roussakis, Yingjie Hu, alex.shkotin, krishna sinha, mark, == Proceedings: == [08:34] anonymous morphed into dickson-lukose [08:35] dickson-lukose morphed into DicksonLukose [08:58] anonymous morphed into Marcio Faerman [09:25] Krzysztof Janowicz: So dar I cannot connect to the VNC server [09:25] Pascal: yep cannot connect either yet :) [09:26] PeterYim: @Krzysztof & Pascal - don't worry, vnc service is only optional [09:29] anonymous4 morphed into mark [09:30] anonymous2 morphed into Genhan Chen [09:30] anonymous3 morphed into Deborah L Nichols [09:30] anonymous1 morphed into Yingjie Hu [09:30] anonymous2 morphed into Naicong Li [09:30] anonymous3 morphed into JoelCarbonera [09:31] anonymous morphed into TomTinsley [09:31] anonymous1 morphed into BobSmith [09:31] anonymous1 morphed into Anne Thessen [09:32] anonymous3 morphed into PatrickVirden [09:32] anonymous2 morphed into Scott Hills [09:32] anonymous4 morphed into Chuck Ward [09:32] anonymous1 morphed into Cybershare - Deana Pennington [09:33] Yingjie Hu: I can see the vnc, but it doesn't allow me to input the password [09:35] Krzysztof Janowicz: same for me, you can download the slides at http://ontolog.cim3.net/cgi-bin/wiki.pl?ConferenceCall_2012_08_23 [09:35] anonymous1 morphed into alex.shkotin [09:35] Yingjie Hu: OK, thank you [09:37] anonymous1 morphed into David Valentine [09:39] PeterYim: == LeoObrst started the session with the introductory slides [09:39] anonymous1 morphed into Uma [09:40] anonymous1 morphed into BobbinTeegarden [09:40] ToddSchneider: Leo, what does 'multi-level' ontology mean? [09:41] anonymous1 morphed into Mara Abel [09:43] anonymous1 morphed into Marshall X Ma [09:43] anonymous morphed into PatriceSeyed [09:43] PeterYim: == KrishnaSinha presenting ... [09:47] LeoObrst: "Multi-level" can mean 2 things: 1) the typical foundational, midlevel/super domain, and domain levels of ontological architecture, but also 2) multiple levels of granularity and contexts/perspectives in specific domains such as Earth Science. [09:49] PeterYim: Krishna is on slide#5 now [09:49] PeterYim: #6 [09:49] anonymous morphed into Yannis Roussakis [09:50] Pascal morphed into Pascal Hitzler [09:51] PeterYim: @Leo - please prompt those on the call to log into the chat-room again (we have 45 people on the call now) [09:56] Scott Hills: Regarding discovery of data: This does not just apply to that held by individuals, but to that maintained in "Data Centers" as well. How many Data Centers might contain data of interest, and how many of those do you know about, and know how to use (each often has a different UI)? [09:57] anonymous1 morphed into SiriJodha Khalsa [09:58] anonymous morphed into PaithraKenjige [09:59] anonymous1 morphed into Deborah McGuinness [10:00] PeterYim: == KrzysztofJanowicz presenting ... [10:00] PaithraKenjige: Did he say that we have such a semantic infrastructure already available?? [10:01] Chuck Ward: Have you also considered community expecations such as data archiving requirements with journal articles? e.g. evolutionary biology [10:02] anonymous morphed into krishna sinha [10:09] SiriJodha Khalsa: what is the advantage of creating or using an ontology for encoding provenance/quality/appropriateness-of-use information beyond creating metadata according to a well-defined standards like ISO 191**? [10:12] krishna sinha1: Hi Scott, [10:12] krishna sinha1: Scott, [10:13] PaithraKenjige morphed into PavithraKenjige [10:14] krishna sinha1: The data centers are very visible and their data content is well known; an ontology framework that can map the data held in different centers would be welcome [10:15] Pascal Hitzler: @Krishna - what do you mean with "framework" here? [10:15] anonymous morphed into DougFoxvog [10:15] Anne Thessen: This is true for many data centers, but there are some, such as Dryad that takes files with little to no guidance as to structure or content [10:15] anonymous morphed into Scott Peckham [10:16] Genhan Chen: To Krishna: could you provide an example to explain the ontology framework on your slide #11? [10:18] krishna sinha1: @Pascal- an ontology based infrastructure that points to the data content of data centers [10:19] Scott Hills: Krishna, we may have different definitions for visibility. They may be accessible (your sense of "visible?"), but I question how many people know *where* to access them. For example, if I ask for the data centers that contain geochemical data, or seismic data, or well cores (take your pick), how many people could offer a comprehensive list? [10:19] Amit Deokar: We notice a lot of emphasis on managing data. What are your views on how important it is to manage associated computational models, share them, annotate them, and so forth so that they are accessible like data, avoiding problems of reinventing the wheel? [10:20] Amit Deokar: The above question was for Krishna [10:20] PeterYim: == DaliaVaranka presenting ... [10:22] krishna sinha1: @ Amit- i support an ontology framework that organizes services that would include existing models and other computational tools..its the service ontology on slide 11 [10:23] LeoObrst: @Krzysztof: you cited some research concerning the last bullet on translation between conceptual models on slide 5. Can you provide that reference? [10:26] krishna sinha1: @ Scott-- a high level ontology that can capture the data types held at Centers would enable a user to find which centers hold what types of data [10:27] Krzysztof Janowicz: Yes Leo, for instance the work of Mark Gahegan. This paper may be a good starting point and overview: A Semantic Web Map Mediation Service: Interactive Redesign and Sharing of Map Legends by Mark Gahegan, Will Smart, Sina Masoud-Ansari, and Brandon Whitehead. [10:29] PeterYim: == PascalHitzler presenting ... [10:30] Krzysztof Janowicz: "Did he say that we have such a semantic infrastructure already available??" --> I would argue so [10:37] JohnGraybeal: To Siria Jodha (Hi!): The advantages of using ontologies for encoding any content, *including* provenance/quality/appropriateness-of-use information: (1) Readers of the metadata know exactly what you mean ('exactly', relatively speaking), (2) there is better mapping/understanding across communities for these concepts. (While I appreciate ISO 191** provides vocabularies, I often find them underdefined, incomplete, and locally inconsistent. And often ISO 191** doesn't provide a vocabulary, so the content is wide open.) [10:41] JohnGraybeal: @Krishna: Re your endorsement of data citation: Why should this community particularly endorse data citation, more than it should endorse all the data practices that benefit from ontologies: description, provenance, versioning, unique identification, ...? All seem noble causes. [10:41] Scott Hills: Krishna, our thinking is starting to converge. The point I was working toward: I believe a good argument can be made for the need for ontologies that enable discovery of data across the landscape of proliferating Data Centers. I suspect the same ontologies would help enable discovery of data published by individuals. That said, I suspect the "level" of such ontologies need to go beyond data type, or their utility will be very limited. [10:44] Krzysztof Janowicz: Scott, yes ontologies should always be 'more' than just data models [10:47] krishna sinha1: @Scott--I agree, and I also think most users would be comfortable with high to mediuum level granularity [10:48] PeterYim: == MikeDean presenting ... [10:49] DougFoxvog: Scott Hill asks above "how many people could offer a comprehensive list [of data centers that contain various types of geochemical data]". This seems to me to actually be a call for a knowledge base of data centers containing information about what types of data are covered in the various data centers. With linked data, such a knowledge base need not be centrally located, merely a vocabulary for making the statements needs to be defined, each data base could specify what types of data it covers, and the linked data needs to be made widely available. [10:51] anonymous morphed into Marshall X Ma [10:52] PeterYim: whoever it was that said, "Krzysztof, fully agree wit your view. Understanding formal semantic and reusing well fundamented ontologies. " ... the space to type your message is next to the "send" button (at the bottom) and not the box next to the "hand" button [10:52] Scott Hills: Pascal, woudld you agree that whether an ontology is high-quality depends on how well it satisfies the needs for which it was engineered? [10:55] Pascal Hitzler: @Scott - this looks reasonable, however I would think that more can be said about the "quality" issue. In fact, there is quite a bit of work on "Evaluation of Ontologies", and there is also a established workshop series about this topic. [10:57] BobbinTeegarden: @Pascal URL for the workshop? [10:58] Pascal Hitzler: I suggest to start with Denny Vrandecic's PhD thesis: http://www.aifb.kit.edu/images/b/b5/OntologyEvaluation.pdf [10:58] LeoObrst: @Krishna: I think one very important issue you raised is the incentive for individuals (and organizations) for sharing data: in the research community, what is the incentive for a researcher to expose his/her data and its meaning? What supports scholarly acknowledgment of the contribution? [11:01] Pascal Hitzler: The workshop series - see e.g. http://km.aifb.kit.edu/ws/eon2007/ . [11:02] krishna sinha1: @Leo--There are two ways to expose data..regular publications for which there is a citation index ( often used for promotions etc) ..the critical issue is what incentives we can support to have people share data thats not formally published..data citation index is one way to go [11:02] PeterYim: == open discussion commences ... [11:03] Pascal Hitzler: re. "Linked Data Quality", have a look e.g. at http://blog.semantic-web.at/2012/08/09/whats-wrong-with-linked-data/ [11:03] Marshall X Ma: @LeoObrst: Such as, a stable repository to upload and describe (with a metadata form) the dataset, and easy way to assign a DOI for it. [11:05] Pascal Hitzler: Leo/Krishna: The Semantic Web journal, for example, recently established "Linked Dataset Descriptions" as regular paper type. The purpose of this introduction is to generate "academic incentive" for the production of high-quality Linked Datasets. [11:07] Marshall X Ma: @Pascal: Another journal (Geoscience Data Journal) with similar ideas: http://onlinelibrary.wiley.com/journal/10.1002/%28ISSN%292049-6060 [11:11] ToddSchneider: John, part of the needed infrastructure is the Open Ontology Repository. [11:12] JohnGraybeal: Yes (the question was, "What has to happen on the ground for these advantages to be obtained?" With particular attention to the practical realities of the practicing data managers. [11:13] JohnGraybeal: @Todd: As provider of an OOR equivalent (MMI's ORR), I want to go farther. Let's assume there's a perfect repository in existence. And even, really good vocabularies from communities. How do they integrate this into their developed data systems? [11:14] ToddSchneider: John, education. It would appear that many people working in the area of 'semantic technologies' don't know enough. Dalia addressed this. [11:14] Scott Hills: Pascal, thanks for the URL. I'll look through the material (up through 2007?). Any chance I'll find suggestions to answers like, what applications are best served using SKOS vs. OWL DL with domain-specific properties? [11:15] Marcio Faerman: My question referred on how to justify and support, throughout the community, @Krzysztof Janowicz "Final Thought" slide - "... It seems that we hope to arrive at semantic interoperability by standardization instead of investing into research on alignment and semantic translation to reduced incompatibility. This may turn out to be a fundamental misconception. I believe that standardization is the more difficult of both approaches..." [11:16] Krzysztof Janowicz: John, IMHO it needs a earth science specific tutorial and a set of reusable building blocks [11:16] MikeBennett: Something we are finding is the distinction between one ontology for one application, versus overall domain ontology (standard) which is use-case neutral. Both are vital. And different. [11:18] Marcio Faerman: Then I followed up with the question about conciliating multi-disciplinary research and knowledge discoverability through semantic translation. [11:18] Pascal Hitzler: @Scott - I would guess that answers to such specific questions can probably not be found easily. "Best practices" in applying semantic technologies are not (yet) written up in any concise form, afaik. Note that it is only two years or so that we got textbooks which cover the bare basics of the discipline. Currently, I guess the best approach to get such questions answered is to consult a specialist. [11:18] LeoObrst: @Mike: do you have a url for the LOD Framework initiative? [11:19] Krzysztof Janowicz: MikeBennett, yes -- I am not arguing against higher level ontologies. It needs both. [11:19] MikeBennett: @Krzystof agreed. Work to be done on the heuristics for extracting a relevant sub-set of domain ontology for a given use case. [11:22] JohnGraybeal: @Mike: In addition to single ontology vs overarching ontology, there are typically multiple authoritative (sic) vocabularies within each domain, and maybe one or two ontologies as well. Making integrative use of that information is, for now, impossible without a lot of human investment. [11:24] MikeBennett: @John very much so. Needs to be done across industries. One chance to get it right I think. [11:25] Krzysztof Janowicz: IMHO, this makes GeoSPARQL so strong [11:27] Scott Hills: Pascal, thanks for that info (13:18). Good to know, since I haven't been able to find such material to date. [11:29] Joanne Luciano: Thanks to the organizers and presenters! [11:29] Frank Chum: W3C Oil, Gas & Chemical Business Group: http://www.w3.org/community/oilgaschem/ is developing a use case to collaborate on ontologies for drilling automation. [11:29] Frank Chum: Involving multiple Oil & Gas standards organizations [11:29] PeterYim: == Final remarks by KrishnaSinha ... [11:31] Frank Chum: Thank you. [11:31] PeterYim: great session! [11:31] Krzysztof Janowicz: thanks, bye bye [11:31] LeoObrst: Thanks, All! [11:31] PeterYim: -- session ended: 11:31am PDT -- ---------