Hi all, we are having a Hackathon while at the ESWC. (01)
May 27 in Montpillier, France. (02)
Theme: (03)
The ability to extract meaningful, machine-interpretable data from (04)
scholarly publications in PDF form is a big challenge. Several open (05)
source libraries exist that attempt to automate this process, but work (06)
needs to be done on them to improve accuracy and reliability. Some (07)
specific and relevant challenges include: (08)
Ability to automatically identify and tokenize citations from the PDF (09)
(or more accurately, from a string of text) (010)
Ability to automatically identify those blocks of text that represent (011)
the narrative in a PDF. (012)
Ability to identify references within the narrative, extract their (013)
scope, and associate them with citation information in the PDF. (014)
Anybody interested is welcome to join us, http://scholrev.org/hackathon/ (015)
Please contact Casey McLaughlin <casey.mclaughlin@xxxxxxxxxxx> (016)
--
Alexander Garcia
http://www.alexandergarcia.name/
http://www.usefilm.com/photographer/75943.html
http://www.linkedin.com/in/alexgarciac (017)
_________________________________________________________________
Message Archives: http://ontolog.cim3.net/forum/ontolog-forum/
Config Subscr: http://ontolog.cim3.net/mailman/listinfo/ontolog-forum/
Unsubscribe: mailto:ontolog-forum-leave@xxxxxxxxxxxxxxxx
Shared Files: http://ontolog.cim3.net/file/
Community Wiki: http://ontolog.cim3.net/wiki/
To join: http://ontolog.cim3.net/cgi-bin/wiki.pl?WikiHomePage#nid1J (018)
|