> [DB] this Thursday ... cannot participate - but the three
> of you can definitely go ahead and meet and I'll catch up
> later. (02)
[ppy] Unfortunate ... how about the immediate following week
(same time Thu 5/25)? (03)
> [DB] You are right about the spreadsheet - it seems that it
> cut off about 202 lines (content objects) at the bottom.
> I will go back and regenerate the export from the Coast site
> map to include the last 202 objects. (04)
[ppy] hopefully, using, say, the .csv format may break us loose
from the spreadsheet maximum number of rows constraint. (05)
Ref my earlier discussion (see:
), I noticed on your inventory spreadsheet, you ARE inventorying
all versions of a wiki page. Therefore, ref that discussion, you
might consider just inventorying the 'current' version of each
page, and leave the history behind. (06)
> [DB] > Re: shared file workspace - let me see if I can run
> the Teragram concept extraction file against a remote
> directory. (07)
[ppy] you have already captured the links in your inventory map
now. Ref your spreadsheet
<ontolog-cwe-inventory-map_dbedford_20060514a.xls> rows 22382 &
below (showing our shared ./file/resource content), and, rows
25407 & below (showing our shared ./file/work content). (08)
The way we have it, any item that is dropped into the CWE -- i.e.
discussion archives, wiki page or shred-files in the repository)
are browser (http) accessible. Which means, you don't have to
'map' to a 'remote directory' ... as long as Teragram can extract
via a web link, our content will be available (as it is already
being served by apache over http.)
Thanks & regards. =ppy
dbedford@xxxxxxxxxxxxx wrote Mon, 15 May 2006 11:59:59 -0400:
> Unfortunately, I have a KMWG Board Meeting this Thursday so cannot
> but the three of you can definitely go ahead and meet and I'll catch up later.
> You are right about the spreadsheet - it seems that it cut off about 202 lines
> (content objects) at the bottom. I will go back and regenerate the export
> the Coast site map to include the last 202 objects. Also, this is only to
> Level 6 so anything lower in the WIKI will not be included.
> Re: shared file workspace - let me see if I can run the Teragram concept
> extraction file against a remote directory. If I can, we don't need a second
> store of the data. If I need a "local Bank"
> Best regards,
> Denise (010)
> "Peter P. Yim"
> Sent by: Ontologizing-Ontolog
> ontologizing-bo <ontologizing@xxxxxxxxxxxxxxxx>
> [ontologizing] Re: [TaxoThesaurus]
> 05/15/2006 Ontolog Wiki Inventory
> 11:40 AM
> Please respond
> 1. This is great!
> (a) Denise ... your plan sounds good.
> (b) we actually don't have a Invited Speaker or Scheduled
> Discussion Event for Ontolog this Thursday (nor the next week).
> Would be great if we devote, say, half the session, on
> discussions relating to this project. Can you all make it to this
> Thursday's (5/11 1:30~3:00pm EDT / 10:30am~12:00 PDT) call? And
> next week's?
> 2. File content: at first glance -
> (a) I unzipped and opened up your spreadsheet file with my
> OpenOffice application (I don't have MS Excel on my desktop
> machine), and was only able to see up to row 65536. I have a
> feeling this may not be the end of the content you have put
> together. It seems that either you or I ran into a constraint
> posed by either yours or my spreadsheet program.
> (b) I'll check (other versions of spreadsheet programs) too, but
> would you check and see if you might be able to put the content
> into a different file format (say, .csv) so that we can avoid
> this limitation, and possibly make the file more portable too,
> please Denise?
> 3. Ref. the attachment limitations ... Yes.
> (a) I renamed Denise' file to
> <ontolog-cwe-inventory-map_dbedford_20060514a.zip> sand have done
> uploading both the zip file and the unzipped .xls file. You will
> be able to find them under:
> (b) the CWE is designed so that users can drag and drop their
> shared file into the shared-file workspace via WebDAV (the web
> distributed authoring and versioning protocol).
> (c) the instructions (a one time setup) to set oneself up for
> this file upload can be found at:
> ... please try it. It could be a bit tricky for some machines,
> but I will be more than happy to walk the members of this team
> through the setup (during one of our calls) in the near future,
> if anyone wants it.
> (d) our shared-file workspace:
> > [DB] Peter do you think you could create the directory for
> > us to work from?
> [ppy] Our workspace for this project (is already up), and they
> are at:
> Wiki (collaborative authoring / shared display):
> Discussion workspace:
> distribution-list: mailto:ontologizing@xxxxxxxxxxxxxxxx
> discussion archives: http://ontolog.cim3.net/forum/ontologizing/
> Shared-file workspace:
> http://ontolog.cim3.net/cgi-bin/wiki.pl?OntologTaxoThesaurus#nidMU1 ]
> ... don't know if that answers your question, Denise?
> (In particular, if the "shared-file workspace" directory the
> "directory" you want to see created?
> Thanks & regards. =ppy
> -- (011)
> Bob Smith wrote Mon, 15 May 2006 07:50:29 -0700:
>> Hi Denise,
>> Your plans for next steps sounds reasonable to me;
>> - I do not have immediate access to a crawler but I will check with a friend
>> at Kayvium about the status of his software.
>> - Lisa, Peter: any comments?
>> Bob (012)
>> -----Original Message-----
>> From: dbedford@xxxxxxxxxxxxx [mailto:dbedford@xxxxxxxxxxxxx]
>> Sent: Monday, May 15, 2006 7:38 AM
>> To: Peter P. Yim; Bob Smith; Lisa
>> Subject: Ontolog Wiki Inventory
>> Attached (at long last) is an inventory of the content of the Ontolog Wiki
>> down to Level 6 in the tree structure. (See attached file: Ontolog Wiki
>> Site Map.zip). The Ontology tree actually goes down 23 levels - across the
>> full site
>> - but after running the full scan it seemed that most of the content was at
>> level 6 or above.
>> It is about 15m, so I zipped it to get through any size ceilings you might
>> have on your email. As it turns out the file is too large for the list so
>> I'm sending it to your direct email addresses.
>> Here's how to reach the contents of the inventory:
>> Column A = path to the file
>> Column B = File Name
>> Column C = File Extension/Type
>> Column D = Title if there is one available Column E = Author where available
>> Column F = Last Modified Date Column G = File Size Column H = downloading
>> speed (we use this to check for slow pages/content) Column I = Outbound Link
>> Column J = External Links
>> Suggested next steps:
>> the team - Lisa especially - reviews the inventory to decide what kinds
>> content we will cover in our ontologizing exercise
>> we make a copy of the inventory wihch reflects our selections, and
>> the content to be ontologized from the wiki. Bob or Lisa - do you know
>> of a
>> crawler we could use to collect the content? If not, I can do this but
>> might take me a bit longer since I'll have to work at it in bits/pieces
>> time or ask a colleague here to recommend a tool;
>> store the content in a directory - Peter do you think you could create
>> directory for us to work from?
>> once we have the actual content in one/more directories, I'll proceed to
>> the concept extraction and build the categorizer, summarizer
>> Does this sound like a plan? Please let me know if you want me to run the
>> deeper - Peter can probably tell at a glance what is missing from the
>> Best regards,
>> Denise (013)
Msg Archives: http://ontolog.cim3.net/forum/ontologizing/
Community Portal: http://ontolog.cim3.net/
Community Files: http://ontolog.cim3.net/file/work/OntologizingOntolog/
Community Wiki: http://ontolog.cim3.net/cgi-bin/wiki.pl?OntologizingOntolog (014)