ontologizing
[Top] [All Lists]

Re: [ontologizing] Re: [TaxoThesaurus] Ontolog Wiki Inventory

To: Ontologizing-Ontolog <ontologizing@xxxxxxxxxxxxxxxx>
From: "Peter P. Yim" <peter.yim@xxxxxxxx>
Date: Mon, 15 May 2006 09:26:53 -0700
Message-id: <4468ABCD.6020206@xxxxxxxx>
Denise,    (01)


 > [DB] this Thursday ... cannot participate - but the three
 > of you can definitely go ahead and meet and I'll catch up
 > later.    (02)

[ppy] Unfortunate ... how about the immediate following week 
(same time Thu 5/25)?    (03)


 > [DB] You are right about the spreadsheet - it seems that it
 > cut off about 202 lines (content objects) at the bottom.
 > I will go back and regenerate the export from the Coast site
 > map to include the last 202 objects.    (04)

[ppy] hopefully, using, say, the .csv format may break us loose 
from the spreadsheet maximum number of rows constraint.    (05)

Ref my earlier discussion (see: 
http://ontolog.cim3.net/forum//ontologizing/2006-05/msg00002.html#nid09 
), I noticed on your inventory spreadsheet, you ARE inventorying 
all versions of a wiki page. Therefore, ref that discussion, you 
might consider just inventorying the 'current' version of each 
page, and leave the history behind.    (06)

 > [DB] > Re:  shared file workspace - let me see if I can run
 > the Teragram concept extraction file against a remote
 > directory.    (07)

[ppy] you have already captured the links in your inventory map 
now. Ref your spreadsheet 
<ontolog-cwe-inventory-map_dbedford_20060514a.xls> rows 22382 & 
below (showing our shared ./file/resource content), and, rows 
25407 & below (showing our shared ./file/work content).    (08)

The way we have it, any item that is dropped into the CWE -- i.e. 
discussion archives, wiki page or shred-files in the repository) 
are browser (http) accessible. Which means, you don't have to 
'map' to a 'remote directory' ... as long as Teragram can extract 
via a web link, our content will be available (as it is already 
being served by apache over http.)
Thanks & regards.  =ppy
--    (09)



dbedford@xxxxxxxxxxxxx wrote Mon, 15 May 2006 11:59:59 -0400:
> Peter,
> 
> Unfortunately, I have a KMWG Board Meeting this Thursday so cannot 
>participate -
> but the three of you can definitely go ahead and meet and I'll catch up later.
> 
> You are right about the spreadsheet - it seems that it cut off about 202 lines
> (content objects) at the bottom.   I will go back and regenerate the export 
>from
> the Coast site map to include the last 202 objects.   Also, this is only to
> Level 6 so anything lower in the WIKI will not be included.
> 
> Re:  shared file workspace - let me see if I can run the Teragram concept
> extraction file against a remote directory.   If I can, we don't need a second
> store of the data.  If I need a "local Bank"
> 
> Best regards,
> Denise    (010)



>              "Peter P. Yim"                                                   
>  
>              <peter.yim@cim3                                                  
>  
>              .com>                                                           
>To 
>              Sent by:                Ontologizing-Ontolog                     
>  
>              ontologizing-bo         <ontologizing@xxxxxxxxxxxxxxxx>          
>  
>              unces@xxxxxxxxx                                                 
>cc 
>              im3.net                                                          
>  
>                                                                         
>Subject 
>                                      [ontologizing] Re: [TaxoThesaurus]       
>  
>              05/15/2006              Ontolog Wiki Inventory                   
>  
>              11:40 AM                                                         
>  
>                                                                               
>  
>                                                                               
>  
>              Please respond                                                   
>  
>                    to                                                         
>  
>              Ontologizing-On                                                  
>  
>                   tolog                                                       
>  
>              <ontologizing@o                                                  
>  
>              ntolog.cim3.net                                                  
>  
>                     >                                                         
>  
>                                                                               
>  
> 
> 
> 
> Denise,
> 
> 1. This is great!
> 
> (a) Denise ... your plan sounds good.
> 
> (b) we actually don't have a Invited Speaker or Scheduled
> Discussion Event for Ontolog this Thursday (nor the next week).
> Would be great if we devote, say, half the session, on
> discussions relating to this project. Can you all make it to this
> Thursday's (5/11 1:30~3:00pm EDT / 10:30am~12:00 PDT) call? And
> next week's?
> 
> 2. File content: at first glance -
> 
> (a) I unzipped and opened up your spreadsheet file with my
> OpenOffice application (I don't have MS Excel on my desktop
> machine), and was only able to see up to row 65536. I have a
> feeling this may not be the end of the content you have put
> together. It seems that either you or I ran into a constraint
> posed by either yours or my spreadsheet program.
> 
> (b) I'll check (other versions of spreadsheet programs) too, but
> would you check and see if you might be able to put the content
> into a different file format (say, .csv) so that we can avoid
> this limitation, and possibly make the file more portable too,
> please Denise?
> 
> 3. Ref. the attachment limitations ... Yes.
> 
> (a) I renamed Denise' file to
> <ontolog-cwe-inventory-map_dbedford_20060514a.zip> sand have done
> uploading both the zip file and the unzipped .xls file. You will
> be able to find them under:
> http://ontolog.cim3.net/file/work/OntologizingOntolog/TaxoThesaurus/
> 
> (b) the CWE is designed so that users can drag and drop their
> shared file into the shared-file workspace via WebDAV (the web
> distributed authoring and versioning protocol).
> 
> (c) the instructions (a one time setup) to set oneself up for
> this file upload can be found at:
> http://ontolog.cim3.net/cgi-bin/wiki.pl?ToolsCollaboration#nidAVL
> ... please try it. It could be a bit tricky for some machines,
> but I will be more than happy to walk the members of this team
> through the setup (during one of our calls) in the near future,
> if anyone wants it.
> 
> (d) our shared-file workspace:
> 
>  > [DB] Peter do you think you could create the directory for
>  > us to work from?
> 
> [ppy] Our workspace for this project (is already up), and they
> are at:
> 
> Wiki (collaborative authoring / shared display):
> 
> Discussion workspace:
>    distribution-list: mailto:ontologizing@xxxxxxxxxxxxxxxx
>    discussion archives: http://ontolog.cim3.net/forum/ontologizing/
> 
> Shared-file workspace:
> http://ontolog.cim3.net/file/work/OntologizingOntolog/TaxoThesaurus/
> 
> [ref:
> http://ontolog.cim3.net/cgi-bin/wiki.pl?OntologTaxoThesaurus#nidMU1 ]
> 
> ... don't know if that answers your question, Denise?
> (In particular, if the "shared-file workspace" directory the
> "directory" you want to see created?
> 
> Thanks & regards.  =ppy
> --    (011)


> Bob Smith wrote Mon, 15 May 2006 07:50:29 -0700:
>> Hi Denise,
>>
>> Thanks!
>>
>> Your plans for next steps sounds reasonable to me;
>>
>> - I do not have immediate access to a crawler but I will check with a friend
>> at Kayvium about the status of his software.
>>
>> - Lisa, Peter: any comments?
>>
>> Cheers,
>>
>> Bob    (012)


>> -----Original Message-----
>> From: dbedford@xxxxxxxxxxxxx [mailto:dbedford@xxxxxxxxxxxxx]
>> Sent: Monday, May 15, 2006 7:38 AM
>> To: Peter P. Yim; Bob Smith; Lisa
>> Subject: Ontolog Wiki Inventory
>>
>> All,
>>
>> Attached (at long last) is an inventory of the content of the Ontolog Wiki
>> down to Level 6 in the tree structure.  (See attached file: Ontolog Wiki
>> Site Map.zip).  The Ontology tree actually goes down 23 levels - across the
>> full site
>> - but after running the full scan it seemed that most of the content was at
>> level 6 or above.
>>
>> It is about 15m, so I zipped it to get through any size ceilings you might
>> have on your email.  As it turns out the file is too large for the list so
>> I'm sending it to your direct email addresses.
>>
>> Here's how to reach the contents of the inventory:
>>
>> Column A = path to the file
>> Column B = File Name
>> Column C = File Extension/Type
>> Column D = Title if there is one available Column E = Author where available
>> Column F = Last Modified Date Column G = File Size Column H = downloading
>> speed (we use this to check for slow pages/content) Column I = Outbound Link
>> Column J = External Links
>>
>> Suggested next steps:
>>
>>    the team - Lisa especially - reviews the inventory to decide what kinds
>> of
>>    content we will cover in our ontologizing exercise
>>    we make a copy of the inventory wihch reflects our selections, and
>> extract
>>    the content to be ontologized from the wiki.  Bob or Lisa - do you know
>> of a
>>    crawler we could use to collect the content?   If not, I can do this but
>> it
>>    might take me a bit longer since I'll have to work at it in bits/pieces
>> of
>>    time or ask a colleague here to recommend a tool;
>>    store the content in a directory - Peter do you think you could create
>> the
>>    directory for us to work from?
>>    once we have the actual content in one/more directories, I'll proceed to
>> run
>>    the concept extraction and build the categorizer, summarizer
>>
>> Does this sound like a plan?   Please let me know if you want me to run the
>> scan
>> deeper - Peter can probably tell at a glance what is missing from the
>> inventory.
>>
>> Best regards,
>> Denise    (013)

_________________________________________________________________
Msg Archives: http://ontolog.cim3.net/forum/ontologizing/ 
Subscribe/Unsubscribe/Config: 
http://ontolog.cim3.net/mailman/listinfo/ontologizing/
Community Portal: http://ontolog.cim3.net/
Community Files: http://ontolog.cim3.net/file/work/OntologizingOntolog/
Community Wiki: http://ontolog.cim3.net/cgi-bin/wiki.pl?OntologizingOntolog    (014)
<Prev in Thread] Current Thread [Next in Thread>