Search the web
Sign In
New User? Sign Up
TaxoCoP · Taxonomy Community of Practice
? Already a member? Sign in to Yahoo!

Yahoo! Groups Tips

Did you know...
Show off your group to the world. Share a photo of your group with us.

Best of Y! Groups

   Check them out and nominate your group.
Having problems with message search? Fill out this form to ensure your group is one of the first to be migrated to the new message search system.

Messages

  Messages Help
Advanced
Messages 3304 - 3333 of 3333   Newest  |  < Newer  |  Older >  |  Oldest
Messages: Show Message Summaries   (Group by Topic) Sort by Date v  
#3333 From: "John O'Gorman" <jogorman@...>
Date: Mon Nov 23, 2009 2:37 pm
Subject: Re: Taxonomies and Relational Databases
laptopjockey
Offline Offline
Send Email Send Email
 
Feel free to consider Object Oriented flavours as well.
 
 
-----Original Message-----
From: John O'Gorman [mailto:jogorman@...]
Sent: Monday, November 23, 2009 07:35 AM
To: TaxoCoP@yahoogroups.com
Subject: [TaxoCoP] Taxonomies and Relational Databases

 
Good morning;
 
I once heard the relational theory of data referred to a 'set-based', and since taxonomists' bread and butter could be thought of as the formal division of things into set and subsets, I was wondering if anyone had any thoughts on a November morning about possible connections between the two.
 
John O'

 


#3332 From: "John O'Gorman" <jogorman@...>
Date: Mon Nov 23, 2009 2:35 pm
Subject: Taxonomies and Relational Databases
laptopjockey
Offline Offline
Send Email Send Email
 
Good morning;
 
I once heard the relational theory of data referred to a 'set-based', and since taxonomists' bread and butter could be thought of as the formal division of things into set and subsets, I was wondering if anyone had any thoughts on a November morning about possible connections between the two.
 
John O'

#3331 From: "StevenD" <sderose@...>
Date: Fri Nov 20, 2009 7:43 pm
Subject: Introduction
fressdir2
Online Now Online Now
Send Email Send Email
 
Hi. I just joined, so this is the requested self-introduction. I'm presently
sitting at the TBC workshop in San Jose.

I'm the Director of Linguistics for OpenAmplify (http://www.openamplify.com), a
small text analytics / semantic extraction platform vendor. We work especially
in the ad-placement space, and our products are especially strong in natural
language processing and linguistic analysis. Our products report topics,
polarity/sentiment, named entities, and the like, but also more actionable data
such as an author's expressions of intention toward various topics, time frames,
and more.

My academic background is in Computer Science and Linguistics. I am perhaps best
known for developing the methods for part-of-speech assignment that everybody
now uses (based on corpus analysis, Markov modeling, etc.) and for my work on
electronic information standards. I've been heavily involved in W3C, OASIS, ISO,
and other standards organizations, on XML, XPath, XLink, TEI, the Encoded
Archival Description standard familiar to librarians, and many others, often as
editor.

In 1989 I co-founded Electronic Book Technologies and created an SGML-based
browser and structured search engine for delivery of very large-scale
documentation (customers like Boeing, SGI, Sun, DEC, etc). One of my favorite
anecdotes is that when the US Navy deployed our "DynaText" system to replace the
printed maintenance manuals on destroyers, the ships rode 6 inches higher in the
water from the saved weight of paper.

Currently, I'm especially interested in crowd-sourcing, construction of
large-scale practical taxonomies and ontologies, and anything related.

Steve DeRose

#3330 From: "Stephanie Lemieux" <stephanie@...>
Date: Thu Nov 19, 2009 6:01 pm
Subject: Meet on Thursday at TBC
stephaniefkmg
Offline Offline
Send Email Send Email
 
Hi all,
For those of you in San Jose for TBC, meet after the SharePoint talk I'll be
giving today (Thursday) with Jeff Carr and we can all go out for a drink!
See you tonight,
S.

#3329 From: Matt Moore <innotecture@...>
Date: Tue Nov 17, 2009 11:58 pm
Subject: Re: Re: word/phrase extracting/sorting tools
innotecture
Online Now Online Now
Send Email Send Email
 
Jordan,

This may be of interest: http://www.readwriteweb.com/archives/australian_museum_uses_open_calais.php - Seb Chan is a pretty cutting-edge guy.

And this was my response: http://engineerswithoutfears.blogspot.com/2008/04/hybrid-tagging.html

Cheers,

Matt


From: Jordan Cassel <jordan_cassel@...>
To: TaxoCoP@yahoogroups.com
Sent: Wed, November 18, 2009 3:59:43 AM
Subject: Re: [TaxoCoP] Re: word/phrase extracting/sorting tools

 


On a related note, Calais has a Beta program (http://viewer. opencalais. com/) which allows you to input text and it will identify 'entities' present and categorize them according to a taxonomy. Check out the output in the left rail.  This is intended to be a step towards the semantic Web.  I've been playing with it a little bit trying to figure out if I can operationalize it here where we have a lot of unstructured text, in order to improve search, navigation, and help power related content.

-Jordan


From: deddy205ar <deddy@davideddy. com>
To: TaxoCoP@yahoogroups .com
Sent: Mon, November 16, 2009 7:34:27 PM
Subject: [TaxoCoP] Re: word/phrase extracting/sorting tools

 

Heather -

A Mac application that I like is DEVONthink (www.devon-technolo gies.com) for $150.

It produces a "concordance" (word count & frequencies) from a body of text.

I've never tried for phrases, but will spend some time in Help to see if it does that
too.

- David




#3328 From: "Mrs Mary Jo" <jborrell@...>
Date: Tue Nov 17, 2009 6:21 pm
Subject: SLA Taxonomy Division at TBC
jborrell007
Offline Offline
Send Email Send Email
 
Hello,
If you're attending the Taxonomy Boot Camp this week, stop by the SLA Taxonomy
Division table in the hall just outside the sessions and pick up information.

The Taxonomy Division addresses ways to organize and structure information so
that content is accessible and useful. It offers a practical context for
exploring issues and sharing experiences related to planning, creating and
maintaining taxonomies, thesauri, authority files, and other controlled
vocabularies and information structures.

If you're not going to the conference, you can get more information from
http://wiki.sla.org/display/SLATAX or information on joining SLA at
http://www.sla.org/content/membership/joinsla/index.cfm

Thanks,
Jennifer Borrell
PR Committee Chair
SLA Taxonomy Division

#3327 From: Jordan Cassel <jordan_cassel@...>
Date: Tue Nov 17, 2009 4:59 pm
Subject: Re: Re: word/phrase extracting/sorting tools
ypc624
Offline Offline
Send Email Send Email
 

On a related note, Calais has a Beta program (http://viewer.opencalais.com/) which allows you to input text and it will identify 'entities' present and categorize them according to a taxonomy. Check out the output in the left rail.  This is intended to be a step towards the semantic Web.  I've been playing with it a little bit trying to figure out if I can operationalize it here where we have a lot of unstructured text, in order to improve search, navigation, and help power related content.

-Jordan


From: deddy205ar <deddy@...>
To: TaxoCoP@yahoogroups.com
Sent: Mon, November 16, 2009 7:34:27 PM
Subject: [TaxoCoP] Re: word/phrase extracting/sorting tools

 

Heather -

A Mac application that I like is DEVONthink (www.devon-technolo gies.com) for $150.

It produces a "concordance" (word count & frequencies) from a body of text.

I've never tried for phrases, but will spend some time in Help to see if it does that
too.

- David



#3326 From: Christine Connors <CJMConnors@...>
Date: Tue Nov 17, 2009 2:14 pm
Subject: Re: word/phrase extracting/sorting tools
CJMConnors
Offline Offline
Send Email Send Email
 
Heather - 

I'm just getting caught up on some mail folders, sorry for the delay!  Have you looked at the tools available from SIL?  <http://www.sil.org/computing/catalog/show_software_catalog.asp?by=all&name=>

Best, 
Christine

From: Heather Hedden <heather@...>
To: TaxoCoP@yahoogroups.com
Sent: Wed, November 11, 2009 9:34:22 AM
Subject: [TaxoCoP] word/phrase extracting/sorting tools

 

Does anyone have recommendations for simple/inexpensive tools that a taxonomist could use to automatically gather words and phrases from content and sort or count them (then to generate histograms, for example), as a method to gather candidate terms from text when building a taxonomy? A colleague mentioned Word List from MechanicWords http://www.mechanicwords.com/ but I'd like to know of others like it.  This would be an automated supplement to a content audit.  Also, is this something simple enough to code that any in-house software developers in a large enterprise (with time resources) might be able to code for?

Thanks.

-- Heather

-- Heather Hedden
Hedden Information Management
www.Hedden-Informat ion.com



1 of 1 File(s)


#3325 From: "deddy205ar" <deddy@...>
Date: Tue Nov 17, 2009 3:34 am
Subject: Re: word/phrase extracting/sorting tools
deddy205ar
Offline Offline
Send Email Send Email
 
Heather -

A Mac application that I like is DEVONthink (www.devon-technologies.com) for
$150.

It produces a "concordance" (word count & frequencies) from a body of text.

I've never tried for phrases, but will spend some time in Help to see if it does
that
too.

- David

#3324 From: Nick Berry <infoglutton@...>
Date: Fri Nov 13, 2009 5:56 pm
Subject: Re: Taxonomy Bootcamp
taxonomizer
Offline Offline
Send Email Send Email
 
Hi Stephanie, I'll be there.  Looking forward to seeing you!
-Nick

On Fri, Nov 13, 2009 at 9:07 AM, Stephanie Lemieux <stephanie@...> wrote:
 

Hi all,

Let us know if you're going to be at TBC this year - would love to organize a group dinner or drinks...

Jeff, Seth and I are doing a whopping 6 prsentations - hope to see you there!

Stephanie
Earley & Associates



#3323 From: "Stephanie Lemieux" <stephanie@...>
Date: Fri Nov 13, 2009 5:07 pm
Subject: Taxonomy Bootcamp
stephaniefkmg
Offline Offline
Send Email Send Email
 
Hi all,

Let us know if you're going to be at TBC this year - would love to organize a
group dinner or drinks...

Jeff, Seth and I are doing a whopping 6 prsentations - hope to see you there!

Stephanie
Earley & Associates

#3322 From: Matt Moore <innotecture@...>
Date: Thu Nov 12, 2009 8:42 pm
Subject: Re: word/phrase extracting/sorting tools [1 Attachment]
innotecture
Online Now Online Now
Send Email Send Email
 
Hi,

I think there's 3 aspects to these tools worth considering:
- Where the text comes from do you have to enter the text manually? Can you just point it at a server? How does it cope with multiple file formats?
- What does it do? Is it just counting? Is there fancy statistical or semantic analysis?
- What form do the outputs take? Stats? Visuals?

The more functionality you want at each step: i. the more processing power you'll need & ii. the less likely the tool is to be free.

Another option is to examine the search logs for frequency of search terms (assuming your organisation has a search tool that people use). In some ways, this is easier than text crunching and may yield better results if you are interested in taxonomy from a UX perspective.

Regards,

Matt


#3321 From: Seth Earley <seth@...>
Date: Thu Nov 12, 2009 6:06 am
Subject: Re: word/phrase extracting/sorting tools
seth_earley
Offline Offline
Send Email Send Email
 
One challenge I have seen with these tools is related to performance. Sampling
large amounts of content can bog down systems and take inordinate amounts of
time to process.

Our colleague Prakash Govindarajulu has created a tool to do this sampling
called Disk Bee. I am sure he would welcome opportunities to get feedback. He
can be reached at Prakash@...

Seth

Seth Earley
President
Earley & Associates
781-820-8080
www.earley.com

Sent from my Blackberry

________________________________
From: TaxoCoP@yahoogroups.com <TaxoCoP@yahoogroups.com>
To: TaxoCoP@yahoogroups.com <TaxoCoP@yahoogroups.com>
Sent: Wed Nov 11 12:36:48 2009
Subject: Re: [TaxoCoP] word/phrase extracting/sorting tools



Collecting basic statistics on individual words should be utterly trivial for
any competent programmer (assuming it's easy to access the content).  It could
be done in a scripting language (perl, python, etc.), a statistics language (R),
or a regular programming language (Java, C#, etc.). Maybe even in Visual Basic.
There are also free packages available online for this sort of thing.

Collecting phrases (n-grams) is a bit harder (and many of the free toys don't do
it), because you don't want to look at the raw frequency of n-grams (in which
case things like "of the" are the most common) but their frequency relative to
what you'd expect from sequences of random words (divide the frequency of the
n-gram by the product of the frequencies of the component words).  You will
still get "of the" near the top, but this will filter out many noise cases.

Though it is hard to do this accurately across a variety of subtle cases: is
building the same word as buildings? is mine the same word as mining? is
Coca-Cola one word or two? is Carpenter (presumably someone's name) the same
word as carpenter (presumably a job description)? does Santa Fe refer to the
city or to the railroad? etc.).  In fact, it covers a lot of the problem areas
of computational linguistics, but you don't need that level of sophistication. 
In fact, you're probably OK treating all non-alphanumeric characters as word
boundaries.

There are also suites of computational linguistic tools (many of them free) that
will do all this and more, but many of them will be too complicated to install
and learn for such a simple task.

Good luck!

                -s

On Wed, Nov 11, 2009 at 9:34 AM, Heather Hedden
<heather@...<mailto:heather@...>> wrote:


Does anyone have recommendations for simple/inexpensive tools that a taxonomist
could use to automatically gather words and phrases from content and sort or
count them (then to generate histograms, for example), as a method to gather
candidate terms from text when building a taxonomy? A colleague mentioned Word
List from MechanicWords http://www.mechanicwords.com/ but I'd like to know of
others like it.  This would be an automated supplement to a content audit. 
Also, is this something simple enough to code that any in-house software
developers in a large enterprise (with time resources) might be able to code
for?

Thanks.

-- Heather

--
Heather Hedden
Hedden Information Management
www.Hedden-Information.com<http://www.Hedden-Information.com>

1 of 1 File(s)


#3320 From: "Bob Bater" <bbater@...>
Date: Wed Nov 11, 2009 10:58 pm
Subject: RE: word/phrase extracting/sorting tools
bbater2001
Offline Offline
Send Email Send Email
 

Heather,

 

What an interesting question! Such tools could be invaluable. I look forward to the responses, which I am sure will be forthcoming (I have none myself, unfortunately).

 

Regards,

 

Bob

 

From: TaxoCoP@yahoogroups.com [mailto:TaxoCoP@yahoogroups.com] On Behalf Of Heather Hedden
Sent: 11 November 2009 15:34
To: TaxoCoP@yahoogroups.com
Subject: [TaxoCoP] word/phrase extracting/sorting tools

 

 

Does anyone have recommendations for simple/inexpensive tools that a taxonomist could use to automatically gather words and phrases from content and sort or count them (then to generate histograms, for example), as a method to gather candidate terms from text when building a taxonomy? A colleague mentioned Word List from MechanicWords http://www.mechanicwords.com/ but I'd like to know of others like it.  This would be an automated supplement to a content audit.  Also, is this something simple enough to code that any in-house software developers in a large enterprise (with time resources) might be able to code for?

Thanks.

-- Heather

-- 
Heather Hedden
Hedden Information Management
www.Hedden-Information.com


#3319 From: Stavros Macrakis <macrakis@...>
Date: Wed Nov 11, 2009 8:36 pm
Subject: Re: word/phrase extracting/sorting tools
smacrakis
Offline Offline
Send Email Send Email
 
Collecting basic statistics on individual words should be utterly trivial for any competent programmer (assuming it's easy to access the content).  It could be done in a scripting language (perl, python, etc.), a statistics language (R), or a regular programming language (Java, C#, etc.). Maybe even in Visual Basic. There are also free packages available online for this sort of thing.

Collecting phrases (n-grams) is a bit harder (and many of the free toys don't do it), because you don't want to look at the raw frequency of n-grams (in which case things like "of the" are the most common) but their frequency relative to what you'd expect from sequences of random words (divide the frequency of the n-gram by the product of the frequencies of the component words).  You will still get "of the" near the top, but this will filter out many noise cases.

Though it is hard to do this accurately across a variety of subtle cases: is building the same word as buildings? is mine the same word as mining? is Coca-Cola one word or two? is Carpenter (presumably someone's name) the same word as carpenter (presumably a job description)? does Santa Fe refer to the city or to the railroad? etc.).  In fact, it covers a lot of the problem areas of computational linguistics, but you don't need that level of sophistication.  In fact, you're probably OK treating all non-alphanumeric characters as word boundaries.

There are also suites of computational linguistic tools (many of them free) that will do all this and more, but many of them will be too complicated to install and learn for such a simple task.

Good luck!

               -s

On Wed, Nov 11, 2009 at 9:34 AM, Heather Hedden <heather@...> wrote:
 

Does anyone have recommendations for simple/inexpensive tools that a taxonomist could use to automatically gather words and phrases from content and sort or count them (then to generate histograms, for example), as a method to gather candidate terms from text when building a taxonomy? A colleague mentioned Word List from MechanicWords http://www.mechanicwords.com/ but I'd like to know of others like it.  This would be an automated supplement to a content audit.  Also, is this something simple enough to code that any in-house software developers in a large enterprise (with time resources) might be able to code for?

Thanks.

-- Heather

-- Heather Hedden
Hedden Information Management
www.Hedden-Information.com


#3318 From: Matt Moore <innotecture@...>
Date: Wed Nov 11, 2009 8:07 pm
Subject: Re: word/phrase extracting/sorting tools
innotecture
Online Now Online Now
Send Email Send Email
 
I've played with Wordle a fair bit: http://www.wordle.net/


From: Heather Hedden <heather@...>
To: TaxoCoP@yahoogroups.com
Sent: Thu, November 12, 2009 1:34:22 AM
Subject: [TaxoCoP] word/phrase extracting/sorting tools

 

Does anyone have recommendations for simple/inexpensive tools that a taxonomist could use to automatically gather words and phrases from content and sort or count them (then to generate histograms, for example), as a method to gather candidate terms from text when building a taxonomy? A colleague mentioned Word List from MechanicWords http://www.mechanic words.com/ but I'd like to know of others like it.  This would be an automated supplement to a content audit.  Also, is this something simple enough to code that any in-house software developers in a large enterprise (with time resources) might be able to code for?

Thanks.

-- Heather

-- 
Heather Hedden
Hedden Information Management
www.Hedden-Informat ion.com



#3317 From: Heather Hedden <heather@...>
Date: Wed Nov 11, 2009 2:34 pm
Subject: word/phrase extracting/sorting tools
hbhedden
Offline Offline
Send Email Send Email
 
Does anyone have recommendations for simple/inexpensive tools that a taxonomist could use to automatically gather words and phrases from content and sort or count them (then to generate histograms, for example), as a method to gather candidate terms from text when building a taxonomy? A colleague mentioned Word List from MechanicWords http://www.mechanicwords.com/ but I'd like to know of others like it.  This would be an automated supplement to a content audit.  Also, is this something simple enough to code that any in-house software developers in a large enterprise (with time resources) might be able to code for?

Thanks.

-- Heather
-- Heather Hedden
Hedden Information Management
www.Hedden-Information.com

#3316 From: "John O'Gorman" <jogorman@...>
Date: Thu Nov 5, 2009 6:01 pm
Subject: Re: Re: Most common facets used within organisations
laptopjockey
Offline Offline
Send Email Send Email
 
Hey Nick;
 
I'm up for that conversation. Connecting the concepts of faceted classification, master data management and business intelligence is brilliant - and where, incidentally, I believe information management will be going in the very near future.
 
If I may, I have a couple of observations for the group. Hopefully I haven't worn out my welcome...
 
First, in my world a facet is just a way to organize a fixed number of lists - stand alone columns, if you will - of related entities.  The difference with Q6 is that there are no assumptions about how the values in the lists will be used. For example, the words Manufacturer, Supplier, Distributor and Vendor are all values on the Roles list. Just the words, not the companies.  Likewise, all the companies important to my enterprise appear on the Organizations list. In order to keep all of these lists completely 'decoupled' no other attributes appear on the list. This is where Nick's MDM connection comes in.
 
Second, building faceted tables now becomes a matter of associating the entities from individual lists to each other as needed. So, for example, I can now create an association between a given organization and an address (street or lat /long) and a Role. Likewise for Persons in the Role of Customer.  I can extend this approach to create associations between Persons in my Customer table with Articles in my Products table.  I can also do interesting things like give one Organization multiple Roles so that Acme Recreation Inc can be a Manufacturer, Distributor and Vendor at the same time.
 
Finally, one recommendation about Location. It is tempting to make things like trails, stores, distribution centres and huts 'Locations', but in Q6 - while these and all other 'fixed' assets have a strong location aspect - they are in fact Physical Assets with a Location association.  Trust me on this one...your BI reports will come out much nicer if you separate the asset from its coordinates.
 
Best of luck, Nick and if you want to talke more, please contact me off-list.
 
John O'
 
 
 
-----Original Message-----
From: Nick Berry [mailto:infoglutton@...]
Sent: Thursday, November 5, 2009 09:36 AM
To: TaxoCoP@yahoogroups.com
Subject: Re: [TaxoCoP] Re: Most common facets used within organisations

 

These are not just facets - they are also master data elements that typically end up in data warehouses for business intelligence purposes.  Right now we're spearheading an MDM initiative, and our top "facets" or entities are:

  • Customer (attributes include name, address/location, activities, purchases, member/nonmember, etc)
  • Article (which is the same as Product - attributes include size, color, price, fabric/material, brand, gender, seasonality, status, etc...but an Article could be a rain jacket or an Adventure Trip or a ski rental)
  • Site (refers to stores, channels, distribution centers, trailheads, etc; attributes include name, location, type, etc)
  • Vendor (includes manufacturers, distributors, etc; attributes are name, address, associated Articles, etc)

We are applying standard IA principles to these elements - interviewing users to understand their needs, doing inventories of the data and how it is used, setting up taxonomies to manage relationships between facets and attributes, etc.  Except in the MDM world these tasks are called data profiling, data integration, data quality, data governance, etc.  

If there are other people on this list doing MDM and/or BI, I'd love to chat with you about data governance, org structure, tools, etc. 

Cheers,
Nick

nick berry, content architect & taxonomist

Recreational Equipment, Inc. (REI)

253-437-7860 (p) / 253-395-8201 (f)




On Thu, Nov 5, 2009 at 12:49 AM, stephaniefkmg <stephanie@earley.com> wrote:
 

This has been a fascinating discussion, weaving between business uses and theoretical concepts.

I'll be summarizing the main points in a blog post next week and on the TaxoCoP wiki, quoting from this discussion.

If any of the participants prefer not being quoted, please email me at stephanie@earley.com

Thanks!
S.


 


#3315 From: Nick Berry <infoglutton@...>
Date: Thu Nov 5, 2009 4:36 pm
Subject: Re: Re: Most common facets used within organisations
taxonomizer
Offline Offline
Send Email Send Email
 
These are not just facets - they are also master data elements that typically end up in data warehouses for business intelligence purposes.  Right now we're spearheading an MDM initiative, and our top "facets" or entities are:
  • Customer (attributes include name, address/location, activities, purchases, member/nonmember, etc)
  • Article (which is the same as Product - attributes include size, color, price, fabric/material, brand, gender, seasonality, status, etc...but an Article could be a rain jacket or an Adventure Trip or a ski rental)
  • Site (refers to stores, channels, distribution centers, trailheads, etc; attributes include name, location, type, etc)
  • Vendor (includes manufacturers, distributors, etc; attributes are name, address, associated Articles, etc)

We are applying standard IA principles to these elements - interviewing users to understand their needs, doing inventories of the data and how it is used, setting up taxonomies to manage relationships between facets and attributes, etc.  Except in the MDM world these tasks are called data profiling, data integration, data quality, data governance, etc.  

If there are other people on this list doing MDM and/or BI, I'd love to chat with you about data governance, org structure, tools, etc. 

Cheers,
Nick

nick berry, content architect & taxonomist

Recreational Equipment, Inc. (REI)

253-437-7860 (p) / 253-395-8201 (f)




On Thu, Nov 5, 2009 at 12:49 AM, stephaniefkmg <stephanie@...> wrote:
 

This has been a fascinating discussion, weaving between business uses and theoretical concepts.

I'll be summarizing the main points in a blog post next week and on the TaxoCoP wiki, quoting from this discussion.

If any of the participants prefer not being quoted, please email me at stephanie@...

Thanks!
S.



#3314 From: "stephaniefkmg" <stephanie@...>
Date: Thu Nov 5, 2009 8:49 am
Subject: Re: Most common facets used within organisations
stephaniefkmg
Offline Offline
Send Email Send Email
 
This has been a fascinating discussion, weaving between business uses and
theoretical concepts.

I'll be summarizing the main points in a blog post next week and on the TaxoCoP
wiki, quoting from this discussion.

If any of the participants prefer not being quoted, please email me at
stephanie@...

Thanks!
S.

#3313 From: "Kris Liberman" <k.liberman@...>
Date: Thu Nov 5, 2009 12:59 am
Subject: Register Now for "Ontologies for Knowledge Mapping and Discovery" in Boston
kliberman_2000
Offline Offline
Send Email Send Email
 
Please join SLA Boston on November 17th for...

*Ontologies for Knowledge Mapping and Discovery*

When: Tuesday, November 17, 5:30pm to 7:30pm
Where: Social Law Library -- John Adams Courthouse, One Pemberton Square, Suite 4100, Boston

We will have copies of Brandy's book available for sale for $70 (this is a 20% discount plus a small shipping charge) -- we can only accept exact cash or a check.

Agenda:
5:30pm to 6:00pm: Registration, networking, booksigning, and snacks

6:00pm to 7:00pm:

Brandy King, author of Finding the Concept, Not Just the Word: A Librarian's Guide to Ontologies and Semantics, will present an overview on ontologies: how they are different from other knowledge, how they are contructed, why they are useful, and how they are helping people find information.

Matt Johnson from Harvard's Center for biomedical Informatics will follow by providing a specific example of how ontologies are helping scientists discover new information about autism.
 
We will have copies of Brandy's book available for sale for $70 (this is a 20% discount plus a small shipping charge) -- we can only accept exact cash or a check.

7:00pm to 7:30pm: Questions and Wrap-up

7:30pm to ?: Continued discussion at the Kinsale Pub
(
http://www.classicirish.com/kinsale-directions.php)

Many thanks to HiveFire (http://www.hivefire.com/) for their support of this program!

***************************
Costs:
SLA/SCIP/ASIST Members - $20
Non-Member - $25
Between Jobs/Retired $10
Student $10

Registration:
Register here:
http://www.surveymonkey.com/s.aspx?sm=LDLvOKtuYbnzlYBgt4EDlw_3d_3d

Please register by November 15th if possible so that we will have an accurate count for the catering folks. Walk-ins are welcome but it is very helpful if you pre-register.

Payment:
Payments can be made through PayPal or by check. The paypal link is available on the registration form or checks can be made out to Boston Chapter/SLA and mailed to:

Alex Caracuzzo
SLA Boston Treasurer
Dewey Library for Management and Social Sciences Massachusetts
Institute of Technology
77 Massachusetts Avenue, E53-100
Cambridge, MA 02139

Please send any questions about the program to Kris Liberman at
k.liberman@... -- thanks!


---

#3312 From: Leonard Will <L.Will@...>
Date: Wed Nov 4, 2009 11:56 am
Subject: Re: Most common facets used within organisations
leonard_will
Offline Offline
Send Email Send Email
 
On Wed, 4 Nov 2009 at 10:01:31, Gabriel Tanase <gabtanase@...>
wrote
>Leonard, all,
>
>In the data(base) conceptual modeling arena <Party> is the most
>frequently used term, as a super-type of both Person and Organisation.
>See Silverston's books on Data Model Patterns. Furthermore, the concept
>of <Party Role> is closely associated with <Party>.
>
>E.g. a "Customer" is a <Party Role>, i.e. the role that a <Person>
>party (Joe SixPack) plays in the context of an <Company> organization
>party (ACME Inc., whose customer Joe SixPack becomes at a given point
>in time). The role of <Customer> may or may not depend on the existence
>of a legal arrangement (contract) between the company Party and the
>person Party. A <Party Role> is always related to a <Party> that plays
>that role in a specific context of at least one other <Party>.
>
>Admittedly, as a term <Party> is not much better than <Agent> or
><Actor>. The conceptual data models make heavy use of subtyping under
>both <Party> and <Party Role> to represent the usual business terms of
>Customer, Supplier, Service Provider, Beneficiary, Company,
>Not-for-Profit Organization, Household etc. etc.
>
>
>Regards,
>Gabriel

Gabriel -

Thanks. I have come across this use of <party>, and perhaps it is the
best term we can find for the job - so long as it is clear to everybody
what it means.

I see that the CIDOC relational data model, at

<http://cidoc.mediahost.org/data-model%28en%29%28E73%29.xml>

uses PEOPLE as the overall type, with

PEOPLE-PERSON
and
PEOPLE-GROUP

for individuals and groups respectively. These are rather cumbersome,
but I suppose that GROUP is more general than "organisation" as a group
may not be "organised".

<Party role> is more difficult, because we have to decide whether to
treat it as a coordination or link between separate PARTY and ROLE
facets,

e.g. "PARTY" & "ROLE"

or whether to treat ROLE as a characteristic of division, giving an
array within the PARTY facet introduced by the node label

<Party by role>

In neither case can it be used as a simple element of a citation order
formula for expressing compound subjects, because its location in such a
sequence will depend on the nature of the role.

Leonard

--
Willpower Information     (Partners: Dr Leonard D Will, Sheena E Will)
Information Management Consultants            Tel: +44 (0)20 8372 0092
27 Calshot Way                              L.Will@...
ENFIELD                                Sheena.Will@...
EN2 7BQ, UK                            http://www.willpowerinfo.co.uk/

#3311 From: Gabriel Tanase <gabtanase@...>
Date: Wed Nov 4, 2009 10:01 am
Subject: Re: Most common facets used within organisations
datamodeller
Offline Offline
Send Email Send Email
 
Leonard, all,

In the data(base) conceptual modeling arena <Party> is the most frequently used term, as a super-type of both Person and Organisation. See Silverston's books on Data Model Patterns.
Furthermore, the concept of <Party Role> is closely associated with <Party>.

E.g. a "Customer" is a <Party Role>, i.e. the role that a <Person> party (Joe SixPack) plays in the context of an <Company> organization party (ACME Inc., whose customer Joe SixPack becomes at a given point in time).
The role of <Customer> may or may not depend on the existence of a legal arrangement (contract) between the company Party and the person Party.
A <Party Role> is always related to a <Party> that plays that role in a specific context of at least one other <Party>.


Admittedly, as a term <Party> is not much better than <Agent> or <Actor>.
The conceptual data models make heavy use of subtyping under both <Party> and <Party Role> to represent the usual business terms of Customer, Supplier, Service Provider, Beneficiary, Company, Not-for-Profit Organization, Household etc. etc.


Regards,
Gabriel
http://www.linkedin.com/in/gabrieltanase


2009/11/3 Leonard Will <L.Will@...>  
[...]
3. It is difficult to find a term to name a facet to which people and
organisations both belong: "agent" can give the misleading impression
that it needs to be the pro-active member in a process; "actor" is
liable to misinterpretation. Has anyone found a good term for this?

Leonard

--
Willpower Information     (Partners: Dr Leonard D Will, Sheena E Will)
Information Management Consultants            Tel: +44 (0)20 8372 0092
27 Calshot Way                              L.Will@...
ENFIELD                                Sheena.Will@...
EN2 7BQ, UK                            http://www.willpowerinfo.co.uk/





#3310 From: "John O'Gorman" <jogorman@...>
Date: Tue Nov 3, 2009 3:33 pm
Subject: Re: Most common facets used within organisations
laptopjockey
Offline Offline
Send Email Send Email
 
Leonard, you raise some of the issues that I believe have been resolved with the development of Q6. BTW, I'm being deliberately provocative here to elicit these kinds of comments and I really appreciate the dialog.
 
Q6 insists on a fixed set of facets for a number of reasons, the primary one so that non-taxonomists cannot begin to develop their own. Think of it like rules of the game...chess for example has a definitive set of rules, otherwise it wouldn't be a game.  The secondary reason is to force an alternative method for managing the 'characteristics of division' Leonard mentions below.
 
First of all, a unique value can only appear once and only on one facet. That means that Matt Moore, Leonard Will, Heather Heddon,  Stavros Macrakis, Patrick Lambe and John O'Gorman are data points on the Person facet. If another Leonard Will comes along and is proven to be different than the current version, he will be added as a unique (in the sense of identitiy) value on the Person facet.
 
The same process is repeated for all facets, so the words 'Customer', 'Client', 'Supplier', 'Curmudgeon', 'Analyst' and 'Patient' would be added to the 'Role' facet. This is initially one of the more difficult concepts to grasp, because most data modellers would put these on the 'Person' facet. Putting them on a separate facet, though, makes them much more useful. Now, anyone on the Person facet can have multiple Roles simply by allowing a one-to-many relationship between Person and Role.  You don't have duplicate people in different roles which makes things much, much easier to manage. I'm compacting concepts here, but hopefully you get the jist.
 
The reason I believe this model will be of interest to the taxonomy community is because it gives us a whole new tool set for not only classifying data and by extension content, but for actively promoting initiatives like The Semantic Web and Master Data Management to name only two. The traditional method of assigning one entity to one place in a taxonomy is still valid and useful. However, when organizing very large - and potentially volatile - collections, the Q6 method assigns all of the names, terms, strings and values to common facets, then uses fairly rigid rules for combining the values into 'new' entities. In other words, it provides a very stable framework for managing what is currently a very chaotic universe of information.
 
One final point if I may. Point #3 in your post points to an interesting dillema in the classification process: what to 'name' things and when to expose the names.  I used 'Agent' as a primary facet name, but I have never exposed it my clients and for the same reason that both Leonard and Matt have mentioned. I don't sweat it any more, though - I just use whatever label they are comfortable with...developers use this technique all the time.
 
Cheers,
 
John O'
 
 
 
 
-----Original Message-----
From: Leonard Will [mailto:L.Will@...]
Sent: Tuesday, November 3, 2009 04:00 AM
To: TaxoCoP@yahoogroups.com
Subject: Re: [TaxoCoP] Most common facets used within organisations

 

On Mon, 2 Nov 2009 at 16:23:51, Matt Moore <innotecture@yahoo.com> wrote
>Leonard,
>
>What would appear on your list?
>
>Cheers,
>
>Matt

I don't think that it is possible to have a "definitive" list that suits
all circumstances. The references Patrick Lambe has given discuss the
reasons.

Two issues are sometimes confused when discussing facets:

1. The need to distinguish between

* "facets" (sometimes called "fundamental facets"), which are the
mutually exclusive categories we are talking about here, such as
objects, materials, activities, places, abstract concepts, and so on,

and

* "characteristics of division", such as "colour" or "age" which are
used to subdivide a facet into _arrays_, headed by a "node label" such
as "automobiles by colour" or "people by age"; these would be part of
the "objects" or "people" facets respectively.

2. The fact that in specifying a citation order of facets in a compound
subject you may have to list a single facet more than once, depending on
its role. For example, in the CRG / BC2 list quoted in William Denton's
paper

* thing/entity
* kind
* part
* property
* material
* process
* operation
* patient
* product
* by-product
* agent
* space
* time

terms from the "people" facet may be used as both "patient" and "agent",
and terms from the "object" facet may be used as "thing", "product",
"by-product", and so on. This makes is difficult to specify a consistent
combination order which can be mechanised based solely on the names of
facets - the entries in the above list are not really names of facets in
the sense defined in 1 above..

This issue is not so much of a problem in post-coordinate applications,
but as this list is talking about "taxonomies" (i.e. classification
schemes), rather than thesauri, I presume that some pre-coordination
will often be required.

3. It is difficult to find a term to name a facet to which people and
organisations both belong: "agent" can give the misleading impression
that it needs to be the pro-active member in a process; "actor" is
liable to misinterpretation. Has anyone found a good term for this?

Leonard

--
Willpower Information (Partners: Dr Leonard D Will, Sheena E Will)
Information Management Consultants Tel: +44 (0)20 8372 0092
27 Calshot Way L.Will@Willpowerinfo.co.uk
ENFIELD Sheena.Will@Willpowerinfo.co.uk
EN2 7BQ, UK http://www.willpowerinfo.co.uk/


#3309 From: Leonard Will <L.Will@...>
Date: Tue Nov 3, 2009 11:00 am
Subject: Re: Most common facets used within organisations
leonard_will
Offline Offline
Send Email Send Email
 
On Mon, 2 Nov 2009 at 16:23:51, Matt Moore <innotecture@...> wrote
>Leonard,
>
>What would appear on your list?
>
>Cheers,
>
>Matt

I don't think that it is possible to have a "definitive" list that suits
all circumstances. The references Patrick Lambe has given discuss the
reasons.

Two issues are sometimes confused when discussing facets:

1. The need to distinguish between

*  "facets" (sometimes called "fundamental facets"), which are the
mutually exclusive categories we are talking about here, such as
objects, materials, activities, places, abstract concepts, and so on,

and

* "characteristics of division", such as "colour" or "age" which are
used to subdivide a facet into _arrays_, headed by a "node label" such
as "automobiles by colour" or "people by age"; these would be part of
the "objects" or "people" facets respectively.


2. The fact that in specifying a citation order of facets in a compound
subject you may have to list a single facet more than once, depending on
its role. For example, in the CRG / BC2 list quoted in William Denton's
paper

      * thing/entity
      * kind
      * part
      * property
      * material
      * process
      * operation
      * patient
      * product
      * by-product
      * agent
      * space
      * time

terms from the "people" facet may be used as both "patient" and "agent",
and terms from the "object" facet may be used as "thing", "product",
"by-product", and so on. This makes is difficult to specify a consistent
combination order which can be mechanised based solely on the names of
facets - the entries in the above list are not really names of facets in
the sense defined in 1 above..

This issue is not so much of a problem in post-coordinate applications,
but as this list is talking about "taxonomies" (i.e. classification
schemes), rather than thesauri, I presume that some pre-coordination
will often be required.

3. It is difficult to find a term to name a facet to which people and
organisations both belong: "agent" can give the misleading impression
that it needs to be the pro-active member in a process; "actor" is
liable to misinterpretation. Has anyone found a good term for this?

Leonard

--
Willpower Information     (Partners: Dr Leonard D Will, Sheena E Will)
Information Management Consultants            Tel: +44 (0)20 8372 0092
27 Calshot Way                              L.Will@...
ENFIELD                                Sheena.Will@...
EN2 7BQ, UK                            http://www.willpowerinfo.co.uk/

#3308 From: Matt Moore <innotecture@...>
Date: Tue Nov 3, 2009 10:37 am
Subject: Re: Most common facets used within organisations
innotecture
Online Now Online Now
Send Email Send Email
 
Patrick,

This is all true but I've a lot of conceptual material around facet methods (how one should construct them, how one should apply them) - what I'm interested in is how facets are actually applied (or not) in organisations - which facets are used, why & how. In fact, I sense a survey coming on here (but I am sorting out some of the details in my head at the moment).

And yes, I agree, facets can also serve content management functions as well as findability...

Cheers,

Matt


From: Patrick Lambe <plambe@...>
To: TaxoCoP@yahoogroups.com
Sent: Tue, November 3, 2009 8:29:36 PM
Subject: Re: [TaxoCoP] Most common facets used within organisations

 

Hi Matt


As I'm sure you know, I discuss a range of candidates for facets in my book Organising Knowledge. For a very good discussion and some excellent references see William Denton's paper How to Make a Faceted Classification and Put It On the Web http://www.miskaton ic.org/library/ facet-web- howto.html - the Spiteri reference in particular is worth checking out, for the principles of facet identification. Denton's piece provides a nice bridge between the information science and facets in use, with some good examples.

The nice thing about facets is that there are only a finite number to choose from, however one wants to slice the world. More important it seems to me(and we lose sight of this sometimes) is which of the possible candidates should we select and why? That means finding a way of determining salience and usefulness for your taxonomy "customers". What is each facet going to do for you? It's not always about findability - facets can also serve content management functions (it's a content type: press release, therefore it needs to be routed for approval).

Best

Patrick

Patrick Lambe

website: www.straitsknowledg e.com

Have you seen our KM Method Cards or
Organisation Culture Cards?  





On Nov 3, 2009, at 3:05 AM, Matt Moore wrote:


John,

So where is this definitive list? References please!

Cheers,

Matt


From: John O'Gorman <jogorman@tiberon- ia.com>
To: TaxoCoP@yahoogroups .com
Sent: Tue, November 3, 2009 12:50:05 AM
Subject: Re: [TaxoCoP] Most common facets used within organisations

 

Good morning, Matt;
 
The definitive list of primary facets (a facet being defined as a mutually exclusive and exhaustive list of related entities) has nineteen entries - all of which are semantically consistent with the exchange of information between two parties. In other words, primary facets have to do with Agents (people and organizations) , Assets (digital, conceptual and physical), Locations (coordinate, named and relative), Actions (activities, events, processes and tasks), Functions (disciplines, roles and uses) and finally States (cycle, point, span and status). 
 
Secondary, tertiary and quaternary facets can be created by combining primary facets, using a mechanism analogous to the one for creating molecules or compounds in chemistry.
 
Hope that helps.
 
John O'
 
 
-----Original Message-----
From: Matt Moore [mailto:innotecture @yahoo.com]
Sent: Monday, November 2, 2009 04:40 AM
To: TaxoCoP@yahoogroups .com
Subject: [TaxoCoP] Most common facets used within organisations

 

Hello,

I'm curious as to what the most commonly used facets within an enterprise taxonomy might be. The ones that come to mind are:
- Products
- Customers/Clients/ Stakeholders
- Business Units
- Locations
- Processes
- Projects

Any further suggestions?

Regards,

Matt






#3307 From: Patrick Lambe <plambe@...>
Date: Tue Nov 3, 2009 9:29 am
Subject: Re: Most common facets used within organisations
plambe2002
Offline Offline
Send Email Send Email
 
Hi Matt

As I'm sure you know, I discuss a range of candidates for facets in my book Organising Knowledge. For a very good discussion and some excellent references see William Denton's paper How to Make a Faceted Classification and Put It On the Web http://www.miskatonic.org/library/facet-web-howto.html - the Spiteri reference in particular is worth checking out, for the principles of facet identification. Denton's piece provides a nice bridge between the information science and facets in use, with some good examples.

The nice thing about facets is that there are only a finite number to choose from, however one wants to slice the world. More important it seems to me(and we lose sight of this sometimes) is which of the possible candidates should we select and why? That means finding a way of determining salience and usefulness for your taxonomy "customers". What is each facet going to do for you? It's not always about findability - facets can also serve content management functions (it's a content type: press release, therefore it needs to be routed for approval).

Best

Patrick

Patrick Lambe

website: www.straitsknowledge.com

Have you seen our KM Method Cards or
Organisation Culture Cards?  





On Nov 3, 2009, at 3:05 AM, Matt Moore wrote:


John,

So where is this definitive list? References please!

Cheers,

Matt


From: John O'Gorman <jogorman@tiberon-ia.com>
To: TaxoCoP@yahoogroups.com
Sent: Tue, November 3, 2009 12:50:05 AM
Subject: Re: [TaxoCoP] Most common facets used within organisations

 

Good morning, Matt;
 
The definitive list of primary facets (a facet being defined as a mutually exclusive and exhaustive list of related entities) has nineteen entries - all of which are semantically consistent with the exchange of information between two parties. In other words, primary facets have to do with Agents (people and organizations) , Assets (digital, conceptual and physical), Locations (coordinate, named and relative), Actions (activities, events, processes and tasks), Functions (disciplines, roles and uses) and finally States (cycle, point, span and status). 
 
Secondary, tertiary and quaternary facets can be created by combining primary facets, using a mechanism analogous to the one for creating molecules or compounds in chemistry.
 
Hope that helps.
 
John O'
 
 
-----Original Message-----
From: Matt Moore [mailto:innotecture @yahoo.com]
Sent: Monday, November 2, 2009 04:40 AM
To: TaxoCoP@yahoogroups .com
Subject: [TaxoCoP] Most common facets used within organisations

 

Hello,

I'm curious as to what the most commonly used facets within an enterprise taxonomy might be. The ones that come to mind are:
- Products
- Customers/Clients/ Stakeholders
- Business Units
- Locations
- Processes
- Projects

Any further suggestions?

Regards,

Matt





#3306 From: Matt Moore <innotecture@...>
Date: Tue Nov 3, 2009 12:23 am
Subject: Re: Most common facets used within organisations
innotecture
Online Now Online Now
Send Email Send Email
 
Leonard,

What would appear on your list?

Cheers,

Matt


#3305 From: Matt Moore <innotecture@...>
Date: Tue Nov 3, 2009 12:23 am
Subject: Re: Most common facets used within organisations
innotecture
Online Now Online Now
Send Email Send Email
 
Heather,

And which six have you seen most often?

Cheers,

Matt


From: Heather Hedden <heather@...>
To: TaxoCoP@yahoogroups.com
Sent: Tue, November 3, 2009 7:59:08 AM
Subject: Re: [TaxoCoP] Most common facets used within organisations

 

I wouldn't consider a sub-type or category within a facet as a facet. I would count six facets from the list below.
Six is a better number anyway for implementing faceted search.

-- Heather

Heather Hedden
Hedden Information Management
Heather@Hedden. net www.Hedden-Informat ion.com


John O'Gorman wrote:
In the post...count 'em (19). No references. :~)
 
John O'
 
"
The definitive list of primary facets (a facet being defined as a mutually exclusive and exhaustive list of related entities) has nineteen entries - all of which are semantically consistent with the exchange of information between two parties. In other words, primary facets have to do with Agents (people and organizations), Assets (digitalconceptual and physical), Locations (coordinate, named and relative), Actions (activities, events, processes and tasks), Functions (disciplines, roles and uses) and finally States (cycle, point, span and status). 
 
"
 
 
-----Original Message-----
From: Matt Moore [mailto:innotecture@ yahoo.com]
Sent: Monday, November 2, 2009 12:05 PM
To: TaxoCoP@yahoogroups .com
Subject: Re: [TaxoCoP] Most common facets used within organisations

 
John,

So where is this definitive list? References please!

Cheers,

Matt


From: John O'Gorman <jogorman@tiberon- ia.com>
To: TaxoCoP@yahoogroups .com
Sent: Tue, November 3, 2009 12:50:05 AM
Subject: Re: [TaxoCoP] Most common facets used within organisations

 

 

Good morning, Matt;
 
The definitive list of primary facets (a facet being defined as a mutually exclusive and exhaustive list of related entities) has nineteen entries - all of which are semantically consistent with the exchange of information between two parties. In other words, primary facets have to do with Agents (people and organizations) , Assets (digital, conceptual and physical), Locations (coordinate, named and relative), Actions (activities, events, processes and tasks), Functions (disciplines, roles and uses) and finally States (cycle, point, span and status). 
 
Secondary, tertiary and quaternary facets can be created by combining primary facets, using a mechanism analogous to the one for creating molecules or compounds in chemistry.
 
Hope that helps.
 
John O'
 
 
-----Original Message-----
From: Matt Moore [mailto:innotecture @yahoo.com]
Sent: Monday, November 2, 2009 04:40 AM
To: TaxoCoP@yahoogroups .com
Subject: [TaxoCoP] Most common facets used within organisations

 

Hello,

I'm curious as to what the most commonly used facets within an enterprise taxonomy might be. The ones that come to mind are:
- Products
- Customers/Clients/ Stakeholders
- Business Units
- Locations
- Processes
- Projects

Any further suggestions?

Regards,

Matt


 



#3304 From: Matt Moore <innotecture@...>
Date: Tue Nov 3, 2009 12:22 am
Subject: Re: Most common facets used within organisations
innotecture
Online Now Online Now
Send Email Send Email
 
John,

I think we'll wait for the white paper. My initial comment is that while your
approach seems quite rigorous it also seems a little divorced from the language
that people use in businesses - no one talks about agents (unless they work for
the CIA) but they do talk aboutcustomers, clients, suppliers, and partners.

Cheers,

Matt



________________________________
From: John O'Gorman <jogorman@...>
To: TaxoCoP@yahoogroups.com
Sent: Tue, November 3, 2009 8:31:38 AM
Subject: Re:  [TaxoCoP] Most common facets used within organisations


Hi Stavros;

As a method for managing collections, Q6 is based on Ranganathan' s system of
classification. Interestingly, it also has a loose resemblence to the Zachman
Framework for enterprise architecture, but was independently derived.

The difference with Q6 is in its granularity and its geometry. In addition to
using the concepts of colon classification, the principles of Q6 start at word
level and can be extended right up through any digital asset. Photos, documents,
diagrams and construction drawings can all be deconstructed and reassembled
using the same nineteen facets and Q6 axioms.

I'm working on a whitepaper outlining the approach and the results of the first
few applications. It should be finished fairly shortly.

John O'

Messages 3304 - 3333 of 3333   Newest  |  < Newer  |  Older >  |  Oldest
Advanced
Add to My Yahoo!      XML What's This?

Copyright © 2009 Yahoo! Inc. All rights reserved.
Privacy Policy - Terms of Service - Guidelines - Help