Skip to search.

Breaking News Visit Yahoo! News for the latest.

×Close this window

web-indexing · Web Indexing

The Yahoo! Groups Product Blog

Check it out!

Group Information

  • Members: 297
  • Category: Indexing
  • Founded: Jan 31, 2005
  • Language: English
? Already a member? Sign in to Yahoo!

Yahoo! Groups Tips

Did you know...
Real people. Real stories. See how Yahoo! Groups impacts members worldwide.

Messages

Advanced
Messages Help
Messages 584 - 613 of 739   Oldest  |  < Older  |  Newer >  |  Newest
Messages: Show Message Summaries Sort by Date ^  
#584 From: "Martin Carbone" <martycarbone@...>
Date: Thu May 8, 2008 4:52 pm
Subject: Generating an alphabetical internet index
martycarbone
Send Email Send Email
 
I have just about completed the first stage of my Alphabetical
Internet Index. << http://www.alphabeticalist.com >>

My strategy has been to wade through 13 primary and authoritative
targets and use their "topics" as a start for a controlled vocabulary.
Those "targets" are at the top of every alphabetical page on the site.

I have also checked out and linked to various small independent sites.

I have about 8,000 links that cover approximately 5,000 topics.

I plan to continue to look for good information on the existing topics
and link to that information. I will also add topics as needed.

Comments are invited regarding (a) the indexing on the site, (b)
the validity of this general approach to generating an Internet Index
and (c) this method of compiling a controlled vocabulary for internet
indexing.

If you consider my approach faulty, please suggest a better way to
select topics for an internet index.

Thanks in advance for your comments.


Marty Carbone

#585 From: Jon Jermey <jonjermey@...>
Date: Fri May 9, 2008 1:59 am
Subject: Re: Generating an alphabetical internet index
jonpjermey
Send Email Send Email
 
My problem is with the whole enterprise: before seeking for ways to
manually construct an internet index you need to demonstrate that a
manually-constructed internet index has value. Or in other words,
explain -- with examples -- why your approach is an improvement over
Google's.

Jon.

Martin Carbone wrote:
>
> If you consider my approach faulty, please suggest a better way to
> select topics for an internet index.
>
>

#586 From: "Martin Carbone" <martycarbone@...>
Date: Fri May 9, 2008 2:01 pm
Subject: Re: Generating an alphabetical internet index
martycarbone
Send Email Send Email
 
In web-indexing@yahoogroups.com, Jon Jermey <jonjermey@...> wrote: in
part:  "before seeking ... ways to ... construct an internet index you
need to demonstrate that (an) ... internet index has value."

------------- reply follows --------------

Thanks for your email. My website is an attempt to do exactly what you
suggest -- that is, (to use your words) "demonstrate that (an) ...
internet index has value."

In problem solving, there are basically two approaches; (1) you can
think about the problem, and (2) you can start solving the problem. They
can be done (a) in any order or (b) simultaneously or (c) any
combination of the two.

I have done lots of thinking before putting up the site. The site is my
way of testing that thinking by actually doing what I think can be done.

Please remember -- this is basically a first draft, a prototype -- it is
undoubtedly rough.

If you think my approach is faulty -- please suggest a better way to (1)
select topics for an internet index and (2) actually construct a
prototype for an internet index.

Marty Carbone




[Non-text portions of this message have been removed]

#587 From: Jon Jermey <jonjermey@...>
Date: Fri May 9, 2008 9:56 pm
Subject: Re: Re: Generating an alphabetical internet index
jonpjermey
Send Email Send Email
 
Hi Marty,

You are missing step (0) -- determine whether there is a problem in the
first place. You have _assumed_ that there is a problem with finding
information on the Internet which a manually-constructed system can
solve. Once you have actually _demonstrated_ this, then you can
reasonably go on to attempt to solve the problem. So let me ask again:
can you tell us what is wrong with Google that your system will fix, and
show us how it will it fix it?

Jon.


Martin Carbone wrote:
> In web-indexing@yahoogroups.com, Jon Jermey <jonjermey@...> wrote: in
> part:  "before seeking ... ways to ... construct an internet index you
> need to demonstrate that (an) ... internet index has value."
>
> ------------- reply follows --------------
>
> Thanks for your email. My website is an attempt to do exactly what you
> suggest -- that is, (to use your words) "demonstrate that (an) ...
> internet index has value."
>
> In problem solving, there are basically two approaches; (1) you can
> think about the problem, and (2) you can start solving the problem. They
> can be done (a) in any order or (b) simultaneously or (c) any
> combination of the two.
>
> I have done lots of thinking before putting up the site. The site is my
> way of testing that thinking by actually doing what I think can be done.
>
> Please remember -- this is basically a first draft, a prototype -- it is
> undoubtedly rough.
>
> If you think my approach is faulty -- please suggest a better way to (1)
> select topics for an internet index and (2) actually construct a
> prototype for an internet index.
>
> Marty Carbone
>
>
>

#588 From: "Martin Carbone" <martycarbone@...>
Date: Sat May 10, 2008 1:44 pm
Subject: Re: Generating an alphabetical internet index
martycarbone
Send Email Send Email
 
--- In web-indexing@yahoogroups.com, Jon Jermey <jonjermey@...> wrote,
in part: "You are missing step (0) -- determine whether there is a
problem in the first place. You have _assumed_ that there is a problem
with finding information on the Internet."

-------- reply follows ------------

1) Do you deny the existence of that problem?

2) Please read << http://www.howto-ville.com/about.html >> where I
talk about the existence of the problem. At that page there is this link
<< http://www.dmoz.org/about.html >> where Google talks about the problem.

3) I totally agree that you must start by determining if there is a
problem. See < http://www.howto-ville.com/problemsolvingtips.html >>
where I present my problem solving tips. But some problems are so
evident -- there is little need to discuss their existence.

4) See << http://www.howto-ville.com/searchingtheinternet.html >>
where I have about 25 links to a discussion of searching ... Some of
the links talk about the problem in scholarly language.

5) Your email seems to indicate that assumptions are bad things.
Please see my "A" page -- at "assumptions" where I argue they are
often valid.

6)  Even if there were no starkly evident problem. It is a good bet
that any system can be improved -- at least a little?


Marty Carbone

#589 From: "Martin Carbone" <martycarbone@...>
Date: Thu May 15, 2008 3:34 pm
Subject: New linked Internet indexes to "hobbies" and "sports"
martycarbone
Send Email Send Email
 
New linked Internet indexes to "hobbies" and "sports"

Here is our new linked index to hobbies -- just put up today.
http://www.howto-ville.com/hobbies.html

Compare it to wikipedia's list of hobbies
http://en.wikipedia.org/wiki/List_of_hobbies

Also check out our linked Internet index to "sports" which was derived
from our "hobbies" index
http://www.howto-ville.com/sports.html

Please comment on our definition of "sports" at the immediately above
link ("competitive, skillful endeavors with vigorous physical
activity").  We made up this definition to differentiate our sports
entries from entries you might find at "competition", "games",
"fitness activities", "hobbies" or other activities that might include
some, but not all elements of a "sport". For instance poker, chess and
checkers are not included as sports because they do not have vigorous
physical activity. "Jogging" is not included because it is not
competitive, while "walking" is included because it is often
competitive -- in fact, it is an Olympic sport. We tried to make the
definition inclusive of the basic concept of sports, but exclusive of
extraneous words -- in other words,"short but sweet".



Martin Carbone

martycarbone@...
http://www.alphabeticalist.com (send us tips for good websites for
this site)
http://www.howto-ville.com/about.html
Tel: 760-603-1910
Fax: 760-603-1930
5123 Don Rodolfo Drive / Carlsbad, CA 93010

#590 From: "Martin Carbone" <martycarbone@...>
Date: Thu May 22, 2008 3:46 pm
Subject: The IPL directory does a good job of indexing the web
martycarbone
Send Email Send Email
 
About the Internet Public Library (IPL) -- one of the best Internet
directories.

A consortium of colleges and universities with programs in information
science is developing and maintaining the IPL.

We have attempted to make the IPL information even easier to access by
putting all of their main subject headings and sub-subject headings on
two indexed pages -- each in  a different format.

Alphabetical Index / http://www.howto-ville.com/iplalphabetical.html
Categorized Index / http://www.howto-ville.com/iplsubjects.html

Use either or both of those pages and tell us which you prefer.

All of the IPL subjects and sub-subjects will also be melded into our
alphabetical pages.

Home page of IPL  << http://www.ipl.org/ >>

Comments are invited. Are we on the right track -- or on a fool's errand?


Marty Carbone

#591 From: "Martin Carbone" <martycarbone@...>
Date: Mon May 26, 2008 1:59 am
Subject: An index to 15 important and authoritative directories
martycarbone
Send Email Send Email
 
Our website now points to 15 important and authoritative directories.
We use their “topics” and "subjects" as a start for a controlled
vocabulary. The “targets” are listed at the top of every alphabetical
page.

Our site is therefor an index to those sites.

<< http://www.howto-ville.com/primarytargets.html >>

"A long, wide and deep look at the Internet"

Here they are: About.com / Answers.com / Bubl Link /
Catholic Encyclopedia / Google / Internet Public Library /
Internet Scout Project / Library Of Congress / NY Times /
Statistical Abstract of the United States / SuperCrawler /
TED.com / Templeton Categories /
Voice Of America "Special English" / Wikipedia

Marty Carbone
http://www.alphabeticalist.com

#592 From: "Martin Carbone" <martycarbone@...>
Date: Wed May 28, 2008 3:10 pm
Subject: Nine articles on Indexing the Internet
martycarbone
Send Email Send Email
 
Indexing the Internet -- a guide to some good articles on the subject.

Evidently, it is no longer (2008) fashionable or even acceptable to
consider the question of "Internet indexing". Most of the articles we
found were written around 1996 - before Google took over the field of
internet searching and became the de-facto way to find information.

In spite of that, we think some of you might find some helpful
information in the nine articles listed at the following URL.

http://www.howto-ville.com/indexingtheinternet.html


Marty Carbone

#593 From: <jmccrooks@...>
Date: Wed May 28, 2008 3:54 pm
Subject: Re: Nine articles on Indexing the Internet
pam272001
Send Email Send Email
 
Thanks, Martin. I'll take a look later this week.

Jessica


On Wed, 28 May 2008 15:10:41 -0000
   "Martin Carbone" <martycarbone@...> wrote:
> Indexing the Internet -- a guide to some good articles
>on the subject.
>
> Evidently, it is no longer (2008) fashionable or even
>acceptable to
> consider the question of "Internet indexing". Most of
>the articles we
> found were written around 1996 - before Google took over
>the field of
> internet searching and became the de-facto way to find
>information.
>
> In spite of that, we think some of you might find some
>helpful
> information in the nine articles listed at the following
>URL.
>
> http://www.howto-ville.com/indexingtheinternet.html
>
>
> Marty Carbone
>

#594 From: "margaret.pappas" <margaret.pappas@...>
Date: Thu May 29, 2008 1:59 am
Subject: Introducing myself & a request
margaret.pappas
Send Email Send Email
 
Dear All,

My name is Margaret Pappas. I am a professional librarian however, I am
VERY NEW to indexing. I have a book indexing project that will entail
VERY SPECIFIC DETAILED indexing i.e. in addition to indexing the pages,
I want to be specific as to 'where on the specific page the record
appears' so the reader can find it quickly.

Does anyone have any ideas/recommendations/suggestions?

Thanks for all your help!

Sincerely,
Margaret

#595 From: <jmccrooks@...>
Date: Thu May 29, 2008 4:49 pm
Subject: Fwd: Introducing myself & a request
pam272001
Send Email Send Email
 
Hi Margaret,

Welcome to the web indexing SIG.

Are you stating that you want to index the book and also
include the exact paragraph, line and place where the term
is? Are you referring to a traditional book of the book
index or a web site index?

The problem with such an approach if you are indexing a
traditional book is, what about the deiscussion of a
concept that span several pages. If you were just building
a concordance, it may be possible, but with an index where
it is much more than keyword selection I can't see this
approach being very useful.

Anyway, someone  else may have another take on it.

Jessica


    --- the forwarded message follows ---


[Non-text portions of this message have been removed]

#596 From: Margaret Pappas <margaret.pappas@...>
Date: Fri May 30, 2008 12:46 am
Subject: Re: Fwd: Introducing myself & a request
margaret.pappas
Send Email Send Email
 
Dear Jessica~
Thanks so much for your quick response. The book is traditional book index.
Your comments make sense to me. As I 'see' it, the book indexreally will be
a'concordance'...however, I am wanting to avoid the detailed 'paragraph, line'
indexing if I can. Just recently, I came across an interesting way to do
it...For example, if 'word' is listed on the page 5, I would count lines from
top to bottom to find it. Currently the book has 37 lines on the page. So
'word' is p. 5:L17T. (So I split the page into lines 1-19T or lines 1-18B. Do
I make sense? What do you think?
Thanks again for helping me.
Sincerely,
Margaret


----- Original Message ----
From: "jmccrooks@..." <jmccrooks@...>
To: web-indexing@yahoogroups.com
Sent: Thursday, May 29, 2008 9:49:13 AM
Subject: Fwd: [web-indexing] Introducing myself & a request


Hi Margaret,

Welcome to the web indexing SIG.

Are you stating that you want to index the book and also
include the exact paragraph, line and place where the term
is? Are you referring to a traditional book of the book
index or a web site index?

The problem with such an approach if you are indexing a
traditional book is, what about the deiscussion of a
concept that span several pages. If you were just building
a concordance, it may be possible, but with an index where
it is much more than keyword selection I can't see this
approach being very useful.

Anyway, someone else may have another take on it.

Jessica

--- the forwarded message follows ---

[Non-text portions of this message have been removed]






[Non-text portions of this message have been removed]

#597 From: "Jan Ross" <jan@...>
Date: Fri May 30, 2008 7:22 am
Subject: RE: Fwd: Introducing myself & a request[Scanned]
janrossuk
Send Email Send Email
 
Hi Margaret (and others)

I may be missing the point of your email, having now deleted your
original one, but your 'interesting' way (below) looks to be somewhat
time-consuming.  Have you looked at James Lamb's excellent little piece
of software called Word Embed.  This may be able to help you, but not
remembering your original requirements, I'm not sure.



Have a look at http://www.jalamb.com/wordembed.html





Best wishes

Jan

~~~~~~~~~~~~~~~~~~~~~~~

Jan Ross. B.Sc, B.Sc Hons, M.Sc, FSI

Tel: 01492 581050

email: jan@...

<http://www.merrallross-wales.co.uk/>



________________________________

From: web-indexing@yahoogroups.com [mailto:web-indexing@yahoogroups.com]
On Behalf Of Margaret Pappas
Sent: 30 May 2008 01:46
To: web-indexing@yahoogroups.com
Subject: Re: Fwd: [web-indexing] Introducing myself & a request[Scanned]



Dear Jessica~
Thanks so much for your quick response.  The book is traditional book
index.  Your comments make sense to me.  As I 'see' it, the book index
really will be a 'concordance'...however, I am wanting to avoid the
detailed 'paragraph, line' indexing if I can.  Just recently, I came
across an interesting way to do it...For example, if  'word' is listed
on the page 5, I would count lines from top to bottom to find it.
Currently the book has 37 lines on the page.  So  'word' is p. 5:L17T.
(So I split the page into lines 1-19T or  lines 1-18B.  Do I make sense?
What do you think?
Thanks again for helping me.
Sincerely,
Margaret






[Non-text portions of this message have been removed]

#598 From: Margaret Pappas <margaret.pappas@...>
Date: Sat May 31, 2008 6:08 am
Subject: Re: Fwd: Introducing myself & a request[Scanned]
margaret.pappas
Send Email Send Email
 
Dear Jan~
I want readers to quickly be able to findthe indexedword on the page.There is
no doubt about it that what I described is time consuming.However, does it
make sense?
I have looked at the link you provided briefly to get an overview of software,
but I am not sure it will address my specific need (see above) since I am
creatinga traditional book index/corcordance...HOWEVER, after going through the
link thoroughly, I may need to reassess the whole project if WordEmbed can
assist me.

I appreciate you taking the time to help me. If I can clarify the project for
you, I can give you a call. I live in California.
Sincerely,
Margaret
P.S.Oops,I apologize for typingyour first name incorrectly...


----- Original Message ----
From: Jan Ross <jan@...>
To: web-indexing@yahoogroups.com
Sent: Friday, May 30, 2008 12:22:32 AM
Subject: RE: Fwd: [web-indexing] Introducing myself & a request[Scanned]


Hi Margaret (and others)

I may be missing the point of your email, having now deleted your
original one, but your 'interesting' way (below) looks to be somewhat
time-consuming. Have you looked at James Lamb's excellent little piece
of software called Word Embed. This may be able to help you, but not
remembering your original requirements, I'm not sure.

Have a look at http://www.jalamb. com/wordembed. html

Best wishes

Jan

~~~~~~~~~~~~ ~~~~~~~~~ ~~

Jan Ross. B.Sc, B.Sc Hons, M.Sc, FSI

Tel: 01492 581050

email: jan@merrall- ross..co.uk

<http://www.merrallr oss-wales. co.uk/>

____________ _________ _________ __

From: web-indexing@ yahoogroups. com [mailto:web-indexing@ yahoogroups. com]
On Behalf Of Margaret Pappas
Sent: 30 May 2008 01:46
To: web-indexing@ yahoogroups. com
Subject: Re: Fwd: [web-indexing] Introducing myself & a request[Scanned]

Dear Jessica~
Thanks so much for your quick response. The book is traditional book
index. Your comments make sense to me. As I 'see' it, the book index
really will be a 'concordance' ...however, I am wanting to avoid the
detailed 'paragraph, line' indexing if I can. Just recently, I came
across an interesting way to do it...For example, if 'word' is listed
on the page 5, I would count lines from top to bottom to find it.
Currently the book has 37 lines on the page. So 'word' is p. 5:L17T.
(So I split the page into lines 1-19T or lines 1-18B. Do I make sense?
What do you think?
Thanks again for helping me.
Sincerely,
Margaret

[Non-text portions of this message have been removed]






[Non-text portions of this message have been removed]

#599 From: "Dwight Walker" <dwight@...>
Date: Tue Jun 24, 2008 9:34 am
Subject: word index ideas
roorythmics
Send Email Send Email
 
On a computer screen, a search say in Google, would highlight the
word in the page in the results pages in bold or with some colour
background such as yellow.

Doing such a word index on paper (which it sounds like what you want
if you want to name every single word in the page indexed by its
location using some sort of code) means you have to maintain a huge
index of single words.

First create a list of stop words (words that are ignored) such
as 'a', 'the', 'an'. Then index the rest by paragraph at most.

I'd do an inverted index so the customer can just look up a word and
see it in context with the locator as page number or perhaps
paragraph number if they are numbered. Use the eye of the reader to
find the word they want.

See this on what an inverted index is:

http://www.answers.com/topic/inverted-file?cat=technology

I can only think a scientific or historical work would require such
exactness using manual processes.

Dwight Walker
WWWalker Web Development Pty Ltd
www.wwwalker.com.au

--- In web-indexing@yahoogroups.com, Margaret Pappas
<margaret.pappas@...> wrote:
>
> Dear Jan~
> I want readers to quickly be able to findthe indexedword on the
page.There is no doubt about it that what I described is time
consuming.However, does it make sense?
> I have looked at the link you provided briefly to get an overview
of software, but I am not sure it will address my specific need (see
above) since I am creatinga traditional book
index/corcordance...HOWEVER, after going through the link thoroughly,
I may need to reassess the whole project if WordEmbed can assist me.
>
> I appreciate you taking the time to help me. If I can clarify the
project for you, I can give you a call. I live in California.
> Sincerely,
> Margaret

#600 From: "Dwight Walker" <dwight@...>
Date: Tue Jun 24, 2008 9:50 am
Subject: Re: Nine articles on Indexing the Internet
roorythmics
Send Email Send Email
 
Can't you see that Google uses basic indexing skills like citation
indexes and those facts don't change just are scaled using massive
computer databases and search technologies like MapReduce to match a
word to a page.

http://labs.google.com/papers/mapreduce.html

Just because internet indexing is old doesn't mean it is useless to
engineer better indexes using the mind not just resolve to let a
massive computer algorithm chew through the data. You can drive the
technology to get a better result.

You seem to have a very dim view of manual indexes. Sure they are
outrun by Google but Google is only good because it uses good
indexing principles not because it has a much faster and scaleable
technology to build on.

Garbage in - garbage out.

If Google has a lousy algorithm based on brute force searching, it
would mean waiting for hours or days to get a result. They use good
mathetmatics and information retrieval to optimise retrieval.

There is an ignorance of information management in this question.
Most people seem to underrate librarians as just glorified clerks.

see

http://www.answers.com/information+science?cat=technology

Unfortunately librarians are often not good at showing the value of
information management and it takes a lot of convincing people who
reckon it is dead obvious how a search works and how best to optimise
the results so quicker more relevant results are found (the ranking).

Now I just use information management in project management as
software produces tons of information which is very well managed
using information principles. Why fight the outsiders? Just use the
power of knowledge and you will benefit despite the Philistines'
attacks.

Engineers always think information is bits and bytes. Librarians
always reckon information is classification. There is still a huge
gulf in understanding between the 2 disciplines. Luckily I did both
fields so am in the middle and can see both angles - a hybrid. It is
a very valuable cusp to be on and gave me tons of leads and creative
ideas when I crossed over into information science from straight
software engineering.

Dwight Walker
WWWalker Web Development Pty Ltd
www.wwwalker.com.au

> On Wed, 28 May 2008 15:10:41 -0000
>   "Martin Carbone" <martycarbone@...> wrote:
> > Indexing the Internet -- a guide to some good articles
> >on the subject.
> >
> > Evidently, it is no longer (2008) fashionable or even
> >acceptable to
> > consider the question of "Internet indexing". Most of
> >the articles we
> > found were written around 1996 - before Google took over
> >the field of
> > internet searching and became the de-facto way to find
> >information.
> >
> > In spite of that, we think some of you might find some
> >helpful
> > information in the nine articles listed at the following
> >URL.
> >
> > http://www.howto-ville.com/indexingtheinternet.html
> >
> >
> > Marty Carbone
> >
>

#601 From: Martin Carbone <martycarbone@...>
Date: Tue Jun 24, 2008 2:07 pm
Subject: misunderstanding about indexes and Google?
martycarbone
Send Email Send Email
 
I, Marty Carbone, originally wrote, in part: "Evidently, it is no longer (2008)
fashionable or
even acceptable to consider the question of "Internet indexing". Most of the
articles we
  found were written around 1996 - before Google took over the field of internet
searching
and became the de-facto way to find information. In spite of that, we think some
of you
might find some helpful information in the nine articles listed at the following
URL.
<< http://www.howto-ville.com/indexingtheinternet.html >> "

------------- Dwight Walker responded, in part ------

"Just because internet indexing is old doesn't mean it is useless to
engineer better indexes using the mind not just resolve to let a
massive computer algorithm chew through the data. You can drive the
technology to get a better result.

You seem to have a very dim view of manual indexes. ... "

-------------- My response to Dwight --------------

I do not have a dim view of manual indexes. My view is exactly the opposite. I
respect
manual indexes (also called directories). My entire website is basically a
manually
compiled index. I think manually compiled indexes and directories can be a much
more
efficient way to find things on the internet than using Google-type search boxes
exclusively.

If the compilers of the directories do a good job, the information they put in
their
directories will be much more on-point than the average single-entry-response
from Google.

Google-type searches have their place of course. It would be impossible to
compile a manual
index without first using Google type search-boxes.

The nine articles I list on my site have lots of good information. Most of it
seems to be
written before Google took over.

I think Google is amazing as a first try at finding things -- but all that
returned information must
be carefully sifted before it is truly useful.

In general, I love librarians


Marty



  Martin Carbone

martycarbone@...
http://www.alphabeticalist.com (send us tips for good websites for this site)
http://www.howto-ville.com/about.html
Tel: 760-603-1910
Fax: 760-603-1930
5123 Don Rodolfo Drive / Carlsbad, CA 93010


[Non-text portions of this message have been removed]

#602 From: Margaret Pappas <margaret.pappas@...>
Date: Tue Jun 24, 2008 9:16 pm
Subject: Re: word index ideas
margaret.pappas
Send Email Send Email
 
Dear Dwight,
I am intrigued by yourrecommendation and will go to the link you provide for
more information re:'inverted indexes'.
I truly appreciate you taking the time and helping me.
Sincerely,
Margaret
P.S. If I needed to get further clarification/assistance, may I call you?



----- Original Message ----
From: Dwight Walker <dwight@...>
To: web-indexing@yahoogroups.com
Sent: Tuesday, June 24, 2008 2:34:30 AM
Subject: [web-indexing] word index ideas


On a computer screen, a search say in Google, would highlight the
word in the page in the results pages in bold or with some colour
background such as yellow.

Doing such a word index on paper (which it sounds like what you want
if you want to name every single word in the page indexed by its
location using some sort of code) means you have to maintain a huge
index of single words.

First create a list of stop words (words that are ignored) such
as 'a', 'the', 'an'. Then index the rest by paragraph at most.

I'd do an inverted index so the customer can just look up a word and
see it in context with the locator as page number or perhaps
paragraph number if they are numbered. Use the eye of the reader to
find the word they want.

See this on what an inverted index is:

http://www..answers. com/topic/ inverted- file?cat= technology

I can only think a scientific or historical work would require such
exactness using manual processes.

Dwight Walker
WWWalker Web Development Pty Ltd
www.wwwalker. com.au

--- In web-indexing@ yahoogroups. com, Margaret Pappas
<margaret.pappas@ ...> wrote:
>
> Dear Jan~
> I want readers to quickly be able to findthe indexedword on the
page.There is no doubt about it that what I described is time
consuming.However, does it make sense?
> I have looked at the link you provided briefly to get an overview
of software, but I am not sure it will address my specific need (see
above) since I am creatinga traditional book
index/corcordance. ..HOWEVER, after going through the link thoroughly,
I may need to reassess the whole project if WordEmbed can assist me.
>
> I appreciate you taking the time to help me. If I can clarify the
project for you, I can give you a call. I live in California.
> Sincerely,
> Margaret






[Non-text portions of this message have been removed]

#603 From: Dbolso@...
Date: Thu Jul 3, 2008 9:22 am
Subject: New Web Indexing Article
dgolso
Send Email Send Email
 
Hi,

The June issue (vol. 26, no. 2, pp. 71-78) of The Indexer (Society of
Indexers, UK) contains and article by Ilana Kingsley about Web site indexes: 
"The
Usability of Academic Library Website Indexes: An Investigation."

The citation has also been posted on the Resources/Articles page of  our SIG
site at <_http://web-indexing.org/articles.htm_
(http://web-indexing.org/articles.htm) >.

Debbie Olson
SIG Webmaster



**************Gas prices getting you down? Search AOL Autos for
fuel-efficient used cars.     
(http://autos.aol.com/used?ncid=aolaut00050000000007)


[Non-text portions of this message have been removed]

#604 From: "Diana Tashjian" <diana@...>
Date: Thu Jul 3, 2008 3:59 pm
Subject: RE: New Web Indexing Article
diana@...
Send Email Send Email
 
Caveat: I'm a newbie, to indexing and to web indexing, but I read the
article with interest. I wanted to put my ideas out there and see what other
people thought.

My conclusion: I think we're stuck with users preferring the search over the
web index method and I don't believe education will change that. I was
trying to figure out why this may be so, and I think it's because the users
like having direct access to the data as opposed to using an index which
necessarily puts someone else's filter on the data. Maybe there's also an
ancillary benefit to doing a search and then maybe re-doing it: the user
learns more about the data on the site...

Anyway, just some ideas...

Diana Tashjian

-----Original Message-----

The June issue (vol. 26, no. 2, pp. 71-78) of The Indexer (Society of
Indexers, UK) contains and article by Ilana Kingsley about Web site indexes:
"The Usability of Academic Library Website Indexes: An Investigation."

The citation has also been posted on the Resources/Articles page of  our SIG
site at <_http://web-indexing.org/articles.htm_
(http://web-indexing.org/articles.htm) >.

Debbie Olson
SIG Webmaster

#605 From: Susan Woita <susanwoita@...>
Date: Thu Jul 3, 2008 5:31 pm
Subject: RE: New Web Indexing Article
susanwoita
Send Email Send Email
 
I agree that users are (and have been for a while) more search-centric and don't
understand the benefit of multiple information retrieval methods. Time swings
the pendulum, though, and for that reason, directories/indexes may experience a
renaissance of sorts down the line, perhaps in a different form.

I use directories for discovery and search for specific information. Often,
people don't know what they don't know and thus are missing critical mass of
data as a result. When I browse a directory, it sparks ideas for other areas of
interest that I didn't think of (and likely never would have occurred to me).

-Susan

===============

Brain Sox (for the Bohemian in you)

http://www.tackytrashwear.com/brainsox.html

--- On Thu, 7/3/08, Diana Tashjian <diana@...> wrote:
From: Diana Tashjian <diana@...>
Subject: RE: [web-indexing] New Web Indexing Article
To: web-indexing@yahoogroups.com
Date: Thursday, July 3, 2008, 8:59 AM











             Caveat: I'm a newbie, to indexing and to web indexing, but I read
the

article with interest. I wanted to put my ideas out there and see what other

people thought.



My conclusion: I think we're stuck with users preferring the search over the

web index method and I don't believe education will change that. I was

trying to figure out why this may be so, and I think it's because the users

like having direct access to the data as opposed to using an index which

necessarily puts someone else's filter on the data. Maybe there's also an

ancillary benefit to doing a search and then maybe re-doing it: the user

learns more about the data on the site...



Anyway, just some ideas...



Diana Tashjian



-----Original Message-----



The June issue (vol. 26, no. 2, pp. 71-78) of The Indexer (Society of

Indexers, UK) contains and article by Ilana Kingsley about Web site indexes:

"The Usability of Academic Library Website Indexes: An Investigation. "



The citation has also been posted on the Resources/Articles page of  our SIG

site at <_http://web-indexing .org/articles. htm_

(http://web-indexing .org/articles. htm) >.



Debbie Olson

SIG Webmaster





























[Non-text portions of this message have been removed]

#606 From: "Veronica" <veronicasemail@...>
Date: Fri Jul 11, 2008 7:33 pm
Subject: New Member Introduction
veronicasemail
Send Email Send Email
 
My name is Veronica Hughes and I am a new member of the Web Indexing
Group, coming here through the American Society for Indexing
(http://www.asindexing.org/site/asiabout.shtml). My background is in
librarianship and I am currently in the process of branching out on my
own as an indexer/information services provider.

A little about my background: I recently graduated with a Master's of
Science in Library Information Science and Certificate of Advanced
Studies in Digital Libraries concentrating in Information
Organization. As a part of my coursework and independent study I
created access to eighteen retrospective issues of a Nevada newspaper,
The Boulder City Daily News. This involved constructing a controlled
vocabulary, creating subject and name indexes (both in print and
database formats), and digitizing the paper for linking to in the
database. I have also done volunteer work for the Nevada State Museum
& Historical Society's library
(http://dmla.clan.lib.nv.us/docs/MUSEUMS/lv/vegas.htm), organizing and
classifying ephemera, and designing and creating databases for
uncataloged books.

And, by the way, although my references are to Nevada, I moved to
Washington State this past May. I
look forward to contributing to the group and thanks in advance for
sharing your knowledge.

Veronica Hughes

#607 From: "Suzi Kaplan" <suzik@...>
Date: Sat Jul 26, 2008 11:09 pm
Subject: RE: New Member Introduction
sewk
Send Email Send Email
 
Welcome Veronica.  Web-indexing is a great SIG with wide open opportunities for
involvement.  Chime in anytime with
ideas, questions or just about anything else web and index related!

Suzi


-----Original Message-----
From: web-indexing@yahoogroups.com [mailto:web-indexing@yahoogroups.com] On
Behalf Of Veronica
Sent: Friday, July 11, 2008 3:33 PM
To: web-indexing@yahoogroups.com
Subject: [web-indexing] New Member Introduction

My name is Veronica Hughes and I am a new member of the Web Indexing
Group, coming here through the American Society for Indexing
(http://www.asindexing.org/site/asiabout.shtml). My background is in
librarianship and I am currently in the process of branching out on my
own as an indexer/information services provider.

A little about my background: I recently graduated with a Master's of
Science in Library Information Science and Certificate of Advanced
Studies in Digital Libraries concentrating in Information
Organization. As a part of my coursework and independent study I
created access to eighteen retrospective issues of a Nevada newspaper,
The Boulder City Daily News. This involved constructing a controlled
vocabulary, creating subject and name indexes (both in print and
database formats), and digitizing the paper for linking to in the
database. I have also done volunteer work for the Nevada State Museum
& Historical Society's library
(http://dmla.clan.lib.nv.us/docs/MUSEUMS/lv/vegas.htm), organizing and
classifying ephemera, and designing and creating databases for
uncataloged books.

And, by the way, although my references are to Nevada, I moved to
Washington State this past May. I
look forward to contributing to the group and thanks in advance for
sharing your knowledge.

Veronica Hughes



------------------------------------

Yahoo! Groups Links

#608 From: Heather Hedden <heather@...>
Date: Tue Aug 26, 2008 3:36 pm
Subject: Web site indexing online workshop
hbhedden
Send Email Send Email
 
Registration is now open for the fall 2008 session of the online
workshop "Creating Web Site Indexes"
It runs four weekly sessions during the month of October,  this is the
only time that it is offered this year, and it will probably not be
offered again until fall 2009.
It is offered by the Continuing Education Program of Simmons College
Graduate School of Library and Information Science.

Workshop summary:
http://www.simmons.edu/gslis/continuinged/workshops/online.shtml#online_a28
Registration information:
http://www.simmons.edu/gslis/continuinged/register.shtml
Outline and readings (pdf):
http://gslisce.simmons.edu/downloads/08FA/Website_Indexes.pdf
Instructor information:
http://gslisce.simmons.edu/instructors/ce-hedden.html
Online course home page sample preview:
http://www.hedden-information.com/view2.php

International participation is welcome. There is no scheduled
participation time; students log in at their convenience.

For registration and payment questions, contact the continuing education
program administrator, Kris Liberman, 617-521-2803, gslisce@...
For workshop content/syllabus questions contact me, Heather Hedden, the
instructor, heather@...

#609 From: Dbolso@...
Date: Fri Sep 5, 2008 10:34 am
Subject: SIG web site update
dgolso
Send Email Send Email
 
Hi,

I'll be updating our SIG site at the end of the month so please send any
links/citations to any interesting articles you've read related to web indexing
or info about any relevant upcoming workshops, classes, or conferences. Any
other comments or suggestions are also welcome!

Thank you,

Debbie Olson
ASI Web SIG webmaster



**************It's only a deal if it's where you want to go. Find your travel
deal here.
(http://information.travel.aol.com/deals?ncid=aoltrv00050000000047)


[Non-text portions of this message have been removed]

#610 From: "Jeannine McAllister " <jmcallister@...>
Date: Fri Sep 5, 2008 3:08 pm
Subject: RE: SIG web site update
jeannine_mca...
Send Email Send Email
 
Debbie-

I stepped down from being the manager of the Behavioral and Social Sciences
SIG.  Please asked around to find out who took my place.

Thank you,

Jeannine

-----Original Message-----
From: web-indexing@yahoogroups.com [mailto:web-indexing@yahoogroups.com] On
Behalf Of Dbolso@...
Sent: Friday, September 05, 2008 8:35 AM
To: web-indexing@yahoogroups.com
Subject: [web-indexing] SIG web site update

Hi,

I'll be updating our SIG site at the end of the month so please send any
links/citations to any interesting articles you've read related to web
indexing
or info about any relevant upcoming workshops, classes, or conferences. Any

other comments or suggestions are also welcome!

Thank you,

Debbie Olson
ASI Web SIG webmaster



**************It's only a deal if it's where you want to go. Find your
travel
deal here.
(http://information.travel.aol.com/deals?ncid=aoltrv00050000000047)


[Non-text portions of this message have been removed]


------------------------------------

Yahoo! Groups Links

#611 From: Dbolso@...
Date: Wed Sep 17, 2008 12:47 pm
Subject: Dublin Core metadata used in header of web pages
dgolso
Send Email Send Email
 
Has anyone had occasion to use a metadata schema (e.g. Dublin Core  elements)
in the head section of a web site?

I understand this provides for better interoperability and harvesting and  is
a step up from the traditional placement of keywords in the head section. For
  an example you can view the source codes at _www.ibm.com_
(http://www.ibm.com) .

Debbie Olson
Freelance Indexer & Librarian
_www.olson-info.com_ (http://www.olson-info.com)



**************Psssst...Have you heard the news? There's a new fashion blog,
plus the latest fall trends and hair styles at StyleList.com.
(http://www.stylelist.com/trends?ncid=aolsty00050000000014)


[Non-text portions of this message have been removed]

#612 From: Glenda Browne <glendabrowne@...>
Date: Sat Sep 20, 2008 7:18 am
Subject: Re: Dublin Core metadata used in header of web pages
index4fun
Send Email Send Email
 
We have used Dublin Core metadata elements at www.nps.org.au, along with
one AGLS element (Australian Government extension of DC) and some plain
elements. One aim was to enable more straightforward harvesting of the
data. One advantage of a schema in general is that you can define fields
that are compulsory, those that can be duplicated and so on. One
disadvantage of using Dublin Core, which I never fully comprehended, is
that Dublin Core subject and description are apparently less likely to
be used by web-wide search engines than plain description and subject
fields. For this reason we duplicated these.

Glenda.

Dbolso@... wrote:
> Has anyone had occasion to use a metadata schema (e.g. Dublin Core  elements)
> in the head section of a web site?
>
> I understand this provides for better interoperability and harvesting and  is
> a step up from the traditional placement of keywords in the head section. For
>  an example you can view the source codes at _www.ibm.com_
> (http://www.ibm.com) .
>
> Debbie Olson
> Freelance Indexer & Librarian
> _www.olson-info.com_ (http://www.olson-info.com)
>
>
>
> **************Psssst...Have you heard the news? There's a new fashion blog,
> plus the latest fall trends and hair styles at StyleList.com.
> (http://www.stylelist.com/trends?ncid=aolsty00050000000014)
>
>
> [Non-text portions of this message have been removed]
>
>
> ------------------------------------
>
> Yahoo! Groups Links
>
>
>
>

#613 From: Dbolso@...
Date: Tue Sep 23, 2008 10:23 am
Subject: Re: Dublin Core metadata used in header of web pages
dgolso
Send Email Send Email
 
This is good to know/keep in mind. Thank you.

Debbie

In a message dated 9/20/2008 3:18:37 A.M. Eastern Daylight Time,
glendabrowne@... writes:

One
disadvantage of using Dublin Core, which I never fully comprehended, is
that Dublin Core subject and description are apparently less likely to
be used by web-wide search engines than plain description and subject
fields. For this reason we duplicated  these.

Glenda.







**************Looking for simple solutions to your real-life financial
challenges?  Check out WalletPop for the latest news and information, tips and
calculators.      (http://www.walletpop.com/?NCID=emlcntuswall00000001)


[Non-text portions of this message have been removed]

Messages 584 - 613 of 739   Oldest  |  < Older  |  Newer >  |  Newest
Add to My Yahoo!      XML What's This?

Copyright 2010 Yahoo! Inc. All rights reserved.
Privacy Policy - Terms of Service - Guidelines NEW - Help