Professional Documents
Culture Documents
The Future of Information Filtering
The Future of Information Filtering
The Future of Information Filtering
� NOTICE
- Orrin Klapp, Overload and Boredom: Essays on the Quality of Life in the Information Society
Never before has so much information been available to the general public. And
never before has the flow of
newly-created information been greater. There has
also never before been a time when so much information has
been available in an
electronic form. We live in an "Information Age," the extent of which is
growing by the
minute. "We have for the first time an economy based on a key
resource that is not only renewable, but self-
generating," notes John Naisbitt
in Megatrends. "Running out is not a problem, but drowning in it is."
Information Anxiety
besser.tsoa.nyu.edu/impact/s94/students/paul/paul_final.html 1/4
12/18/21, 11:45 AM The Future of Information Filtering
problem only
gets worse if we consider the databases of "reference" information, which are
growing at an even
greater rate.
The main goal of Wurman's Information Anxiety is to help readers filter their
conventional information. He
prescribes a "Low-Fat Information Diet," which
should limit the sources a person should look through. He
suggests limiting
the input to one of each kind of information source (daily newspaper, news
magazine, culture
magazine...). Unless a person has a personal reader who will
perform filtering for him or her (like some rich
executives), this kind of
approach is necessary.
Another method would observe the articles that a user decides to read, analyze
their content, and add that
information to a cumulative user profile. For more
accurate results, a program could ask a reader to indicate how
interesting each
article is after reading it. A system could construct a user profile from this
information by
simply add every word found in interesting articles, and
increasing the ranking of words that appear multiple
times.
Commonly-used words can create problems when this kind of approach is taken.
We wouldn't want occurances
of conjunctions to determine if an article is
filtered in or out. The simple solution is to construct a list of
commonly-used words, and make sure they are never put into a user profile. A more advanced method would
besser.tsoa.nyu.edu/impact/s94/students/paul/paul_final.html 2/4
12/18/21, 11:45 AM The Future of Information Filtering
analyze a word within the context of an article, by making a ratio out of the number of times the word occurs in
the article and the total words in the article. This is then compared to average occurance of that word in any text.
If the occurance of the word is higher in our article, the program weights it more significantly.
Once the program has created a reader profile, it can use this to screen
articles in or out. The program can, again,
look directly at the number of
keyword matches, calculate ranked matchings, and compare the matchings with
average occurances of the word in a random text. Programs could also try to
recognize synonymous or related
words and calculate that into the ranking.
Some studies have found that the best result come from performing a number of
different methods and
combining the results. Particular articles tend to be
matched well by particular methods, and using one matching
model usually
filters out too many close matches. The ultimate goal in the field of
artificial intelligence is to
emulate the understanding of ideas with a
computer, and current research is aspiring to results much greater than
those
mentioned here.
A more realistic problem with alternative voices, however, may be more subtle.
If a complete information
filtration system is built to include daily news, it
will have to not only choose stories of interest, but select the
"best"
relation of that story. However that is determined (and whether that is
determined on a story-by-story
basis or not) would drastically affect a
person's (or a public's) perception of the news.
Bibliography
Belkin, Nicholas J., et al, "Information Filtering and Information Retrieval: Two Sides of the Same Coin?",
Communications of the ACM (Volume 35, No 12, December 1992).
Foltz, Peter W., et al, "Personalized Information Delivery: An Analysis of Information Filtering Methods",
Communications of the ACM (Volume 35, No 12, December 1992).
besser.tsoa.nyu.edu/impact/s94/students/paul/paul_final.html 3/4
12/18/21, 11:45 AM The Future of Information Filtering
Klapp, Orrin, Overload and Boredom: Essays on the Quality of Life in the
Information Society (New York:
Greenwood Press, 1986).
Impact Main Menu
besser.tsoa.nyu.edu/impact/s94/students/paul/paul_final.html 4/4