Professional Documents
Culture Documents
Glossary
Glossary
Glossary
Many thanks to Marcia Lei Zeng of the School of Library and Information Science at Kent State University,
who reviewed the glossary and provided extremely valuable input.
subject of the query. Relevance depends context of the World Wide Web, the term social bookmarking
on the searcher’s subjective perception usually refers to a program that searches The decentralized practice and method
of the degree to which the document a large index of Web pages generated by by which individuals and groups create,
fulfills the information need, which may an automated Web crawler. See also Web classify, store, discover, and share Web
or may not have been expressed fully or search engine. bookmarks or “favorites” in an online
with precision in the search statement. “social” environment.
semantic interoperability
Measures of the effectiveness of infor-
The ability of different agents, services, social tagging
mation retrieval, such as precision and
and applications to communicate data The decentralized practice and method
recall, depend on the relevance of search
while ensuring accuracy and preserving by which individuals and groups create,
results. (Definition from ODLIS, Online
the meaning of the data (definition based manage, and share terms, names, and so
Dictionary for Library and Information
on Marcia Bates and Mary Niles Maack, on (called tags), to annotate and catego-
Science, http://lu.com/odlis/.)
Encyclopedia of Library and Information rize digital resources in an online “social”
relevance ranking Sciences, 3rd ed. [New York: Marcel environment. A folksonomy is the result
The algorithmic process, a feature of Dekker, forthcoming]). of social tagging. Also referred to as
many search software applications, by collaborative tagging, social classifica-
Semantic Web
which results in a result set are sorted tion, social indexing, mob indexing, folk
An evolving, collaborative effort led
or ranked according to their relevance. categorization. See also folksonomy,
by the W3C whose goal is to provide a
In OPACs, for example, relevance is tagging.
common framework that will allow data
computed based upon the number of
to be shared and re-used across various spamming
occurrences of the search term in the
applications as well as across enterprise Used in reference to meta tags. The abuse
record that is retrieved, and the weight
and community boundaries. It derives of metadata that creators include in the
assigned to the field(s) in which the
from W3C director and inventor of the HTML header area of their Web pages
search term appears. (Definition from
World Wide Web Sir Tim Berners-Lee’s in order to increase the number of visi-
ODLIS, Online Dictionary for Library
vision of the Web as a universal medium tors to a Web site. Keyword spamming
and Information Science, http://lu.com/
for data, information, and knowledge entails repeating keywords multiple times
odlis/.) Google’s PageRank™ is an
exchange. in order to appear at the top of search
example of a relevance ranking algorithm.
engine result listings or listing keywords
server
resource discovery that are irrelevant to the site in order to
An application that supplies resources or
The process of searching for specific attract visitors under false pretenses.
resource manifestations. Often used to
information objects on the Web.
refer to a networked computer that acts as spider
robot a source of data and/or applications used See Web crawler.
See Web crawler. by multiple client computers or devices.
SRU/SRW (Search and Retrieve
See also client.
schema via URL/Search and Retrieve Web
A set of rules for encoding information service provider (OAI Service)
that supports specific communities of nomenclature) Companion protocols for Web search
users. Also called “scheme.” The plural An institution or organization that queries utilizing the CQL Common
forms of the word schema are schemas harvests metadata from data providers Query Language. http://www.loc
and schemata. See also XML schema. and uses the aggregated metadata as a .gov/standards/sru/.
basis for building value-added services.
schema registry surrogate
An authoritative source of names, SGML (Standard Generalized See digital surrogate.
semantics, and syntaxes for one or more Markup Language)
tagging
schemas. International Standards Organization
In the context of the Web, the act of
standard ISO/IEC 8879:1986; a markup
screen scraping associating terms (called tags) with
language first used by the publishing
A technique in which display data an information object (e.g., a Web
industry, for defining, specifying, and
(usually unstructured) is automatically page, an image, a streaming video
creating digital documents that can be
retrieved and extracted, for example, from clip), thus describing the item and
delivered, displayed, linked, and manipu-
a Web page. enabling keyword-based classification
lated in a system-independent manner.
and retrieval. Tags—a form of user-
search engine XML and HTML are derived from SGML.
generated metadata—from communities
A computer program that allows users
of users can be aggregated and analyzed,
to search electronic resources. In the
providing useful information about the host and directory path. For example, on the Web and puts them in an index
collection of objects with which the tags urn:issn:0167-6423 is the URN for or database that Web users can search
have been associated. See also social the journal Science of Computer in a variety of ways. The search results
tagging. Programming. provide links back to the pages matching
the user’s search in their original
taxonomy Visible Web
location.
An orderly classification that explicitly The subset of the World Wide Web that
expresses the relationships, usually hier- is visible to Web browsers and indexable wiki
archical (e.g., genus/species, whole/part, by search engines’ Web crawlers. To be A collaborative Web site that contains
class/instance), between and among the accessible to Web crawlers, the pages pages that any authorized user can edit.
things being classified. must be accessible simply by following Wikis typically retain all former versions
links (i.e., not generated dynamically in of each page, allowing the revision
TCP/IP (Transmission Control
response to user input) and not protected history of a page to be tracked and for
Protocol/ Internet Protocol)
by a password. unwanted revisions to be reversed.
The ISO standardized suite of network
protocols that enables information VRA Core 4.0 Wikipedia
systems to communicate with other infor- An XML schema for describing works A free, collaborative, volunteer-driven
mation systems on the Internet, regard- of art and architecture and their visual Web-based encyclopedia that utilizes wiki
less of their computer platforms. surrogates. http://www.vraweb.org/ software to allow anyone to edit articles.
projects/vracore4/index.html http://en.wikipedia.org/wiki/.
TEI (Text Encoding Initiative)
An international cooperative effort to W3C (World Wide Web Consortium) World Wide Web
develop guidelines for standard encoding The main international standards organi- A vast distributed wide-area client-server
schemes (i.e., the TEI and TEI Lite DTDs) zation for the World Wide Web. architecture for retrieving hypermedia
for literary and linguistic texts. http:// documents over the Internet.
Web 2.0
www.tei-c.org/.
A phrase used loosely by the Web devel- XHTML (Extensible HyperText
URI (Uniform Resource Identifier) opment community to refer to a perceived Markup Language)
A short string that uniquely identifies a “second generation” of Web technologies A reformulation of HTML in XML.
resource such as an HTML document, an and applications. Wikis, folksonomies,
XML (Extensible Markup Language)
image, a downloadable file, or a service. gaming, podcasting, blogging, and so on,
A simple, flexible markup language
URLs and URNs are types of URIs. are all considered Web 2.0 applications.
derived from SGML. Originally designed
URL (Uniform Resource Locator) Web browser for large-scale electronic publishing,
A type of URI consisting of an Internet A software application that enables users XML is now playing an increasingly
address that tells users how and where to view and interact with information and important role in the publication and
to locate a specific file on the World media files on the Web. Internet Explorer, exchange of a wide variety of data on
Wide Web. A URL includes not only the Mozilla Firefox, and Netscape Navigator the Web.
name of a file but also the name of the are examples of Web browsers.
XML schema
host computer, the directory path to get
Web crawler (robot, spider) A machine-readable definition of
to that file, and the protocol needed in
A software program that systematically the structure, elements, and attri-
order to use it (e.g., http://www.getty.edu/
traverses the Web, either for the purpose butes allowed in a valid instance of
research/conducting_research/standards/
of generating a searchable index of Web a conforming XML document. XML
intrometadata/intro.html specifies that the
content or to gather statistics. schemas are expressed using the
hypertext transfer protocol “http” should
XML Schema Definition language, a
be used to retrieve the document intro. Web server
W3C standard. http://www
html from the host www.getty.edu in the A computer that is able to respond to
.w3.org/TR/xmlschema-0/.
directory research/conducting_research/ HTTP requests from clients known as
standards/intrometadata. Web browsers and return the appropriate XMP (Extensible Metadata
HTTP responses—most typically serving Platform)
URN (Uniform Resource Name)
an HTML page. A markup language, based on RDF, for
A type of URI consisting of a unique,
recording and embedding metadata
location-independent identifier of a Web search engine/Internet
about digital assets. Developed by Adobe
file available on the Internet. The file search engine
Systems and supported across the
remains accessible by its URN regard- A software program that collects data
company’s range of software products
less of changes that might occur in its taken from the content of files available
http://www.getty.edu/research/conducting_research/standards/intrometadata/