Professional Documents
Culture Documents
The Approach 1: Seminar Report '03 Project Oxygen
The Approach 1: Seminar Report '03 Project Oxygen
CONTENTS
1. THE APPROACH 1
2. SYSTEM TECHNOLOGIES 3
3. USER TECHNOLOGIES 13
4. SOFTWARE TECHNOLOGIES 20
6. PERCEPTUAL TECHNOLOGIES 25
7. CHALLENGES 29
8. CONCLUSION 30
9. REFERENCES 31
ABSTRACT
INTRODUCTION
HISTORY
FEATURES
Human-centered computation
Freely available
Communicate naturally
Boost our productivity
TYPES OF TECHNOLOGIES
System Technologies
User Technologies
SYSTEM TECHNOLOGIES
Users can select hand-held devices, called H21s, appropriate to the tasks
they wish to perform. These devices accept speech and visual input, can
reconfigure themselves to perform a variety of useful functions, and support a
range of communication protocols. Among other things, H21s can serve as
cellular phones, beepers, radios, televisions, geographical positioning systems,
cameras, or personal digital assistants, thereby reducing the number of special-
People use Oxygen to accomplish tasks that are part of their daily lives.
Universally available network connectivity and computational power enable
decentralized Oxygen components to perform these tasks by communicating and
cooperating much as humans do in organizations. Components can be delegated
to find resources, to link them together in useful ways, to monitor their progress,
and to respond to change.
N21 NETWORKS
are comfortable integrating Oxygen into our personal lives. N21s support
multiple communication protocols for low-power local, building-wide, and
campus-wide communication, enabling us to form collaborative regions that
arise, adapt, and collapse as needed.
Flexible,
decentralized networks, called N21s, connect dynamically changing
configurations of self-identifying mobile and stationary devices. N21s integrate
different wireless, terrestrial, and satellite networks into one seamless internet.
Through algorithms, protocols, and middleware, they
COLLABORATIVE REGIONS
SECURITY
ADAPTATION
SOFTWARE ARCHITECTURE
USER TECHNOLOGIES
Spoken language and visual cues, rather than keyboards and mice, define
the main modes of interaction with Oxygen. By integrating these two
technologies, Oxygen can better discern our intentions, for example, by using
vision to augment speech understanding through the recognition of facial
expressions, gestures, lip movements, and gaze. These perceptual technologies
are part of the core of Oxygen, not just afterthoughts or interfaces to separate
applications.
KNOWLEDGE ACCESS
DATA REPRESENTATION
DATA ACQUISITION
The arrival of new data triggers automated services, which, in turn, obtain
further data or trigger other services. Automatic services fetch web pages, extract
text from postscript documents, identify authors and titles in a document,
recognize pairs of similar documents, and create document summaries that can
be displayed as a result of a query. The system allows users to script and add
more services, as they are needed.
Since automated services can go only so far in carrying out these tasks,
the system allows users to provide higher quality annotations on the information
they are using, via text, speech, and other human interaction modalities.
AUTOMATED OBSERVERS
Subsystems watch the queries that users make, the results they dwell
upon, the files they edit, the mail they send and receive, the documents they read,
and the information they save. The system exploits observations of query
behavior by converting query results into objects that can be annotated further.
New observers can be added to exploit additional opportunities. In all cases, the
observations are used to tune the data representation according to usage patterns.
AUTOMATION
Basic automation objects are "black boxes" of low-level actions that can
be managed by higher-level automation processes. The objects can be either
physical or virtual. A basic physical object senses or actuates a physical entity—
it may sense the temperature or whether an office door is open, and it may crank
up the heat or send an image to a display. A basic virtual object collects,
generates, or transforms information—it may extract designated items from
incoming electronic forms, operate on them in a designated manner, and send the
results to a particular device.
A common intelligent interface connects basic physical objects to the network.
The interface consists of a chip containing a microprocessor, a network adapter,
main memory, and non-volatile storage. It makes different sensors, actuators,
and appliances more powerful, provides device status information, reduces the
bandwidth they require, and downloads commands and new low-level software.
Any software object can be a basic virtual object. Electronic forms are
particularly common basic virtual objects, because they serve as convenient
"interfaces" for exchanging information among people and organizations.
COLLABORATION
noticing when information required for those tasks has been developed, and
making conclusions when appropriate.
SOFTWARE TECHNOLOGIES
ABSTRACTION
SPECIFICATIONS
PERCEPTUAL TECHNOLOGIES
SPEECH
SPEECH RECOGNITION
LANGUAGE UNDERSTANDING
LANGUAGE GENERATION
SPEECH SYNTHESIS
VISION
OBJECT RECOGNITION
during operation. The component recognizes objects even if they are new to the
system or move freely in an arbitrary setting against an arbitrary background. As
people do, it adapts to objects, their physical characteristics, and their actions,
thereby learning to improve object-specific performance over time.
CHALLENGES
CONCLUSION
REFERENCES
1. WWW.LCS.MIT.EDU
2. WWW.AI.MIT.EDU
3. WWW.GLOBAL.ACER.COM
4. WWW.DELTACA.COM
5. WWW.HP.COM
6. WWW.NTT.CO.JP
7. WWW.NOKIA.COM
8. WWW.RESEARCH.PHILIPS.COM
9. IEEE Spectrum March 2002