Download as pdf or txt
Download as pdf or txt
You are on page 1of 4

www.ijecs.

in
International Journal Of Engineering And Computer Science ISSN:2319-7242
Volume1 Issue 2 Nov 2012 Page No. 59-62

THE NEW ERA OF BROWSING -VOICE BROWSING


Khushbu1 , Manika Kapoor2 , Ayesha Tafsir3

1
Student, CSE, Dronacharya College of Engineering, Gurgaon, India, khushi.bhardwaj1991@gmail.com
2
Student, CSE, Dronacharya College of Engineering, Gurgaon, India, manikakapoor91@gmail.com
3
Student, CSE, Dronacharya College of Engineering, Gurgaon, India, tafsirayesha@gmail.com

Correspondence Author: Khushbu1Student, CSE, Dronacharya College of Engineering, Gurgaon, India


: khushi.bhardwaj1991@gmail.com

ABSTRACT:
The technology of voice browsing is rapidly evolving these days. It is because the use of
cellphones is increasing at a very high rate, as compared to connected PCs. Listening and
speaking are the natural modes of communication and information gathering. As a result we are
now heading towards a more voice based approach of browsing rather than operating on textual
mode. This paper concentrates on this new technique, voice browsing, which unites speech
recognition and speech synthesis that can be very fruitful in the coming years.

1.INTRODUCTION: speech or voice browser? The software


system procures its information from the
A voice browser can simply be defined as an internet. From a user's outlook, the goal is to
appliance or a gear which helps in provide to the devices which do not have
interpreting a markup language(the markup full-browsers or even the screens to support
language referred here is 'voice') and them, a service which is similar to what the
producing a voice output. It translates a visual web browsers and the related
given voice input into a voice output. It is technologies offer today.
the web browser which provide the users
with an interactive voice user interface. It 2. STANDARDIZATION:
also provides the user with an interface to
the PSTN(Public Switched Telephone Standardization to voice browsing technique
Network). It is obvious from the first word were given by:
of the name that the system deals with pages
The World Wide Web Consortium (W3C)
that specify voice dialogues, just as our
develops interoperable technologies
visual web pages deals with HTML pages.
(specifications, guidelines, software, and
But the question remains-how does a
tools) to lead the Web to its full potential as
software system reciprocates to the user via
a forum for information, commerce,
Khushbu International Journal Of Engineering And Computer Science 1:2 Nov 2012(59-62)

communication, and collectiveunder who are familiar with Web development


standing. W3C which includes: techniques. So, applications are written
using parts of speech interface framework.
1 .Voice Browser Working Group Thus speech applications are written in
VoiceXML and are rendered through a
2. Speech Interface Framework Voice Browser. In much the same way as
2.1 Voice Browser Working Group Web applications are written in html and
run on a Web browser.
It was established on 26 March 1999 and
re-chartered through 31 January 2009. W3C As per estimation, over 85% of Interactive
voice browser working group made the Voice Response (IVR) applications for
speech interface framework possible . This telephones (including mobile) use W3C's
framework allows developers to create VoiceXML standard.
speech enabled applications that are based Voice Browser Working Group are
on Web technologies. coordinating their efforts to make the Web
The framework also provides developer with available on more devices and in more
an environment that will be familiar to those situations.
2.1(a )Aim: telephony, and mixed initiative
The Aim of the W3C Working Group is to conversations. Some of its versions are:
enable users to speak and listen to Web VoiceXML 1.0: designed for
applications by making standard languages creating audio dialogs.
for developing Web-based speech VoiceXML 2.0: uses form
applications. This Working Group interpretation algorithm(FIA).
concentrates on languages for capturing and VoiceXML 2.1: 8 additional
producing speech and managing the elements.
conversation between user and computer Voice XML 3.0: relationship
system, while a related Group, the between semantics and syntax.
Multimodal Interaction Working Group, 2.2(b) Speech Recognition Grammar
works on additional input modes including Specification (SRGS) 1.0: a document
keyboard and mouse, ink and pen, etc. language that can be used by developers to
2.1(b) W3C Recommendations: Its specify the words and patterns of words to
recommendations have been reviewed by be listened for by a speech recognizer or
w3c group Members, by software other grammar processor.
developers, and other interested parties, and
are also endorsed by the Director as Web 2.2(c) Speech Synthesis Markup
Standards. Language (SSML): a markup language for
rendering a combination of prerecorded
2.2 Speech Interface Framework: speech, synthetic speech, and music. Some
These framework includes: of its versions are:
2.2(a) VoiceXML: a language for Speech Synthesis Markup
creating audio dialogs that feature Language (SSML) 1.0
synthesized speech, digitized audio, Speech Synthesis Markup
60

recognition of spoken and DTMF key Language (SSML) 1.1


input, recording of spoken input,
Page
Khushbu International Journal Of Engineering And Computer Science 1:2 Nov 2012(59-62)

2.2(d) Semantic Interpretation (SISR): This is the second level through which we
document format that represents annotations can examine the voice browsing technique.
to grammar rules for extracting the semantic We can browse the websites through voice
results from recognition. which offer voice portals. In order to
maintain customer loyalty these portals offer
Eg: Semantic Interpretation(SISR)1.0 voice browsing of personal content. To have
version a claim in this sector or we can say market,
AOL bought quack.com.
2.2(e) Pronunciation Lexicon
Specification (PLS) a representation of Almost everywhere the voice portal market
phonetic information for use in speech is being expanded, be it Europe, the US or
recognition and synthesis . Asia.
Eg: Pronunciation Lexicon Specification 3.3 The Voice Web
(PLS) 1.0 version
The level three is the voice web. Many
companies have introduced forums for
3. LEVELS OF VOICE BROWSING programmers for setting up voice sites so as
In order to understand it better in its current to enhance the interest in voice browsing
form, voice browsing can be examined and speech recognition. The forums further
under three levels. become voice webs which sprout voice
auction sites and voice based chat rooms.
3.1 voice browsing
4. Future Of Voice Technology
This is the first level in examining the The speech technology is supposed to grow
technique of voice browsing. The simplest rapidly. The voice portal market is going to
way of understanding the voice browser and reach billions in just a few years. It is
the voice web would be to take the web estimated by the kelsey group that voice
itself into consideration. We know that browsing market will reach 6.5 billion
people all over the world visit websites and dollars, while OVUM estimates a world
get visual feedback. Now, the voice web market of 26 billion dollars. Anyone may
contains voice sites where the feedback is guess the actual growth of the industry of
delivered through dialogues. The most basic voice technology due to variations in these
example would be calling up our cellphone figures. It is very difficult to navigate on a
operator portal, where speech recognition WAP to scroll through many lists. Hands-
software provides the caller with a series of free interaction enables us to develop an
options like recharging your account, talking easy communication between the user and
to the customer care executive or listen to the system.
the new offers. The technology of voice
browsing has brought down the cost voice browsing can be used to access three
considerably, since earlier it would cost a kinds of information:
rupee per minute for human operators to talk
to customers while 10paise per minute for (a)Business: information like automated
an automated call. Voice browsing also has telephone ordering services, support desks,
its use in the corporate sector specially in order tracking, airline arrival and departure
61

banks and airlines. information services, cinema and theatre


booking services, home banking services,
Page

3.2 voice browsing the world wide web.


Khushbu International Journal Of Engineering And Computer Science 1:2 Nov 2012(59-62)

etc can be retrieved using voice browsing 5. Conclusion:


very easily.
In order to make technology more familiar
(b)public: voice browser can be used to to the user its access should be made more
access services like local , national and easier. As we know that visual internet
international news alongwith community access experiences various limitations such
information such as weather forecasting, as people who are physically handicapped
traffic conditions, school closure and events. (specially blind users) cannot use keypads or
it can also be used to gather information on touch screens for giving instructions. Above
national and international stock market all these limitations todays generation
information and also business and e- demands to use internet independent of PCs
commerce transactions. and also hands free access to it. For this
VOICE BROWSING is an intelligent idea.
(c) personal use : It is used in accessing This allows user to access web even in
personal information like voice mails,
situations like driving etc where user operate
personal horoscope ,personal newsletter, web just by listening and speaking rather
calendars, address and telephone lists etc.
then typing. Thus at last we conclude that
In future it is expected that voice browsing Voice browsing provides a natural way of
will become visual i.e MULTI MODAL. accessing webs. Now it is up to the
But greatest achievement would be when developers to take up some inventory
voice browsing is integrated with all types measures in order to bring this technology to
of operating system .This success would us in a more colourful way.
surly make voice browsing available to each
and every application.

62
Page

You might also like