Professional Documents
Culture Documents
Text To Speech: A Simple Tutorial: D.Sasirekha, E.Chandra
Text To Speech: A Simple Tutorial: D.Sasirekha, E.Chandra
Text To Speech: A Simple Tutorial: D.Sasirekha, E.Chandra
Published By:
Retrieval Number: A0435022112/2012©BEIESP Blue Eyes Intelligence Engineering
275 & Sciences Publication
Text To Speech: A Simple Tutorial
Mrs. - Misses
St. Joseph St. - Saint Joseph Street
Acoustic Prosodic Modeling
Processing & Intonation (iii). Acronym converter: Acronyms are replaced by single
letter components.
S. I. - S I
Speech as output
(iv). Word segmentation:
Sentences are a group of word segments. Special delimiter
Fig 1: System Overview of TTS
to separate segments.
(i.e. „||‟).Segments can be an acronym, a single word or a
A. Text Analysis and Detection numeral.
The Text Analysis part is preprocessing part which analyse Examples of acronyms:
the input text and organize into manageable list of words. It
consists of numbers, abbreviations, acronyms and idiomatics “NATO” - “nayto”
and transforms them into full text when needed. An “HIV” - “aitch eye ve”
important problem is encountered as soon as the character “Henry IV” - “Henry the fourth”
level : that of punctuation ambiguity (sentence end “Chapter IV”- “Chapter four”
detection). It can be solved, to some extent, with elementary Punctuation marks are also identified.
regular grammars
Linearization is the process of giving a hyper text link to
Text detection is localize [8] the text areas from any kind of give the user a quick overview of the page. Then the TTS
printed documents. Most of the previous researches were system will help to read out the linearized data.This feature
concentrated on extracting text from video. We aim at helps in selecting the text and reading and also to list the
developing a technique that work for all kind of documents links in the hyper text.
like newspapers, books etc
C. Phonetic Analysis
B. Text Normalization and Linearization
Phonetic Analysis converts the orthographical symbols
Text Normalization is the transformation of text to into phonological ones using a phonetic alphabet. Basically
pronounceable form. Text normalization is often performed known as “grapheme-to-phoneme” conversion.
before text is processed in some way, such as generating
Phone is a sound that has definite shape as a sound wave.
synthesized speech or automated language translation. The
Phone is the smallest sound unit. A collection of phones that
main objective of this process is to identify punctuation
constitute minimal distinctive phonetic units are called
marks and pauses between words. Usually the text
Phoneme. Number of phonemes is relatively smaller than the
normalization process is done for converting all letters of graphemes, only 44.
lowercase or upper case, to remove punctuations, accent
marks , stopwords or “too common words “and other
diacritics from letters .
Phoneme Set (English)
Text normalization is useful for example for comparing
two sequences of characters which represented differently but Vowels (19) : /a/, /ae/, /air/, /ar/, /e/, /ee/, /i/, /ie/, /o/,
mean the same. “Don‟t” vs “Do not”, “I‟m” vs “I am”, /oe/, /oi/, /oo/,
“Can‟t” vs “cannot” are some of the examples. /ow/, /or/, /u/,
The main 4 phases of Text Normalization are
Published By:
Retrieval Number: A0435022112/2012©BEIESP Blue Eyes Intelligence Engineering
276 & Sciences Publication
International Journal of Soft Computing and Engineering (IJSCE)
ISSN: 2231-2307, Volume-2 Issue-1, March 2012
Published By:
Retrieval Number: A0435022112/2012©BEIESP Blue Eyes Intelligence Engineering
277 & Sciences Publication
Text To Speech: A Simple Tutorial
AUTHOR PROFILE
D.Sasirekha , completed her BSc (CS)-2003 in
Avinashlingam University for Women, coimbatore and
M.Sc (CS)-2005 in Annamalai University, Currently
doing Ph.D (PT) (CS) in Karpagam University,
Coimbatore and working as a staff in Avinashilingam
University for Women, Coimbatore,India.
Published By:
Retrieval Number: A0435022112/2012©BEIESP Blue Eyes Intelligence Engineering
278 & Sciences Publication