TTS Notes
- Determine the location of all punctuation marks in the input text
- Decide their significance based on the sentence and paragraph structure of the text

- Handle a large range of text issues
- Include abbreviations and acronyms
Abbreviations must be expanded to full words, but not always; whether to expand depends on context.
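The context dependence of abbreviation expansion can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration: the tiny lexicon, the function name `expand_abbreviations`, and the single rule used (an ambiguous abbreviation followed by a capitalised word is read as a title, otherwise as a street term). A real TTS front end would use a much larger lexicon and richer context.

```python
# Hypothetical toy lexicon. Each ambiguous abbreviation maps to
# (expansion when the next word is capitalised, expansion otherwise).
AMBIGUOUS = {
    "Dr.": ("Doctor", "Drive"),
    "St.": ("Saint", "Street"),
}
# Abbreviations with a single reading are expanded unconditionally.
UNAMBIGUOUS = {"etc.": "et cetera", "approx.": "approximately"}


def expand_abbreviations(text: str) -> str:
    """Expand known abbreviations using the following token as context."""
    tokens = text.split()
    out = []
    for i, tok in enumerate(tokens):
        nxt = tokens[i + 1] if i + 1 < len(tokens) else ""
        if tok in AMBIGUOUS:
            before_capital, otherwise = AMBIGUOUS[tok]
            out.append(before_capital if nxt[:1].isupper() else otherwise)
        elif tok in UNAMBIGUOUS:
            out.append(UNAMBIGUOUS[tok])
        else:
            # Unknown tokens (including acronyms such as "NASA") are kept as-is.
            out.append(tok)
    return " ".join(out)


print(expand_abbreviations("Dr. Smith lives on Elm Dr. near St. Mary church"))
```

The one-token lookahead is deliberately naive: it would misread "Elm St. Then" as "Saint", which is why practical systems also look at the preceding word and at sentence boundaries.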
ISSUE OF TEXTS
1. Non-delimited words
2. Expansion of digit sequences
3. Pronunciation of ordinary words and names needs morphological analysis

Text Markup Interpretation
1. Controls how a TTS engine renders its output
2. Makes TTS output sound intelligent (combined with emotion)
3. Speech Synthesis Markup Language (SSML), an XML-based markup standard for speech synthesis

LINGUISTIC ANALYSIS
PROSODIC ANALYSIS
According to Dutoit (1997), prosody refers to audible changes in pitch, loudness and syllable length.
For other authors, prosody relates to speech timing, such as rhythm and speech rate.
Prosody operates on longer linguistic units than phones and is hence called the study of suprasegmental phenomena.
TONES
- Significant contrasts between words signalled by pitch differences. May be lexical, as in Mandarin, or grammatical, as in some African languages.

Intonation
- Rise and fall of the voice during speaking
- Used to emphasize focus, new information, relationships between words, finality, and the segmentation of sentences into groups of syllables
LINEAR PREDICTIVE CODING SYNTHESIS
- Used for encoding, transmitting and decoding a digital signal by reducing redundant information
- Estimates the vocal tract resonances from the signal waveform and removes their effect from the speech signal (inverse filtering) to get the residue / source signal
- LPC synthesis is the inverse process of LPC analysis
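The analysis and synthesis steps above can be sketched in plain Python. This is a minimal illustration, not a production codec: it assumes the autocorrelation method with the Levinson-Durbin recursion (one common way to estimate LPC coefficients; the notes do not say which method is intended), and the function names, the test signal and the order of 2 are all choices made for this example.

```python
import random


def autocorrelation(x, max_lag):
    """Unnormalised autocorrelation r[0..max_lag] of the signal x."""
    n = len(x)
    return [sum(x[t] * x[t - lag] for t in range(lag, n))
            for lag in range(max_lag + 1)]


def levinson_durbin(r, order):
    """Solve for a = [1, a1, ..., ap] of A(z) = 1 + sum a_k z^-k."""
    a = [1.0] + [0.0] * order
    err = r[0]
    for i in range(1, order + 1):
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        k = -acc / err                      # reflection coefficient
        new_a = a[:]
        for j in range(1, i):
            new_a[j] = a[j] + k * a[i - j]
        new_a[i] = k
        a = new_a
        err *= 1.0 - k * k                  # remaining prediction error
    return a, err


def inverse_filter(x, a):
    """LPC analysis: residual e[n] = sum_k a[k] * x[n-k] (a[0] = 1)."""
    return [sum(a[k] * x[n - k] for k in range(len(a)) if n - k >= 0)
            for n in range(len(x))]


def synthesize(e, a):
    """LPC synthesis: y[n] = e[n] - sum_{k>=1} a[k] * y[n-k]."""
    y = []
    for n in range(len(e)):
        y.append(e[n] - sum(a[k] * y[n - k]
                            for k in range(1, len(a)) if n - k >= 0))
    return y


# Toy "speech" signal: a resonant second-order process driven by noise.
rng = random.Random(0)
x = []
for n in range(200):
    prev1 = x[n - 1] if n >= 1 else 0.0
    prev2 = x[n - 2] if n >= 2 else 0.0
    x.append(1.3 * prev1 - 0.6 * prev2 + rng.gauss(0.0, 1.0))

a, err = levinson_durbin(autocorrelation(x, 2), 2)
residual = inverse_filter(x, a)     # source signal after removing resonances
rebuilt = synthesize(residual, a)   # inverse process recovers the input
```

Because the synthesis filter is the exact inverse of the analysis filter, `rebuilt` matches `x` up to floating-point error, while `residual` carries less energy than `x`, which is the redundancy reduction the notes refer to.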
SYNTHESIS METHOD
2. HMM synthesis
DECODING PROCESS
- Chooses the optimal string of units for a given phonetic string, i.e. the one that best matches the desired prosody
- For unit selection, an objective function is used
- The quality of the unit string is dominated by
1. Spectral discontinuities at unit boundaries
2. Pitch discontinuities at unit boundaries
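As a rough illustration of the decoding step, the sketch below minimises an objective function with dynamic programming. The notes do not give the form of the objective function, so this example assumes the common split into a target cost (mismatch with the desired pitch) and a join cost (spectral and pitch discontinuity at each unit boundary). The `Unit` fields, the one-dimensional "spectral" feature, the weights and the unit names are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Unit:
    name: str
    pitch_start: float  # pitch at the left edge (Hz)
    pitch_end: float    # pitch at the right edge (Hz)
    spec_start: float   # toy 1-D spectral feature at the left edge
    spec_end: float     # toy 1-D spectral feature at the right edge


def target_cost(unit, desired_pitch):
    """How far the unit's mean pitch is from the desired prosody."""
    return abs((unit.pitch_start + unit.pitch_end) / 2 - desired_pitch)


def join_cost(left, right, w_spec=1.0, w_pitch=1.0):
    """Spectral plus pitch discontinuity at the boundary between two units."""
    return (w_spec * abs(left.spec_end - right.spec_start)
            + w_pitch * abs(left.pitch_end - right.pitch_start))


def select_units(candidates, desired_pitches):
    """Return (total cost, best unit sequence) via a Viterbi-style search.

    candidates[i] lists the database units available for position i.
    """
    best = [(target_cost(u, desired_pitches[0]), [u]) for u in candidates[0]]
    for i in range(1, len(candidates)):
        new_best = []
        for u in candidates[i]:
            # Cheapest way to reach u from any path ending at position i-1.
            cost, path = min(
                ((c + join_cost(p[-1], u), p) for c, p in best),
                key=lambda cp: cp[0],
            )
            new_best.append((cost + target_cost(u, desired_pitches[i]),
                             path + [u]))
        best = new_best
    return min(best, key=lambda cp: cp[0])


candidates = [
    [Unit("a1", 120.0, 120.0, 0.0, 1.0)],
    [Unit("b1", 120.0, 120.0, 9.0, 9.0),   # perfect pitch, rough spectral join
     Unit("b2", 118.0, 118.0, 1.0, 1.0)],  # slightly flat, smooth join
]
cost, path = select_units(candidates, [120.0, 120.0])
print(cost, [u.name for u in path])
```

In this toy case the search prefers `b2` over `b1`: its small pitch mismatch costs less than the large spectral jump `b1` would create at the boundary, which mirrors the point that boundary discontinuities dominate perceived quality.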