Professional Documents
Culture Documents
Speech Signal Content
Speech Signal Content
Speech signals are sound signals, defined as pressure variations travelling through the air. These
variations in pressure can be described as waves (sound waves), these particular waves have various
form depends on the how to produce the sound, these variations of sounds have their own meaning
depend on what sound of the speaker produce trough their speech organ, and this meaningful sound
that catches by our hearing, we known it as spoken language
A speech signal carries linguistic information for the sharing of information and ideas.
Speech signals also contain very rich information that can be obtained including:
words (spoken words), speaker identity, accent, style of speech, expression, emotion, gender, age etc
Created at the Vocal cords, Travels through the Vocal tract, and Produced at speakers mouth and gets
to the listeners ear as a pressure wave.
The vocal tract is the cavity in human bodies where the sound produced at the sound source
The vocal tract
The vocal tract, including the larynx, pharynx and oral cavities, have a great effect on the timbre of the
sound.
Vocal consonant, voiced unvoiced
The resonance of the vocal tract, especially the two lowest resonances, known as the formants F1 and
F2 (see figure below).
Resonance is when a harmonic from the vocal folds lines up with the pitch of the air (formant) in the
vocal tract.
Resonansi terjadi ketika suara menggetarkan objek lainnya dengan frekuensi yang sama
the resonance structure can be easily examined by drawing an "envelope" above the spectrum, that is,
to draw a smooth line which goes just above the spectrum, as seen on the figure on the right. We thus
obtain the spectral envelope, which characterizes the macro-shape of the spectrum of a speech signal,
and which is often used to model speech signals.
Signal amplitude or intensity over time is another important characteristic and in its most crude form
can be the difference between speech and silence.
Furthermore, there are phonemes characterized by their temporal structure; in particular, stop and
plosive-consonants, where airflow is stopped and subsequently released (e.g. /p/, /t/ and /k/). While the
stop-part is not prominently audible, it is the contrast of a silence before a burst of energy which
characterizes these consonants.
Kuat atau lemahnya suatu sumber bunyi bergantung pada simpangan getar atau amplitudo. Makin besar
amplitudo, maka akan semakin keras bunyi yang dihasilkan.
Besides their acoustic characteristics, speech signals can be characterized in terms of their linguistic
structure. Linguistic structure refers to the recurring regularities in spoken language as described by
linguistic theories, such as what are the basic building blocks of speech and how they organized.
Linguistic descriptions provide a means for systematic interpretation, conceptualization, and
communication of speech-related phenomena.