Professional Documents
Culture Documents
Thesis - M3
Thesis - M3
*******name*********
IT*******
Specializing in *********
Sri Lanka
month year
topic
******Name*****
IT*****
Dissertation submitted in partial fulfillment of the requirements for the Bachelor of Science
Special (Honors) in Information Technology
Sri Lanka
month year
DECLARATION
I declare that this is my own work, and this proposal does not incorporate without
acknowledgment any material previously submitted for a degree or diploma in any other
university or institute of higher learning, and to the best of our knowledge and belief, it does
not contain any material previously published or written by another person except where the
acknowledgment is made in the text.
Signature: Date:
i
ABSTRACT
This component seeks to improve the children's overall communication abilities and targets
children with impairments who have difficulty speaking vocally. The target audience for this
component includes children with impairments who fall into this category. The kid will first
hear an audio recording of how to properly pronounce the name of the item, and then they
will watch a demonstration of how to spell the word using an image of the item. This will
take place on its own. Students will have the ability to practice pronouncing words in a step-
by-step manner on a platform that our group worked on constructing. At the same time, other
users will be able to determine whether or not they are correctly speaking the phrase. After
that, a specialized direct voice recognition software program that is employed to detect the
students' pronunciation of the word is utilized in order to record the students' pronunciation
of the word that is being stated by the application. The algorithm will first perform an audio
check to ensure that the pronunciation is accurate before moving on. If a student earns a
grade that is lower than fifty percent, the system will suggest a few simple things to say to
them to help them feel better. These things can be said at any time during the grading
process. One of the most important aspects of teaching is paying attention to the student's
vocal reaction and helping the learner improve their vocabulary so that they can express their
thoughts more effectively.
ii
ACKNOWLEDGEMENT
iii
TABLE OF CONTENT
DECLARATION..........................................................................................................................i
ABSTRACT................................................................................................................................ii
ACKNOWLEDGEMENT............................................................................................................iii
TABLE OF CONTENT................................................................................................................iv
LIST OF TABLES.......................................................................................................................vi
LIST OF FIGURES.....................................................................................................................vi
LIST OF ABBREVIATIONS........................................................................................................vii
LIST OF APPENDICES..............................................................................................................vii
1 INTRODUCTION...............................................................................................................1
1.1 BACKGROUND & LITERATURE REVIEW....................................................................7
1.2 RESEARCH GAP......................................................................................................13
1.3 RESEARCH PROBLEM.............................................................................................14
1.4 RESEARCH OBJECTIVES..........................................................................................15
1.4.1 MAIN OBJECTIVE...........................................................................................15
1.4.2 SPECIFIC OBJECTIVES.....................................................................................15
2 METHODOLOGY............................................................................................................16
1.1. System Architecture..............................................................................................16
2.1 Development Process............................................................................................17
2.1.1 Tools and technologies..................................................................................18
2.2 Component............................................................................................................19
2.3 COMMERCIALIZATION ASPECTS OF THE PRODUCT...............................................20
2.4 TESTING & IMPLEMENTATION..............................................................................21
2.4.1 Testing...........................................................................................................21
3 RESULTS & DISCUSSION................................................................................................24
3.1 RESULTS.................................................................................................................24
3.2 RESEARCH FINDINGS.............................................................................................25
3.3 DISCUSSION...........................................................................................................26
4 SUMMARY OF EACH STUDENT’S CONTRIBUTION.........................................................27
5 CONCLUSION.................................................................................................................28
6 REFERENCES..................................................................................................................29
iv
7 APPENDICES..................................................................................................................30
v
LIST OF TABLES
Table 2.1 TEST CASES........................................................................................................12
LIST OF FIGURES
Figure 1.1 VOKI APPLICATION........................................................................................10
Figure 1.2 FOTOBABBLE LOGO.......................................................................................11
Figure 1.3 AV Phonetics APPLICATION............................................................................12
Figure 2.1 Component Diagram............................................................................................17
vi
LIST OF ABBREVIATIONS
Abbreviations Description
LIST OF APPENDICES
vii
1 INTRODUCTION
Even while your eyes move quickly over the words, your brain seems to be able to
figure out what they mean nearly immediately. Writing gives us the ability to
generate and interpret words even when we do not have access to sound, which is an
important development considering that sound may have been the original mode of
communication. This method of assisting pupils in their academic pursuits has been
utilized for a significant amount of time by educators all around the world. The role
that visuals play in assisting people in remembering words hasn't received a lot of
attention from researchers in the scientific community, for the most part. During the
encoding process, the development of mental images occurs when visual information
is paired with verbal information. The significance of this image is communicated
through the use of both visual and verbal clues. The codes are processed and stored
in the brain in a manner that is dependent on the particular region of the brain that is
responsible for doing so. The user has the option of using the image in addition to the
word's translation in their native language while they are fetching a word. This
choice is available to them. Even if the written code is misplaced, it is still possible
to obtain the message in the form of a visual representation. There is a possibility
that the Dual Code Theory has a neurological basis. The writers came to the
conclusion that learning words just through reading and listening was not as
beneficial as learning them through images. It has been discovered that different
regions of the brain are responsible for the processing and storage of words and
images. As a consequence of this, concepts can be thought of as huge networks of
interconnected neurons in the brain. These networks have the capacity to maintain
and bolster the information that they possess collectively. [02]
When compared to other primates, human beings are far superior in terms of their
ability to pick up new languages. By the time they are in their early 20s, people who
speak their native language have a vocabulary that contains at least 20,000 basic
terms and the morphologically complicated variants of those words; however, other
estimates suggest that this number is actually much higher. The process of fast-
mapping is an illustration of how young infants are able to associate new words and
1
ideas within a short amount of time, which gives the appearance that they are
effortlessly and quickly learning new words. Due to the undeniable superiority of
young humans as word learners, researchers have been inspired to investigate the
process of word acquisition in newborns and children separately and to approach the
mechanics of word learning in a way that is distinct from how they approach the
acquisition of other components of mental computation.
Students take part in a variety of oral language activities designed to improve their
conceptual and vocabulary knowledge throughout the course of the school day. In
addition, students listen to stories and nonfiction materials being read aloud during
this time. Both all-inclusive reading programs and more specialist phonics and word-
recognition training programs include components that teach spelling and word
recognition, respectively. Many teachers instruct their pupils in phonics and word
recognition by utilizing a basal reading program that is supplied by the school
system. This is typically a curriculum that includes of materials that are appropriate
for the student's grade level, such as a guide for teachers and reading materials for
students, in addition to supplementary resources. When teaching children phonics
and word-recognition strategies, it is important to provide them with opportunities to
become comfortable with a variety of reading-related skills and concepts. These
reading-related skills and concepts include, but are not limited to, alphabetic
knowledge, phonemic awareness, sound-symbol relations, word-identification
strategies, spelling and writing connections, related reading practice, and reading
fluency. [03]
Users with and without disabilities can benefit from a variety of applications that are
available in voice recognition and speech-to-text software. Speech-to-text software
has been used to assist those who struggle with writing to increase their writing
production and to enable individuals who have physical limitations with an
alternative means of accessing a computer. Other applications include the use of
speech recognition for the study of other languages, voice-activated goods for the
blind, and many other applications in mainstream technology. Many well-known
applications in the customer service business have been pushed to the forefront of
innovation as a direct result of recent technological advancements. We have all made
2
use of voice recognition technologies in our day-to-day lives, and the majority of the
time, we have done so without even realizing it. A few examples of this include
voice-activated dialing on our mobile phones, automated phone menus and
directories, and integrated voice commands on smartphones. Voice recognition
software is used on a daily basis in the legal and medical fields by experts to
transcribe essential information and dictation notes[12]. Newer applications of the
technology include those in the military, navigation systems, speech recognition in
automobiles (such as Ford SYNC), "smart" homes designed with voice command
devices, and video games such as End War, which allow the player to give orders to
their troops using only their voice.
Students who have disabilities still face difficulties while using voice recognition
technology in its current form, despite the significant progress that has been made in
the field over the previous 20 years. In the beginning, students need to get used to
using speech-to-text, just like they would any other new piece of technological
equipment. This includes teaching the software to recognize their voices, becoming
familiar with a new mode of writing, comprehending the differences between
speaking and writing, and making corrections to errors that are found within the text.
It's possible that this will make the writing process much more frustrating for
students who already struggle with reading and writing, students who have learning
challenges, or students who are very young. Students need to become accustomed to
speaking more slowly so that the technology can more accurately record their
thoughts. Even though the software has improved, speech-to-text technologies are
not always capable of identifying the voices of young children. Students need to be
aware of the differences between speaking in conversation and speaking to write
since these two activities demand different skill sets. Speaking to write is an activity
that requires students to speak to write. This may be difficult for early authors
because they have not yet learned to differentiate between the two. When younger
kids use speech recognition technology, it may be more challenging for them to
begin differentiating between speaking and writing. Therefore, it is absolutely
necessary to combine the use of speech recognition technology with teaching on
writing skills, including brainstorming, drafting, and organizing[12].
3
The need for error correction and monitoring of words that were incorrectly detected
is an additional essential component that is included in the utilization of voice
recognition algorithms. Students need to be vigilant in looking for errors that the
program does not recognize because newer programs do not generate spelling errors
and improve when users correct words that they have incorrectly recognized. Even
though this step might be challenging for those with reading difficulties, an
application that is able to read text aloud to the user can be of great assistance to
them when it comes to editing and modifying their work.
4
handwriting or the requirement to transcribe thoughts while brainstorming, because
students are often able to write more quickly with speech recognition software. When
writers with learning difficulties are unclear about the right spelling of a term, they
will frequently skip over it. This can result in pieces of writing that are either too
brief, do not contain the necessary components, or do not accurately reflect the
student's genuine ability. Speech recognition and word processors have the ability to
alleviate some of these worries by enabling students to get their ideas down on paper
without having to worry about these or any other aspects of technical writing.
5
recognition capabilities have recently been added to a number of well-known foreign
language software applications specifically for the purpose of fulfilling this
requirement.
Ability gains in reading and writing at the fundamental level - In the domains
of reading and writing, voice recognition technologies have been demonstrated to be
useful as a form of remediation for kids who have learning difficulties, according to
research. Students are able to get valuable insight into crucial aspects of phonemic
awareness, such as sound-symbol correspondence, when the teacher let the students
to see the words on the screen as they dictate. The speech-to-text tool gives pupils a
firsthand demonstration of the relationship between the way a word looks and how it
sounds as they talk and watch their words appear on the screen as they do so. It is
believed that the effectiveness of speech recognition in resolving reading and
spelling deficiencies is due, at least in part, to the fact that the text can be presented
in two different ways: visually and audibly.
6
1.1 BACKGROUND & LITERATURE REVIEW
The group of researchers at the Georgetown University Medical Center scanned the
twelve individuals' brains using functional MRI. The visual word form area
(VWFA), which is situated on the surface of the brain behind the left ear, was the
primary focus of the investigation by the researchers. The fusiform face area is the
area that corresponds to the right hemisphere of the VWFA. When it comes to young
children and people who are illiterate, both the VWFA and the fusiform face area
respond to faces. When reading, word recognition takes place in the VWFA area of
the brain. The fluctuation of the audio signal over time is analogous to the changes
that occur in the level of activity in the brain over time. Electrocorticograms (ECoG)
were recorded from the brain activity of sixteen individuals by Lorenzo Magrassi, a
neurosurgeon at the University of Pavia in Italy, and his team. The researchers were
able to record people's brain activity directly from the surface of Broca's region
regardless of whether or not they read the text out loud .[01]
Children who have difficulties communicating through speech can benefit from the
development of fundamental language skills as well as other means of expression
and communication by using these applications, which are enjoyable, educational,
and simple to access. It is possible to communicate in a variety of ways thanks to the
availability of tools such as text-to-speech converters, visual communication
platforms, news websites designed specifically for people with special needs and
learning issues, notetaking and writing applications, and text-to-speech tools. This
article contains additional tactics that can be used to empower students who have
difficulties with their speech.
Words that appear often throughout written material are referred to be high-
frequency words. The top 100 most common words make up around half of the
information we take in, and the top 25 most common terms account for the remaining
one third (Fry & Kress, 2012). A child who is able to recognize the 100 most
common keywords has the foundation necessary to read the majority of a typical
work. Because high-frequency words are so common, beginning readers should learn
to recognize them. In many different subjects, students are shown graphic
7
representations of these terms that are used frequently. Youngsters can be helped to
restart this reading strategy, which is how experienced readers comprehend written
words that are known to them (Ehri, 2005). This can be accomplished by explicitly
introducing high-frequency sight words to the children .[04]
App for English articulation and pronouncing is the name of a mobile application
that can help improve your English pronunciation and articulation. It is possible that
it will teach letter sounds as well as word pronunciation. There are approximately a
thousand words to target that have 22 letter sounds. The software allows kids to
engage with letter sounds in three ways: word games, sentences, and stories.
Children will learn correct pronunciation, letter and word recognition, and
fundamental sentence structure while participating in these activities. Students have
the opportunity to record and listen to their own voices as they pronounce letters and
words coupled with narration when using the Articulation Station. As a consequence
of this, students develop their self-awareness and may become aware of errors in
their own pronunciation. There are two-word activities, and each activity has
multiple levels. You have the option of concentrating on the initial, medial, or final
sound of the letter. It is possible that it will teach letter sounds as well as word
pronunciation. There are approximately a thousand words to target that have 22 letter
sounds. The software allows kids to engage with letter sounds in three ways: word
games, sentences, and stories. Children will learn correct pronunciation, letter and
word recognition, and fundamental sentence structure while participating in these
activities. Students have the opportunity to record and listen to their own voices as
they pronounce letters and words coupled with narration when using the Articulation
Station. As a consequence of this, students develop their self-awareness and may
become aware of errors in their own pronunciation. There are exercises involving
two words, such as flashcards or matching, and each activity has a variety of
difficulty levels. You have the option of concentrating on the initial, the middle, or
the last letter sound .[05]
8
but, when the situation calls for it, images can also be utilised to convey abstract
ideas such as feelings. Visual dictionaries, in contrast to traditional dictionaries, are
not sorted alphabetically but rather thematically, and an image is associated with
each theme in the dictionary. Traditional dictionaries also do not contain visual
dictionaries. Visual dictionaries are useful resources for a variety of people,
including students, teachers, translators, and even parents. Visual dictionaries are
helpful resources that can be applied to a variety of contexts. The use of these
dictionaries is beneficial to people of all ability levels, so everyone can take
advantage of them .[06]
The fact that children need to be taught to correlate the word with the meaning
represented by the picture it includes could be one of the reasons for this. This is a
step that was not explicitly done in many of the research that were conducted on
picture-embedded words. Learners have been assisted in making an association
between the word and the concept represented by the picture through the use of a
range of strategies in previous research in related fields. One method entails
explaining to children in no uncertain terms that a picture that is shown alongside a
word is a representation of that word. Lang and Solman discovered that this method
led to increased learning in comparison to situations in which the association was not
made explicitly clear[07]. The second strategy is called picture-to-text matching, and
it involves asking children to actively match words with the pictures that represent
them. According to Fossett and Mirenda, this method is more effective than just
teaching using pictures next to their corresponding words[08].
9
and utilized individual sessions. They found that some participants learned terms
more rapidly using the picture-matching approach as opposed to the text-alone
strategy; nevertheless, neither the picture-matching method nor the picture-fading
method resulted in greater overall mastery than text alone did in the final posttest.
The fourth strategy is called "word morphing," and it involves using a sequence of
images or animations in which a picture transforms into a word. This provides a
gradual transition that is meant to assist the youngster in developing a link between
the two concepts. Miller and Miller discovered that learning was enhanced when
animations transitioned from a picture to a word rather than when the picture was
placed next to the word[11]. Later on, Sheehy found that a technique that was
comparable to this one led to greater learning than a teaching method that focused
only on words.
Voki is also available either online or as an app for Apple or Android devices, Voki
gives users the ability to select a "talking head" character, modify its appearance and
clothing, and then add narration by either recording their own voices or typing a
script to be read by an artificial voice. Voki can be accessed either through the
website or downloaded from the app store. Voki is aimed primarily for educators
who wish to produce brief instructional videos, but students are just as capable of
making effective use of the platform to convey either information or narratives to
their peers. A shorter number of character options are available in the free version,
whereas a greater variety of options and a longer recording period are available in the
commercial
editions[13].
12
Webster Learner's Dictionary and the Longman Dictionary of Contemporary English
Online, as well as any dictionary app or stand-alone electronic dictionary, can
provide a practical model of the correct pronunciation of individual words. These are
very helpful for students who want to independently verify how correctly they are
pronouncing a new word.
13
14
1.2 RESEARCH GAP
Features
Systems View Pronounce View Pronounce Students are Analysis of
and selected and selected allowed to pronunciation
select picture elect word repronounce difference
pictures name words words and
pictures
Proposed
System
Learn With
Rufus
Montessori
Numbers
Proloquo2Go
Pumpic App
Video
Scheduler
The
Sounding
Out Machine
Super Why
15
1.3 RESEARCH PROBLEM
16
1.4 RESEARCH OBJECTIVES
develop the system to analyze the voice response of the student and assist in
vocabulary improvement
17
2 METHODOLOGY
18
2.2 Development Process
SciPy
SciPy is an open-source library written in Python that is utilized for the purpose of
resolving mathematical, scientific, engineering, and technical issues. A large variety
of high-level Python commands are made available to users, enabling them to alter
data as well as visualize it. NumPy is an extension of Python that serves as the
foundation for SciPy. It's possible to say "Sigh Pi" when you're referring to SciPy.
Numpy is an extension for Python that serves as the foundation for SciPy, which is a
collection of mathematical methods and convenience functions. It does this by
presenting the user with high-level commands and classes that can be used to
manipulate and visualize data. This provides a huge boost to the capabilities of the
interactive Python session. SciPy is a scientific Python open source library that is
published under the BSD licensed umbrella and is used to carry out mathematical,
scientific, and engineering computations. The name is pronounced as "Sigh Pi."
19
Figure 2.2 SCIPY
20
NumPy
NumPy is the essential package for carrying out computational tasks in the scientific
community using Python. A multidimensional array object, various derived objects
(such as masked arrays and matrices), and an assortment of routines for fast
operations on arrays are all provided by this Python library. These operations include
mathematical, logical, shape manipulation, sorting, selecting, I/O, discrete Fourier
transforms, basic linear algebra, basic statistical operations, random simulation, and a
great deal more. The ndarray object is the fundamental building block of the NumPy
library. This wraps n-dimensional arrays of similar data types, and for optimal
efficiency, many operations are carried out in compiled code.
The term "vectorization" refers to the absence of any explicit looping, indexing, or
other similar operations in the code. These operations are, of fact, still taking place,
but they are doing so "behind the scenes" in optimized, pre-compiled C code. NumPy
provides complete support for an object-oriented methodology, beginning once more
with the ndarray data structure. As an illustration, ndarray is a class that includes a
variety of different methods and characteristics. As a result of the fact that many of
its methods are mirrored by functions in the NumPy namespace's outermost
namespace, the programmer is free to write code in whichever paradigm best suits
their needs. Because of this versatility, the NumPy array dialect and the NumPy
ndarray class have become the standard language for the interchange of multi-
dimensional data in Python.
21
manipulating those arrays. Due to the fact that it enables users to execute mathematical and
logical operations on arrays, it is one of the Python packages that is utilized the most for
scientific computing. NumPy is a scripting language that is written in Python. In 2005,
Travis Oliphant developed NumPy by extensively altering the predecessor program Numeric
and adding features from the rival program Numarray. The forerunner to NumPy was a
program called Numeric, which was created in 1995 by Jim Hugunin with the assistance of a
number of other programmers. A developer for NumPy named Travis Oliphant was
successful in uniting the community in support of a single array package; thus, he moved the
functionality of Numarray to Numeric and released NumPy 1.0 in the year 2006. Now that
we have an understanding of What is NumPy in Python, let's look over its history. Let's
move on to the next topic and talk about why we use it. Python provides us with lists that can
function as arrays, however the processing of these lists is rather slow. NumPy's array object
should be up to 50 times faster than Python's standard list object, according to the project's
stated goal. A diverse set of mathematical procedures based on arrays are all within the
scope of its applicability. It extends Python with advanced analytical structures that ensure
fast computations with arrays and matrices, as well as a large library of high-level
mathematical functions that work with these arrays and matrices. Additionally, it adds fast
computations with arrays and matrices to the capabilities of the Python programming
language. Unlike lists, NumPy arrays are stored in memory in a single continuous area,
making it easier for
programs to swiftly
access and change the
data contained
within them.
22
23
Matplotlib
Matplotlib is a data visualization and graphical plotting toolkit that can be used with
Python and its numerical extension NumPy. It is compatible with multiple platforms.
As such, it provides a feasible open source alternative to the software package
MATLAB. Developers also have the option of utilizing the Application
Programming Interfaces (APIs) provided by matplotlib in order to incorporate plots
within GUI programs.
Because of the way a Python matplotlib script is organized, the generation of a visual
data plot can typically be accomplished with only a few lines of code in the vast
majority of cases.
Two application programming interfaces, such as and, are covered by the matplotlib
scripting layer. The matplotlib library sits atop the pyplot application programming
interface (API), which is a hierarchy of Python code objects. pyplot in addition to an
Object-Oriented Application Programming Interface (OO API) collection of objects
that may be assembled with more flexibility than pyplot. This application
programming interface (API) gives users access to the backend layers of Matplotlib.
The pyplot API offers a user-friendly stateful interface that is modeled after
MATLAB. In point of fact, matplotlib was first conceived of and developed as an
open-source alternative to MATLAB. It is considered to be more difficult to use,
despite the fact that the OO API and its interface offer greater levels of customization
and power than pyplot.
Uncompiled source files for Matplotlib are also available for download. In order to
compile from source, it is necessary for your local system to have the correct
compiler for your operating system, as well as all dependencies, setup scripts,
configuration files, and patches that are necessary. Because of this, the installation
can end up being really difficult. You may also think about utilizing the ActiveState
Platform, which will automatically generate matplotlib from its source code and
package it for your operating system. A user interface (UI) and menu structure are
built whenever matplotlib is used to make a plot. The user interface allows for
customization of the plot, as well as the ability to pan, zoom, and toggle different
24
sections. Data manipulation and analysis are the primary purposes for which
matplotlib makes use of the Pandas package. Dataframe is the name of the object that
represents a 2D data table that is stored in memory by Pandas. pandas, on the other
hand, is not a mandatory dependency of matplotlib like numpy is.
25
2.3 Component
Identify and develop the pronunciation components of words and images by having
students pronounce them and then analyzing the differences between actual
pronunciation and the pronunciation of the students Additionally, provide a word and
image selection window, as well as a pronunciation component, to aid in the
knowledge advancement of kids with special needs. As part of this component,
tailored direct speech recognition is developed to detect students' pronunciation of
words pronounced by the program, as well as a system to analyze students' voice
responses and assist them in improving their vocabulary.
26
2.4 COMMERCIALIZATION ASPECTS OF THE PRODUCT
It may be necessary for individuals who have trouble reading or who have visual
impairments to modify the display of text in order to make it simpler to read. Their
capacity to change the appearance of the text is restricted when it is presented as a
picture of the text. When you write content with the use of an online content editor,
the formatting will be done for you automatically. If you believe that you require text
that deviates from the style, you should be able to adjust the style for that text using
the formatting tools that are supplied by online content editors. You should only ever
supply a picture of text rather than actual text in extremely unusual instances, such as
when a logo is being used. Screen-reader users will be able to view the text even if
you do this, but you will be required to give the exact same text as the image's text
alternative.
27
2.5 TESTING & IMPLEMENTATION
2.5.1 Testing
Test Test case Test input Values Test Expected Actual Test
case Name Procedure Output Result Resul
ID t
T_1
T_2
28
T_3
T_4
29
T_5
T_6
30
3 RESULTS & DISCUSSION
3.1 RESULTS
31
3.2 RESEARCH FINDINGS
32
3.3 DISCUSSION
The proposed system's architecture dictates that it be hosted on a server so that it may
always be accessed by end users. This server not only provides hosting for the hosted
system but also a domain name and database space. If users try to access the system
using that domain name, they will be successful. Keep an eye out for details
regarding any actions taken within the system that are recorded here. All the parents
need is the domain name to connect to the system from anywhere in the world. If a
person logs in and looks at their children's activities, the data will be stored in the
database for future use. Since this is a novel concept for a system, it would be
excellent if parents and students with unique needs served as beta testers.
For children who have Learning Disabilities as well as children who have other kinds
of disabilities, there is still a need for additional research on the effectiveness of
speech recognition. However, technological advancement is continuing, and new
solutions are being developed for many of the difficulties that have been experienced
in the past. For instance, many of the most recent versions of speech recognition
software now feature voice profiles for youngsters, which means that they are getting
more accurate at differentiating between words uttered by users who are younger. It
makes sense for students to obtain some experience with speech recognition as voice
recognition technology becomes increasingly integrated into the day-to-day
operations of several industries, including the military, the medical field, and the
judicial system. Some technologies that were originally developed for users with
disabilities have since made the transition into mainstream technology, becoming
something that all of us have grown to rely on in our day-to-day lives. As a result of
this, experts in the technology industry are beginning to believe that all children
should obtain a technology education that represents the future of human-computer
interactions, which they predict will be predominantly accomplished through the use
of voice and touch.
33
4 SUMMARY OF EACH STUDENT’S CONTRIBUTION
34
5 CONCLUSION
The current state of pronunciation is far from ideal for numerous reasons, and it
would be unfair to place the blame on any one set of people, whether teachers,
pronunciation experts, or academics. The study of language pronunciation, despite its
theoretical and practical importance, has fallen out of favor in recent decades.
Incorporating pronunciation into oral skills and other classes, and focusing on both
segmental and supra-segmental features, when possible, is something teachers should
do whenever they have the chance and time to do so, with a particular eye on
students with special needs of varying levels and abilities. It's not enough to simply
focus on how words are spoken in isolation. Instead, it should be recognized as an
essential component of conversation and incorporated into academic work. Sounds,
syllables, stress, and intonation are all components that teachers can emphasize to aid
their students. Students will have a better idea of where to put their efforts if they
have a foundational understanding of how these parts work. Teachers can use class
time to raise students' awareness of pronunciation and provide practice with correct
pronunciation. Because altering one's pronunciation habits is laborious and calls for
the student's time and effort, teachers should urge their students to carefully examine
their own pronunciation and practice speaking skills as often as possible in and out of
the classroom. Gaining proficiency in listening to and speaking in both formal and
casual settings can help students gain confidence in their interactions with native
speakers both within and outside of the classroom. As a result, it's safe to say that
improving students' pronunciation can have a significant impact on their ability to
communicate once they've put in the time and effort to do so.
35
6 REFERENCES
[01] Sutherland, S., 2022. What Happens in the Brain When We Read?.
[02] Repetto, C., Pedroli, E. and Macedonia, M., 2017. Enrichment Effects of Gestures and
Pictures on Abstract Words in a Second Language. Frontiers in Psychology, 8.
[03] Reading Rockets. 2022. Phonics and Word Recognition Instruction in Early Reading
Programs: Guidelines for Children with Reading Disabilities. [online] Available at:
<https://www.readingrockets.org/article/phonics-and-word-recognition-instruction-early-
reading-programs-guidelines-children-reading> [Accessed 19 January 2022].
[04] Strauber, C., Sorcar, P., Howlett, C. and Goldman, S., 2020. Using a picture-embedded
method to support acquisition of sight words. Learning and Instruction, 65, p.101248.
[05] Common Sense Education. 2022. Articulation Station Review for Teachers | Common
Sense Education. [online] Available at:
<https://www.commonsense.org/education/app/articulation-station> [Accessed 22 January
2022].
[06] 7ESL. 2022. Picture Dictionary | English Visual Dictionary: 150+ Useful Topics •
7ESL. [online] Available at: <https://7esl.com/picture-dictionary/> [Accessed 23 January
2022].
[07] https://doi.org/10.1111/j.2044-8279.1979.tb02408.x
[08] https://doi.org/10.1016/j.ridd.2005.05.006
[09] https://doi.org/10.1111/j.1365-2788.1984.tb01012.x
[10] https://doi.org/10.1002/jaba.354
[11] Miller, A. and Miller, E.E., 1971. Symbol accentuation, single-track functioning and
early reading. American Journal of Mental Deficiency, 76(1), pp.110-7.
[12] https://www.readingrockets.org/article/speech-recognition-learning
[13] https://www.voki.com/
36
37
7 APPENDICES
38