Professional Documents
Culture Documents
Research Paper Format
Research Paper Format
READTRIX
A SOFTWARE THAT CAN CONVERT IMAGES INTO SPEECH IN JUST
ONE SCAN.
CLERILYN P. BADUA
Research Teacher II
Research Adviser
S.Y. 2022-2023
ZARAGOZA NATIONAL HIGH SCHOOL
San Rafael, Zaragoza, Nueva Ecija
INTRODUCTION
Reading comprehension is a demanding task that requires multiple cognitive and linguistic
processes (Nation, 2019). According to a World Bank (WB) report, 91% of ten-year-old Filipino
children still have difficulty reading simple texts, indicating that the country's educational quality
is deteriorating (Ph, 2023). Huang et al. (2019) said that reading comprehension is an essential
part of how people read and interpret the text; however, it has received less attention in the
earlier literature on machine reading. In the words of Hussain and Omar (2020), developers
nowadays do not prioritize people with special needs who are visually impaired as much as they
do people who can see. The following paper seeks to close this knowledge gap. Children who
have dyslexia also have difficulty writing compared to their peers with similar reading skills.
(APA PsycNet, n.d.) Consequently, the significance of these two skills in literacy instruction
needs to be emphasized. Thus, they would benefit from a device that can scan all types of textual
•Is there a discernible difference in comprehension between the proposed software and Braille
materials?
•Is there a difference in speed between the proposed Software and Braille materials?
ZARAGOZA NATIONAL HIGH SCHOOL
San Rafael, Zaragoza, Nueva Ecija
•Is there a significant difference in comprehension between people with dyslexia who will use
the software and those who will read without the software?
•Is there a significant difference in the attention span of people with dyslexia who use text-to-
HYPOTHESIS
•There is no significant difference between the software for Visually Impaired People and Braille
•There is no difference between the said software and Braille materials in terms of speed in
reading
•There is no significant difference in comprehension between people with dyslexia who will use
the software and those who will read without the software.
•There is no significant difference in the attention span of people with dyslexia who use text-to-
The findings of this study will redound to the benefit of society, considering that a lack of
reading comprehension is such a prominent problem in the country. The limitations and lack of
assistance for those with vision impairments are also one of the primary reasons for the
importance of this study. Accessibility features enable people with low vision to be more
independent. The application aids in the extraction and conversion of textual information
embedded in images into speech. It can be used to extract text from PDF files and convert it to
speech. This app, among other things, improves the accessibility of online digital content and
makes it easier for people with visual impairments. It will benefit not only people with eye
conditions but also children with dyslexia. Not only will the software fill the gap, but it will also
The participants in this study will be polled about their experiences with the software and Braille
materials. It will also include a survey of children and adults who have dyslexia. The software will not
function if a word's spelling is incorrect. If the photograph is unclear, it will be unable to read it. The
study restricts its coverage to visually impaired people and people suffering from dyslexia because its
primary goal is to benefit those with visual impairments and dyslexic patients' comprehension. This will
concentrate on the integration of the proposed assistive software and Braille materials.
ZARAGOZA NATIONAL HIGH SCHOOL
San Rafael, Zaragoza, Nueva Ecija
DEFINITION OF TERMS
Braille - a system of touch reading and writing for blind persons in which raised dots represent
Canny edge detection - a technique to extract useful structural information from different vision
objects.
Bilateral Filter - a non-linear, edge-preserving, and noise-reducing smoothing filter for images.
Festival TTS - The Festival Speech Synthesis System is a general multi-lingual speech synthesis
system
ZARAGOZA NATIONAL HIGH SCHOOL
San Rafael, Zaragoza, Nueva Ecija
METHODOLOGY
Materials
ZARAGOZA NATIONAL HIGH SCHOOL
San Rafael, Zaragoza, Nueva Ecija
To program the software, the researchers will need a computer with a large storage capacity that
can accept all of the codes. To use that application, you must have a phone with a built-in
camera. A bilateral filter is required, which the researchers can obtain via Phyton. A Canny Edge
detector, which can be found on Phyton, is also required. To convert an image to text, OCR, or
optical character recognition, will be used. Tesseract, a software system, will also be used, as
Methods
Image capture: the built-in camera captures photos of the text. The quality of the captured image
removal, warping and cropping, and thresholding are all part of the image pre-processing
ZARAGOZA NATIONAL HIGH SCHOOL
San Rafael, Zaragoza, Nueva Ecija
procedure. Since many OpenCV functions require a grayscale image as an input parameter, the
image will be transformed into a grayscale. A bilateral filter is used to remove noise. Canny edge
detection will be conducted on the grayscale image to improve contour detection. The image will
be warped and cropped to follow the contours. This allows us to recognize and extract only the
text-containing zone while removing the unnecessary backdrop. Finally, thresholding is used so
that the image resembles a scanned document. This is done so that the OCR can transform the
depicted in the diagram above (Fig. 3). The picture pre-processing modules and OCR are the first
blocks. It transforms the previously processed.png image into an a.txt file. Tesseract OCR is
being used.Text-to-speech conversion: The voice processing module is the second block. It turns
the text file into an audio file. The text is transformed into speech using Festival TTS, a speech
synthesizer.
ZARAGOZA NATIONAL HIGH SCHOOL
San Rafael, Zaragoza, Nueva Ecija
Table 1: Research Design for Communicative Pad for Visually Impaired People and Braille Text
printed material))
Braille page of printed material)) page of printed material)) page of printed material))
reading (one (1) page of reading (one (1) page of reading (one (1) page of
Table 1
Table 1 above shows the experimental of the Readtrix and Braille Text in terms of Time Spent
The table shows the first Device used as Readtrix, meanwhile the second Device used the Braille
Text.
The Readtrix and Braille Text will be used in a controlled environment with the test subjects
reading one (1) page of printed material. Each device will be observed three (3) times.
Table 2: Research Design for Readtrix and Traditional Way of Reading of People with
Dyslexiain terms of Comprehension and Time Spent Reading (Text Subject experience)
Traditional (1) page of printed (1) page of printed (1) page of printed
(Dyslexia (speed or time spent (speed or time spent (speed or time spent
Patients) reading (one (1) page of reading (one (1) page of reading (one (1) page of
Table 2
Table 1 above shows the experimental of the Readtrix and Traditional Way of Reading of People
with Dyslexia in terms of Time Spent Reading and Comprehension of the Reader (Text Subject
experience).
The table shows the first Device used as Readtrix, meanwhile the second Device used the
The Readtrix and will be used in a controlled environment with the test subjects reading one (1)
page of printed material. Each device will be observed three (3) times.
ZARAGOZA NATIONAL HIGH SCHOOL
San Rafael, Zaragoza, Nueva Ecija
BIBLIOGRAPHY
Turner, A. (2023, May 1). How Many People Have Smartphones Worldwide (May 2023).
BankMyCell. https://www.bankmycell.com/blog/how-many-phones-are-in-the-
world#:~:text=How%20Many%20Mobile%20Connections%20Are%20There%20Worldwide
%3F&text=According%20to%20GSMA%20real%2Dtime,by%20UN%20digital%20analyst
%20estimates.
Sueq. (n.d.). Low Vision Can’t Be Fully Corrected With Glasses or Contacts. Vision North Eye
Care. https://www.visionnortheyecare.com/eye-care-services/low-vision/
Tournier, V., Topham, C. M., Gilles, A., David, B., Folgoas, C., Moya-Leclair, E., Kamionka,
E., Desrousseaux, M., Texier, H., Gavalda, S., Cot, M., Guémard, E., Dalibey, M., Nomme, J.,
Cioci, G., Barbé, S., M, C., André, I., Duquesne, S., & Marty, A. (2020). An engineered PET
depolymerase to break down and recycle plastic bottles. Nature, 580(7802), 216–219.
https://doi.org/10.1038/s41586-020-2149-4
Snowling, M. J., Hayiou-Thomas, M. E., Nash, H. M., & Hulme, C. (2020). Dyslexia and
Developmental Language Disorder: comorbid disorders with distinct effects on reading
comprehension. Journal of Child Psychology and Psychiatry, 61(6), 672–680.
https://doi.org/10.1111/jcpp.13140
Ph, E. (2023, February 8). COLUMN | Sara look: Low comprehension among students is caused
by inefficiencies of your people. Explained PH | Youth-Driven Journalism.
https://www.explained.ph/2022/09/sara-look-low-comprehension-among-students-is-caused-by-
inefficiencies-of-your-people.html#:~:text=According%20to%20a%20World%20Bank,of
%20education%20in%20the%20country.
Nation, K. (2019). Children’s reading difficulties, language, and reflections on the simple view
of reading. Australian Journal of Learning Difficulties, 24(1), 47–73.
https://doi.org/10.1080/19404158.2019.1609272
ZARAGOZA NATIONAL HIGH SCHOOL
San Rafael, Zaragoza, Nueva Ecija
Martinelli, K., & Cruger, M., PhD. (2022). Understanding Dyslexia. Child Mind Institute.
https://childmind.org/article/understanding-dyslexia/