Automatic Evaluation of Speech Therapy
Exercises Based on Image Data*
Zuzana Bílková, Adam Novozámský, Adam Dominec, Šimon Greško,
Barbara Zitová, and Markéta Paroubková
The Czech Academy of Sciences, Institute of Information Theory and Automation,
Pod Vodárenskou věží 4, Praha 8, 182 08, Czech Republic
bilkova@utia.cas.cz
Abstract. The presented paper proposes a new method for unique automatic evaluation of speech therapy exercises, one part of the future software system for speech therapy support. The method is based on the detection of the lips and tongue movements, which will help to evaluate the quality of the exercise implementation. Four different types of exercises are introduced and the corresponding features, capturing the quality of the movements, are shown. The method was tested using manually annotated data and the proposed features were evaluated and analyzed. In the second part, the tongue detection is proposed based on the convolutional neural network approach and preliminary results are shown.
Keywords: Lip movement detection · Tongue segmentation · Speech therapy.
1 Introduction
Automatic evaluation of speech therapy exercises is one of the focuses of our research to create a software system to support speech therapy for children and adults with inborn and acquired motor speech disorders. The system aims to improve articulation and tongue motion based on individual treatment using exercises recommended by a therapist. The key component of the system is the evaluation of tongue and lips motion based on image data from an ordinary web camera.
Existing applications only passively show exercises in the form of images, videos, or text descriptions for the voice formulation, but none of the available applications offers the possibility of an automatic assessment with commonly available cameras. The use of digital imaging techniques is new and enables a degree of interactivity where the treatment itself becomes more effective and a patient's return to normal life is accelerated. In addition, the methodology of tongue motion detection is also applicable in other areas, such as controlling various devices by quadriplegics, where existing solutions include special sensors that are stuck to the tongue.
* Supported by TACR, grant No. TJO1OONIS1, and the grant SVV 2017-260852.
1.1 Related work
To automatically evaluate the speech therapy exercises, the tongue, its tip, and lips motion must be detected. In the preliminary step, the mouth must be located in the face image.
In the literature, there are several methods for face detection and detection of key facial features. Effective object detection proposed by Viola et al. [8] is a machine learning algorithm using Haar feature-based cascade classifiers. A widely used object detection method introduced in [2] by Dalal et al. is based on histograms of oriented gradients and support vector machines.
There are several methods for tongue segmentation. These methods are often used as the initial step for tongue diagnosis used in Chinese medicine. Zhang et al. [9] use tongue color, texture, and geometry features to detect diabetes. The HSI color space was used for tongue segmentation in [3, 1]. Another common segmentation algorithm is the method of active contours, which is used in [6, 10]. All of these methods are slow and they are not robust. They also suppose the tongue to be stuck out in only one position, which is not sufficient to evaluate the complex tongue motion.
We have not found any paper concerning detection of the tip of the tongue, which is a very complicated problem due to the self-similarity of the tongue tissue. Another part of the solution is the detection of lips, which can often be found in voice activity detection systems, for example in [5], but there are no papers about lips detection for speech therapy. A systematic review of studies on computer-based speech therapy systems or virtual speech therapists is presented in [1].
1.2 Speech therapy software
The proposed software system will offer an adjustable set of exercises recommended by an expert and their evaluation, and it will also motivate the patients by incorporating augmented reality, such as an augmented picture of a ladybug sitting on the nose tip when the tongue should be stuck out. The augmented reality is employed in the solution to increase understanding of the proper exercise movements. Moreover, the system will offer session archiving, allowing the therapist to evaluate treatment progress and adjust its schedule.
The main part of the software evaluates the quality of the exercise implementation. Using the therapist's expertise, we have distinguished three groups of exercises based on the information necessary for their evaluation. They work with tracking of the patient's lips, with the segmentation of the tongue, and with the tongue tip detection, respectively. In the next Section we will focus on processing of images of the mouth to obtain lips movement patterns, and in Section 3 the tongue segmentation and localization of its tip are discussed.
2 Evaluation of the lips exercises
The evaluation of exercises when the lips have to follow specified motion patterns is based on the lips detection and their tracking. In the preliminary step, a patient's mouth is located in the face image. The proposed solution for detection of face parts uses the Dlib library [4], based on an ensemble of regression trees that are trained to estimate the facial landmark positions directly from the pixel intensities. The Dlib library automatically detects 68 points in the face image in real-time, allowing an easy detection of the mouth and the lips. The features are described in Table 1.
For the further processing, we focus on the lips and mouth location only, thus the image of a face is cropped according to the position of selected points, as is shown in Figure 1. The cropped images are normalized with respect to their size and to the orientation between eyes. In order to evaluate the lips exercises we have proposed six features, derived from the positions of the detected lips points. They are illustrated in Figure 1 by blue drawings and described in Table 1. They capture the main movement patterns of the lips. In our tests, these features proved to be sufficient to evaluate the quality of the patient exercises, see Section 2.1.
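As a minimal sketch of this normalization step (assuming landmarks are already available as a (68, 2) array in Dlib's standard 68-point indexing; the slice constants below are that scheme's conventional eye and mouth ranges, not something stated in the paper), the size and roll correction can be written as:

```python
import numpy as np

# Dlib's 68-point scheme (an assumption of this sketch):
# 36-41 = right eye, 42-47 = left eye, 48-67 = mouth.
RIGHT_EYE, LEFT_EYE, MOUTH = slice(36, 42), slice(42, 48), slice(48, 68)

def normalized_mouth(landmarks):
    """Scale the mouth landmarks by the inter-eye distance and rotate
    them so the eye line is horizontal, making subsequent features
    invariant to face size and head roll."""
    pts = np.asarray(landmarks, dtype=float)
    r_eye = pts[RIGHT_EYE].mean(axis=0)   # eye centers from landmark means
    l_eye = pts[LEFT_EYE].mean(axis=0)
    d = l_eye - r_eye
    scale = np.linalg.norm(d)             # inter-eye distance
    angle = np.arctan2(d[1], d[0])        # roll of the eye line
    c, s = np.cos(-angle), np.sin(-angle)
    rot = np.array([[c, -s], [s, c]])     # rotation undoing the roll
    mouth = pts[MOUTH] - pts[MOUTH].mean(axis=0)
    return (mouth @ rot.T) / scale
```

The same transform can be applied to the cropped image itself; here only the points are normalized, which is all the features below require.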
Fig. 1. Selection of key detected facial points and features for analysis of lips exercises.
Table 1. Description of main features.

|Notation|Name           |Description                                                   |
|CR      |Corner - right |Vertical shift of the right corner with respect to the center.|
|CL      |Corner - left  |Vertical shift of the left corner with respect to the center. |
|LD      |Lip - down     |Height of the lower lip.                                      |
|LU      |Lip - up       |Height of the upper lip.                                      |
|OP      |Open           |Lips distance when the mouth is open.                         |
|WI      |Wide           |Horizontal distance of left and right corner.                 |
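A hedged sketch of how the six features of Table 1 could be computed from the landmarks: the paper defines them only geometrically, so the specific point correspondence used here (48/54 = mouth corners, 51/57 = outer lip mid-points, 62/66 = inner lip mid-points, following Dlib's usual 68-point layout) is an illustrative assumption.

```python
import numpy as np

def lip_features(pts):
    """Compute the six lip features of Table 1 from a (68, 2) landmark
    array. Image y grows downward, so a lifted corner has a SMALLER y
    than the mouth center; CR/CL are therefore center_y - corner_y."""
    pts = np.asarray(pts, dtype=float)
    center_y = (pts[62, 1] + pts[66, 1]) / 2.0   # vertical mouth center
    return {
        "CR": center_y - pts[54, 1],         # corner - right: lift above center
        "CL": center_y - pts[48, 1],         # corner - left
        "LU": pts[62, 1] - pts[51, 1],       # height of the upper lip
        "LD": pts[57, 1] - pts[66, 1],       # height of the lower lip
        "OP": pts[66, 1] - pts[62, 1],       # inner-lip distance (opening)
        "WI": abs(pts[54, 0] - pts[48, 0]),  # horizontal corner distance
    }
```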
For this paper, we selected four representative lip exercises to show how the features are used in the evaluation process. The exercises are the closed smile, the open smile, and the crooked smile to the left or to the right, when only one mouth corner is lifted. Table 2 shows the visualization of individual exercises with a formula for the corresponding feature calculation. If the exercise is executed correctly, the values of the individual features are maximized. In the case of the closed smile both corners have to be lifted, therefore we control the minimum height of both corners, min(CL, CR); the value -OP is maximized when the mouth is closed; and finally the mouth widens when we smile, which is captured by the WI feature. The only difference in the open smile exercise is with the feature OP, which controls the lips distance and here it should be maximized. In the case of the crooked smile, one of the corners is supposed to lift up and the other has to stay low, and the mouth also widens, as is shown in the last row of the table.
Table 2. Features used for evaluation of selected lips exercises (visualizations omitted).

|Name                  |Features Calculation|
|Closed smile          |min(CL, CR), -OP, WI|
|Open smile            |min(CL, CR), OP, WI |
|Crooked smile - right |CR, -CL, WI         |
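The criteria of Table 2 lend themselves to a simple per-frame check. The sketch below encodes each exercise as the tuple of feature values it maximizes; the per-criterion thresholds are an assumption of this sketch (the paper only states that the features are maximized, not how "maximized" is decided).

```python
# Each exercise maps the feature dict to the tuple of values that
# should be large when the exercise is performed correctly (Table 2).
EXERCISES = {
    "closed smile":        lambda f: (min(f["CL"], f["CR"]), -f["OP"], f["WI"]),
    "open smile":          lambda f: (min(f["CL"], f["CR"]), f["OP"], f["WI"]),
    "crooked smile right": lambda f: (f["CR"], -f["CL"], f["WI"]),
    "crooked smile left":  lambda f: (f["CL"], -f["CR"], f["WI"]),
}

def exercise_correct(name, feats, thresholds):
    """Declare one frame correct when every criterion of the exercise
    reaches its threshold (thresholds are hypothetical, e.g. calibrated
    per patient from a neutral-face frame)."""
    values = EXERCISES[name](feats)
    return all(v >= t for v, t in zip(values, thresholds))
```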
The individual exercises have to be practised several times. The number of repetitions is set by the therapist. Figure 2 shows the automated counting of smiles performed. The successful attempts are appraised (the green smile) and counted, while an erroneous performance is highlighted by the yellow icon and the count of smiles stays intact.
Fig. 2. Automated counting of smiles.
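One way such a repetition counter could work, sketched under assumptions not detailed in the paper: count a repetition on the transition from "incorrect" to "correct", and require the correct pose to be held for a few consecutive frames to suppress detection flicker.

```python
class SmileCounter:
    """Count completed repetitions of one exercise from a per-frame
    correctness flag. The `hold` length (frames the pose must be held
    before it counts) is a hypothetical debouncing parameter."""
    def __init__(self, hold=5):
        self.hold = hold
        self.streak = 0    # consecutive correct frames so far
        self.count = 0     # completed repetitions
        self.active = False  # currently inside a counted repetition

    def update(self, correct):
        if correct:
            self.streak += 1
            # Count once when the pose has been held long enough.
            if self.streak >= self.hold and not self.active:
                self.active = True
                self.count += 1
        else:
            # Pose released: the next held pose is a new repetition.
            self.streak = 0
            self.active = False
        return self.count
```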
2.1 Analysis of the feature performance
‘Testing of the method for the exercise evaluation based on the feature compu-
tation was realized using manually annotated data. The positions of lips points
‘wore detected manually aad the proposed features were evaluated and plotted to
s0e if they reflect the expected motion patterns. Figure 4 shows the values of the
four features from Table 2 (in different colors) for the four chosen exercises. Bach,
exercise is three times repeated. On the x axis values for acquired video frames
are shown. We ean sce that the chosen features correctly capture the desired
characteristics of the exercises, namely the high values for width and height of
both comers and no significant change in the feature deseribing opening for the
closed smile and similar expected beliavior for the other exercises.
Fig. 3. Feature bars representing the value of features of selected lips exercises.
Our tests demonstrate the validity of our solution for the evaluation of the speech therapy exercises.
3 Segmentation of the tongue
The group of exercises based on the tongue and the tongue tip movements utilizes segmentation of the tongue body and of its tip. For the speech therapy exercises, we need to detect the tongue and its tip in real-time and in all the positions.
Fig. 4. Values of individual features in a video of performance of selected exercises.
Our solution is based on the convolutional neural network U-net [7], which proves to be sufficiently fast and robust. We use a single neural network to output both results - segmentation of the tongue and its tip - to achieve higher speed. The output of the network thus has two frames - one is a mask for the tongue segmentation and the other is a mask for its tip. The network is trained on data we partially recorded and partially downloaded from the Internet with different quality to ensure robustness of our method with respect to the data quality. The data were manually annotated. We are still enlarging our database to provide a bigger and more versatile training dataset.
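The joint two-mask design can be illustrated with a heavily reduced U-net-style sketch (PyTorch, our choice of framework; the paper does not state one, and a real U-net would add more levels and skip connections). The point is the shared backbone with a 2-channel head: channel 0 predicts the tongue mask, channel 1 the tip mask, in a single forward pass.

```python
import torch
import torch.nn as nn

class TinyUNet(nn.Module):
    """Minimal encoder-decoder sketch of the joint tongue/tip network.
    Not the paper's architecture: one level deep, no skip connections."""
    def __init__(self):
        super().__init__()
        self.enc = nn.Sequential(            # downsampling path
            nn.Conv2d(3, 16, 3, padding=1), nn.ReLU(),
            nn.MaxPool2d(2))
        self.dec = nn.Sequential(            # upsampling path
            nn.Upsample(scale_factor=2),
            nn.Conv2d(16, 16, 3, padding=1), nn.ReLU())
        self.head = nn.Conv2d(16, 2, 1)      # two masks from one network

    def forward(self, x):
        # Per-pixel probabilities: channel 0 = tongue, channel 1 = tip.
        return torch.sigmoid(self.head(self.dec(self.enc(x))))
```

Training would then apply a per-channel binary cross-entropy against the two annotated masks.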
The results of our current network based on U-net for segmentation of the tongue and its tip are shown in Figure 5. The green line and dot represent the ground truth - the manually segmented tongue contour and the tip. The red lines are the prediction of the tongue contour. The red dots are the center of mass of a mask predicted by the network, as shown in the middle column in the last row. We can see that the results correspond well to the ground truth, even in the image where the whole tongue is in the mouth and the tip is hard to detect. However, to achieve the required final robustness of the tongue detection method, more testing and dataset collection is still needed.
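The center-of-mass step that turns the predicted tip mask into a single point can be sketched as follows (the 0.5 threshold is an assumption; the paper does not specify how the mask is binarized):

```python
import numpy as np

def tip_from_mask(mask, threshold=0.5):
    """Locate the tongue tip as the center of mass of the predicted
    tip-probability mask. Returns (row, col), or None when no pixel
    exceeds the threshold (i.e. no tip detected)."""
    binary = np.asarray(mask, dtype=float) >= threshold
    if not binary.any():
        return None
    rows, cols = np.nonzero(binary)
    return rows.mean(), cols.mean()
```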
4 Conclusion
‘The presented paper demonstrates the method for an automatic evaluation of
speech therapy exercises. This is a part of the future software system for speech
therapy support. The method is based on the detection of the lips and tongue
movements and further evaluation of the quality of realized motion patterns, fol-
lowing the therapist recommendation. The efficiency of the method was demon-Automatic Evaluation of Speech Therapy Exercises Based on Image Data 7
Fig. 5. Prediction results (panels labeled "Prediction").