Abstract
Millions of people around the world are diagnosed with neurological disorders like Parkinson’s, Cerebral Palsy or Amyotrophic Lateral Sclerosis. Due to the neurological damage as the disease progresses, the person suffering from the disease loses control of muscles, along with speech deterioration. Speech deterioration is due to neuro motor condition that limits manipulation of the articulators of the vocal tract, the condition collectively called as dysarthria. Even though dysarthric speech is grammatically and syntactically correct, it is difficult for humans to understand and for Automatic Speech Recognition (ASR) systems to decipher. With the emergence of deep learning, speech recognition systems have improved a lot compared to traditional speech recognition systems, which use sophisticated preprocessing techniques to extract speech features.
In this digital era there are still many documents that are handwritten many of which need to be digitized. Offline handwriting recognition involves recognizing handwritten characters from images of handwritten text (i.e. scanned documents). This is an interesting task as it involves sequence learning with computer vision. The task is more difficult than Optical Character Recognition (OCR), because handwritten letters can be written in virtually infinite different styles. This thesis proposes exploiting deep learning techniques like Convolutional Neural Networks (CNN) and Recurrent Neural Networks (RNN) for offline handwriting recognition. For speech recognition, we compare traditional methods for speech recognition with recent deep learning methods. Also, we apply speaker adaptation methods both at feature level and at parameter level to improve recognition of dysarthric speech.
Library of Congress Subject Headings
Automatic speech recognition; Neural networks (Computer science); Articulation disorders--Data processing; Pattern recognition systems
Publication Date
5-2017
Document Type
Thesis
Student Type
Graduate
Degree Name
Computer Science (MS)
Department, Program, or Center
Computer Science (GCCIS)
Advisor
Raymond Ptucha
Advisor/Committee Member
Zack Butler
Advisor/Committee Member
Emily Prud’hommeaux
Recommended Citation
Pillai, Suhas, "Dysarthric Speech Recognition and Offline Handwriting Recognition using Deep Neural Networks" (2017). Thesis. Rochester Institute of Technology. Accessed from
https://repository.rit.edu/theses/9407
Campus
RIT – Main Campus
Plan Codes
COMPSCI-MS
Comments
Physical copy available from RIT's Wallace Library at TK7895.S65 .P48 2017