Abstract

This thesis was developed out of a need to reduce the time required to correct Linear Predictive Code (LPC) data used for training a formant tracker. A program was written to select peaks from LPC data and interpret them as Fl, F2, and F3, using knowledge about the phonetic transcription, the sex of the speaker, knowledge about individual phonemes, and a few heuristics. The system was tested on a database of eight speakers, four male and four female, each of whom produced ten sentences. This data set comprised 1,011 resonant phonemes covering 17,363 5-msec. frames. Overall the system correctly matched Fl in 98.9% of the frames, F2 in 92.2% of the frames, and F3 in 88.8% of the frames.

Library of Congress Subject Headings

Speech processing systems--Design; Formants (Speech)--Data processing; Automatic speech recognition; Speech synthesis--Data processing

Publication Date

1989

Document Type

Thesis

Department, Program, or Center

Computer Science (GCCIS)

Advisor

Hillenbrand, James

Advisor/Committee Member

Biles, John

Advisor/Committee Member

Anderson, Peter

Comments

Note: imported from RIT’s Digital Media Library running on DSpace to RIT Scholar Works. Physical copy available through RIT's The Wallace Library at: TK7882.S65 R525 1989

Campus

RIT – Main Campus

Share

COinS