Abstract
Automated speech recognition (ASR) for spontaneous speech poses extra challenge compared to read speech as it contains varied speaking rates, poor phonation and disfluencies. Studies have shown that filled pause is one of the most common disfluencies of spontaneous speech characteristic where it presents considerable problems for ASR performance. In many filled pause studies, the hindering factor is that filled pause being often recognized as short words which particularly has semantic meaning, such as „um” can be recognized as „thumb” or „arm”. This problem becomes especially pertinent where a vowel sound of normal word being relatively long at any position in an utterance, both within a word as well as between words which formerly known as elongation. The existence of elongation causes normal word falsely detected as filled pause due to their similar acoustical feature patterns. Classifying elongation as filled pause affects ASR”s performance as eliminating normal words from recognition may modify the intended context of a speech. Therefore, the main aim of this research is to classify filled pause and elongation into its own classes by constructing a discriminative classification model from the extracted acoustical features. A large number of signal features have been employed for the problem of discriminating filled pause and elongation. Several wellestablished features such as Formant Frequency (FF), Fundamental Frequency (F0), Mel Frequency Cepstral Coefficients (MFCC), Zero Crossing Rates (ZCR) and Short Time Energy (STE) were used in this research. These features are carefully chosen to emphasize signal characteristics that differ between filled pause and elongation…
Metadata
Item Type: | Book Section |
---|---|
Creators: | Creators Email / ID Num. Hamzah, Raseeda UNSPECIFIED |
Subjects: | L Education > LB Theory and practice of education > Higher Education > Dissertations, Academic. Preparation of theses > Malaysia P Language and Literature > PL Languages and literatures of Eastern Asia, Africa, Oceania > Malay language. General works. History |
Divisions: | Universiti Teknologi MARA, Shah Alam > Institut Pengajian Siswazah (IPSis) : Institute of Graduate Studies (IGS) |
Series Name: | IGS Biannual Publication |
Volume: | 10 |
Number: | 10 |
Keywords: | Abstract; Abstract of thesis; Newsletter; Research information; Doctoral graduates; IPSis; IGS; UiTM; |
Date: | 2016 |
URI: | https://ir.uitm.edu.my/id/eprint/20064 |
Download
ABS_RASEEDA HAMZAH TDRA VOL 10 IGS 16.pdf
Download (624kB) | Preview