Evaluation of intonation features on emphasized Malay words / Syazwani Nasaruddin

Nasaruddin, Syazwani (2017) Evaluation of intonation features on emphasized Malay words / Syazwani Nasaruddin. Masters thesis, Universiti Teknologi MARA (UiTM).

Abstract

Detection of emphasized has become one active research in speech processing. Nowadays, in automatic speech recognition, listeners not only want to be able automatically understand which words the speakers have said but also how the speakers deliver the speech. This paper presents detection of emphasis in Malay sentences. Thus, the purpose of this research is consisting, to construct emphasized Malay speech, to evaluate pitch and intensity features on Malay words and lastly to classify the Malay words using intonation features. First step to construct emphasized Malay speech is by listening to 96 sentences and 314 words by 2 different speakers. In this step, user hear the speech and used the hand labelling method discuss in chapter 2. User listened to the speech audio and mark the words that the user finds the speaker voice is high. If the voice of speaker is high, user classified the word as emphasized words. This step is to determine the emphasized and non-emphasized words. All the features of emphasized words detect by user is extracted. After detection of emphasized and non-emphasized words for speakerl and speaker2, intensity and pitch features need to be evaluated. Intensity and pitch features that need to be evaluated are minimum value of pitch and intensity, maximum value of pitch and intensity and mean value of pitch and intensity. In this research, all 314 words from 2 different male speakers' speeches are clustered manually based on features of intensity and pitch pattern. The pattern of the features such as intensity and pitch are observed and clustered to find out how many patterns can be identified from all the words. Result from the intensity and pitch features can classify Malay phrases. Activities in this section are, for testing part, 314 words from 2 different speakers are evaluated by using clustering method. WEKA is a set of machine learning algorithm for data mining task. The data are clustered to determine emphasized and non-emphasized words. From the research done shows that he highest percentage in detecting emphasized words in Malay words is by combination of all features such as intensity and pitch features.

Metadata

Item Type:	Thesis (Masters)
Creators:	Creators Email / ID Num. Nasaruddin, Syazwani 2015694058
Contributors:	Contribution Name Email / ID Num. Thesis advisor Mohamed Hanum, Haslizatul Fairuz UNSPECIFIED
Subjects:	P Language and Literature > P Philology. Linguistics > Language and languages
Divisions:	Universiti Teknologi MARA, Shah Alam > Faculty of Computer and Mathematical Sciences
Programme:	Master of Computer Science
Keywords:	Automatic speech recognition, Malay speech, pitch features
Date:	2017
URI:	https://ir.uitm.edu.my/id/eprint/98181