Abstract: This study aims to investigate the acquisition process of intonation. It examines the intonation structure of Tokyo Japanese and its realization by Iranian learners of Japanese. Seven Iranian learners of Japanese, differing in fluency, and two Japanese speakers participated in the experiment. Two sentences were used to test the phonological and phonetic characteristics of lexical pitch-accent as well as the intonation patterns produced by the speakers. Both sentences consisted of similar words with the same number of syllables and lexical pitch-accents but different syntactic structure. Speakers were asked to read each sentence three times at normal speed, and the data were analyzed by Praat. The results show that lexical pitch-accent, Accentual Phrase (AP) and AP boundary tone realization vary depending on sentence type. For sentences of type XdeYwo, the lexical pitch-accent is realized properly. However, there is a rise in AP boundary tone regardless of speakers’ level of fluency. In contrast, in sentences of type XnoYwo, the lexical pitch-accent and AP boundary tone vary depending on the speakers’ fluency level. Advanced speakers are better at grouping words into phrases and produce more native-like intonation patterns, though they are not able to realize downstep properly. The non-native speakers tried to realize proper intonation patterns by making changes in lexical accent and boundary tone.
Abstract: The enthusiasm of foreigners studying the Indonesian language by Foreign Speakers (BIPA) was documented in a sitcom "International Class". Tone and stress when they speak the Indonesian language is unique and different from Indonesian pronunciation. By using the Praat program, this research aims to describe prosodic Indonesian language which is spoken by ‘International Class” actors consisting of Abbas from Nigeria, Lee from Korea, and Kotaro from Japan. Data for the research are taken from the video sitcom "International Class" that aired on Indonesian television. The results of this study revealed that pitch movement that arises when pronouncing Indonesian sentences was up and down gradually, there is also a rise and fall sharply. In terms of stress, respondents tend to contain a lot of stress when pronouncing Indonesian sentences. Meanwhile, in terms of temporal structure, the duration pronouncing Indonesian sentences tends to be longer than that of Indonesian speakers.
Abstract: Gestures play a major role in comprehension and
memory recall due to the fact that aid the efficient channel of
the meaning and support listeners’ comprehension and memory. In
the present study, the assistance of two kinds of gestures (iconic
and beat gestures) is tested in regards to memory and recall. The
hypothesis investigated here is whether or not iconic and beat gestures
provide assistance in memory and recall in Greek and in Greek
speakers’ second language. Two groups of participants were formed,
one comprising Greeks that reside in Athens and one with Greeks
that reside in Copenhagen. Three kinds of stimuli were used: A video
with words accompanied with iconic gestures, a video with words
accompanied with beat gestures and a video with words alone. The
languages used are Greek and English. The words in the English
videos were spoken by a native English speaker and by a Greek
speaker talking English. The reason for this is that when it comes to
beat gestures that serve a meta-cognitive function and are generated
according to the intonation of a language, prosody plays a major
role. Thus, participants that have different influences in prosody may
generate different results from rhythmic gestures. Memory recall was
assessed by asking the participants to try to remember as many
words as they could after viewing each video. Results show that
iconic gestures provide significant assistance in memory and recall
in Greek and in English whether they are produced by a native or
a second language speaker. In the case of beat gestures though, the
findings indicate that beat gestures may not play such a significant
role in Greek language. As far as intonation is concerned, a significant
difference was not found in the case of beat gestures produced by a
native English speaker and by a Greek speaker talking English.
Abstract: Thai language is difficult in all four language skills,
especially reading. The first year students may have different abilities
in reading, so a teacher is required to find out a student’s reading
level so that the teacher can help and support them till they can
develop and resolve each problem themselves. This research is aimed
to study the prosody problem among Thai students and will be
focused on first year Thai students in the second semester. A total of
58 students were involved in this study. Four obstacles were found:
1. Interpretation from what they read and write
2. Incorrectness Pronunciation of Prosody
3. Incorrectness in Rhythm of the Poem
4. Incorrectness of the Thai Poem Pronunciation
Abstract: The paper presents the design concept of a unitselection
text-to-speech synthesis system for the Slovenian language.
Due to its modular and upgradable architecture, the system can be
used in a variety of speech user interface applications, ranging from
server carrier-grade voice portal applications, desktop user interfaces
to specialized embedded devices.
Since memory and processing power requirements are important
factors for a possible implementation in embedded devices, lexica
and speech corpora need to be reduced. We describe a simple and
efficient implementation of a greedy subset selection algorithm that
extracts a compact subset of high coverage text sentences. The
experiment on a reference text corpus showed that the subset
selection algorithm produced a compact sentence subset with a small
redundancy.
The adequacy of the spoken output was evaluated by several
subjective tests as they are recommended by the International
Telecommunication Union ITU.
Abstract: This paper deals with automatic sentence modality
recognition in French. In this work, only prosodic features are
considered. The sentences are recognized according to the three
following modalities: declarative, interrogative and exclamatory
sentences. This information will be used to animate a talking head for
deaf and hearing-impaired children. We first statistically study a real
radio corpus in order to assess the feasibility of the automatic
modeling of sentence types. Then, we test two sets of prosodic
features as well as two different classifiers and their combination. We
further focus our attention on questions recognition, as this modality
is certainly the most important one for the target application.
Abstract: In this paper, we propose a method of alter duration in
frequency domain that control prosody in real time after pitch
alteration. If there has a method to alteration duration freely among
prosody information, that may used in several fields such as speech
impediment person's pronunciation proof reading or language study.
The pitch alteration method used control prosody altered by PSOLA
synthesis method which is in time domain processing method.
However, the duration of pitch alteration speech is changed by the
frequency domain. In this paper, we altered the duration with the
method of duration alteration by Fast Fourier Transformation in
frequency domain. Consequently, the intelligibility of the pitch and
duration are controlled has a slight decrease than the case when only
pitch is changed, but the proposed algorithm obtained the higher MOS
score about naturalness.