Characteristics of clear and vocoded Persian vowels and their preliminary comparison with English vowels

Document Type : .

Authors

Abstract

Vowel perception occurs through the spectral and temporal information transference. Presence of temporal information in the envelope makes the speech signals perceivable, even when spectral information weakens in the synthesized signals. The present study deals, for the first time, with determining and comparing the acoustic characteristics of clear and vocoded vowels of the speakers of English and Persian languages. Ten native Persian (5 female and 5 male) and one male native English speakers produced their simple vowels of their mother tongue for three times. Tokens were of 180 Persian and 24 English phonetic samples. The vowels were then vocoded using MATLAB software and 270 Persian Vocoded samples and 54 English Vocoded samples were extracted. The waveforms, spectrograms and LPC diagrams of clear and vocoded vowels were analyzed using the Praat software. Results showed that in Persian speakers F0, F1, vowel space and duration of clear vowels were higher for females than males but the intensity was lower. Formant frequencies for F1-F3 were different across two languages. Vowel space was greater in Persian and English vowels duration and intensity were longer and lower respectively. In both languages, F0 and formants of vocoded vowels were not identifiable, and their LPC diagrams had rounded peaks with lower amplitudes. Duration of clear and vocoded vowels was similar, but intensity of the latter decreased in Persian and enhanced in English. Results showed spectral differences between vocoded and clear vowels in both languages.

Keywords


فایل پی دی فاف را دریافت نمایید

بی‌جن‌خان، محمود. (1369). تجزیه و تحلیل صوتی واکه‌های ساده و مرکب زبان فارسی براساس نظریۀ فانت. پایان‌نامه کارشناسی ارشد، دانشگاه تهران.
بی‌جن‌خان، محمود. (1392). نظام آوایی زبان فارسی. تهران: سمت.
سپنتا، ساسان. (1377). آواشناسی فیزیکی زبان فارسی (توصیف مشخصه‌های صوت شناختیِ واج‌های زبان فارسی). اصفهان: نشر گل‌ها.
صالحی، فرانک، اکبر بهرامی، فرهاد ترابی‌نژاد، و محمد کمالی. (1387). «ساختار سازه‌ای واکه‌های زبان فارسی در دانش آموزان 7 تا 9 سالۀ عادی و مبتلا به افت شنوایی متوسط و شدید شهر اصفهان». شنوایی‌شناسی، 17، 57-42.
عادل‌قهرمان، منصوره. صدایی، مهین. فتاحی، جمیله. محمد‌خانی، قاسم. (1385). فرهنگ جامع شنوایی‌شناسی. تهران: دانش‌فریار.
علی‎نژاد، بتول و فهمیه حسینی‌بالام. (1392). مبانی آواشناسی آکوستیکی. اصفهان: دانشگاه اصفهان.
فشندکی، شهره. (1390). بررسی صوت‌شناختی (آکوستیکی) واکه‌های سادۀ زبان فارسی معیار. پایان‌نامۀ کارشناسی ارشد، دانشگاه الزهرا.
محمدی، هیوا. محمدی، ریحانه. ترابی‌نژاد، فرهاد و منصور رضایی. (1390). «تعیین ساختار سازه‌ای و فضای واکه‌ای در واکه‌های زبان فارسی». شنوایی‌شناسی، 20(2)، 85-79.
مدرسی‌قوامی، گلناز. (1392). آواشناسی: بررسی علمی گفتار. تهران: سمت.
مدرسی‌قوامی، گلناز. (1393). «نگاهی دیگر به ویژگی‌های صوت‌شناختی واکه‌های کوتاه و بلند در زبان فارسی». پژوهش‌های زبان‌شناسی تطبیقی، 8، 9-1.
 
Ansarin, Ali Akbar, (2004). “An acoustic analysis of modern Persian vowels”. Paper presented at the SPECOM’2004: 9th Conference Speech and Computer September 20-22, St. Petersburg, Russia.
Baken, R. J., Orlikoff, R. F. (2000). Clinical measurement of speech and voice. California: Singular Thomson Learning.
Behrman, A. (2007). Speech and voice science. San Diego: Plural Pub.
Bijankhan, M. (2000). “Farsi vowel compensatory lengthening: an experimental approach”. The Fifth International Conference on Spoken Language Processing, Beijing, China.
Boersma, P., Weenink, D. (2007). Praat: doing phonetics by computer Retrieved on July 27, 2007, 34.
Chen, F. (2012). “Predicting the intelligibility of cochlear-implant vocoded speech from objective quality measure”. Journal of Medical and Biological Engineering. 32, 189-94.
Chen, F., Loizou, P. C. (2011). “Predicting the intelligibility of vocoded speech”. Ear and hearing. 32(3), 331-38.
Cho, T., Ladefoged, P. (1999). “Variation and universals in VOT: evidence from 18 languages”. Journal of Phonetics. 27(2), 207-29.
Ellermeier, W., Kattner, F., Ueda, K., Doumoto, K., Nakajima, Y. (2012). “Effects of noise-vocoded speech in the irrelevant-sound paradigm”. In 38. Jahrestagung für Akustik–DAGA 2012. Galrev: Berlin, 336
Fairbanks, G., House, A. S., Stevens, E. L. (1950). “An experimental study of vowel intensities”. Journal of the Acoustical Society of America. 22(4), 457-59.
Fu, Q. J., Shannon, R. V. (1999). “Effects of electrode configuration and frequency allocation on vowel recognition with the Nucleus-22 cochlear implant”. Ear and hearing. 20(4), 332-44.
Greenwood, D. D. (1990). “A cochlear frequency-position function for several species-29 years later”. Journal of the Acoustical Society of America. 87(6): 2592-605.
Hayward, K. (2000). Experimental phonetics. London: Longman.
Henton, C., Elenius, K., Branderud, P., (1995). “Cross-language variation in the vowels of female and male speakers”: The 13th international congress of phonetic sciences. Vol.4, 420-423.
Hervais-Adelman, A. G., Davis, M. H., Johnsrude, I. S., Taylor, K. J., Carlyon, R. P. (2011). “Generalization of perceptual learning of vocoded speech”. Journal of experimental psychology: Human perception and performance. 37(1), 283-95.
Hillenbrand, J., Getty, L. A., Clark, M. J., Wheeler, K. (1995). “Acoustic characteristics of American English vowels”. Journal of the Acoustical Society of America. 97(5 Pt 1), 3099-111.
Jianping H. (2002). Cross-Racial Studies of Human Vowel Tract Dimentions and Formant Structures. Ph.D. dissertation, Ohio State University.
Kent, R. D., & Kim, Y. (2008). Acoustic analysis of speech. In M. J. Ball, M. R. Perkins, N. Muller & S. Howard (eds.), The Handbook of Clinical Linguistics. Oxford: Blackwell Publishing.
Ladefoged, P and Disner, S. F. (2005). Vowels and consonants, 2nd ed., Oxford: Wiley-Blackwell.
Ladefoged, P. (2006). A course in phonetics, 5th ed., Boston: Thomson Wadsworth.
Lea, W. A. (1973). “Segmental and suprasegmental influences on fundamental frequency contours, Consonant types and tones”. In Hyman, L M., editor. Consonant types and tones. Los Angeles: Linguistics Program, University of Southern California. 15-70.
Lehiste, I., Peterson, G. E. (1961). “Transitions, glides, and diphthongs”. Journal of the Acoustical Society of America, 33(3), 268-77.
Licklider, J. C. R., Pollack, I. (1948). “Effects of differentiation, integration, and infinite peak clipping upon the intelligibility of speech”. Journal of the Acoustical Society of America, 20(1), 42-51.
Liu, C., Azimi, B., Tahmina, Q., Hu, Y. (2012). “Effects of low harmonics on tone identification in natural and vocoded speech”. Journal of the Acoustical Society of America, 132(5), 378-84.
Mermelstein, P. (1978). “Difference limens for formant frequencies of steady-state consonant-bound vowels”, J Acoust Soc Am. 63, 572-80.
Mildner, V. (2008). The cognitive neuroscience of human communication. New York: Taylor & Francis.
Peterson, G. E, Barney, H. L. (1952). “Control methods used in a study of the vowels”. Journal of the Acoustical Society of America. 24(2),175-84.
Riquimaroux, H. (2006). “Perception of noise-vocoded speech sounds: Sentences, words, accents and melodies”. Acoustical Science and Technology, 27(6), 325-31.
Rosen, S. (1992). “Temporal information in speech: acoustic, auditory and linguistic aspects”. Philosophical transactions of the Royal Society of London Series B, Biological sciences, 336(1278), 367-73.
Schvartz, K. C., Chatterjee, M. (2012). “Gender identification in younger and older adults: use of spectral and temporal cues in noise-vocoded speech”. Ear and hearing. 33(3), 411-20.
Scott S. K. (2005). “The neurobiology of speech perception”. In: Cutler A, editor. Twenty-First Century Psycholinguistics: Four Corner Stones. 1st ed. Mahwah, New Jersey: Lawrence Erlbaum Associates. 141-157.
Shannon, R. V., Zeng, F. G., Kamath, V., Wygonski, J., Ekelid, M. (1995). “Speech recognition with primarily temporal cues”. Science, 270(5234), 303-4.
Shannon, R. V., Zeng, F. G., Wygonski, J. (1998). “Speech recognition with altered spectral distribution of envelope cues”. Journal of the Acoustical Society of America. 104(4), 2467-76.
Simpson, A. P. (2000). “Gender-specific differences in the articulatory and acoustic realization of interword vowel sequences in American English”. 5th Seminar on Speech Production: Models and Data Kloster Seeon.
Simpson, A. P., Ericsdotter, C. (2003). “Sex-specific durational differences in English and Swedish”. Proceedings of 15th ICPhS, (1113-1116) Barcelona.
Titze, I. R. (1989). “Physiologic and acoustic differences between male and female voices”. Journal of the Acoustical Society of America, 85(4), 1699-1707.
Ueda, K., Nakajima, Y. (2008). “A consistent clustering of power fluctuations in British, English, French, German, and Japanese”. Trans Tech Comm Psychol Physiol Acoust, 38(8), 771-76.
Whalen, D. H., Levitt, A. G. (1995). “The universality of intrinsic F0 of vowels”. Journal of Phonetics, 23(3), 349-66.
Zwicker, E., Terhardt, E. (1980). “Analytical expressions for critical‐band rate and critical bandwidth as a function of frequency”. Journal of the Acoustical Society of America, 68(5), 1523-25.