نوع مقاله : علمی-پژوهشی
نویسندگان
1 مؤسسه آموزش عالی علومشناختی، تهران، ایران
2 مهندسی برق و کامپیوتر، دانشگاه امیرکبیر، تهران، ایران
چکیده
کلیدواژهها
عنوان مقاله [English]
نویسندگان [English]
Different environmental stimuli are able to influence and change the human emotional states. Speech expresses changes in emotional state in two ways; verbally through vocabulary and syntax and also non-verbally through tone and intonation. Tone and intonation have a paralinguistic role and can alter the meaning of speech. The quantitative processing and study of emotions were first introduced with the concept of affective computing in computer science. The main idea was that the machine could recognize and interpret human emotional states and respond or behave appropriately according to them. The quantitative study of emotional speech is known as speech emotion recognition. Recognition or classification of emotional speech means being able to recognize the speaker’s emotional state using analysis performed on the speech signal. The first step for conducting this type of study is to have a rich, standard, high-quality, and appropriately sized dataset for evaluating speech emotion recognition algorithms. There are a wide variety of standard speech emotion datasets in popular languages. The lack of such a database in the Persian language for researchers in the field of speech emotion recognition has resulted in low quality understanding of emotional patterns and their impact in the Persian language. This point underscores the necessity of creating and producing a speech emotion dataset in the Persian language. In this paper, we describe the designing, preparing, and producing a speech emotion dataset in standard Persian language, similar to the approach the Berlin Emotional Speech Database, which can be used in speech emotion recognition studies.
کلیدواژهها [English]