انجمن کامپیوتر ایران

برای عضویت کلیک کنید

مشاهده‌ مشخصات مقاله

Language Identification from Text Using HMMs

Oktie Hassanzadeh, Ehsan Zamiri

نویسنده (ها)

دوازدهمین کنفرانس بین‌المللی سالانه انجمن کامپیوتر ایران

مربوط به کنفرانس

Language identification from text has received less attention than identification from other forms of input. This is due to the fact that it is considered an easy problem. Several techniques exist and it is possible to gain perfect accuracy in identifying the language of the text in some methods. Nevertheless, there has been very few works on the accuracy and performance of different techniques with limited input, i.e., accurate detection of the language with less input length and identification of the language of a short sentence or a single word. In this paper, we present a method based on Hidden Markov Models (HMMs) for language identification from text. We use the power of HMMs for detecting language of character strings and show the benefits of using this model over a simple model. We will show how an extremely simple realization of this model outperforms simple models in accurately identifying languages of short input strings.

چکیده

برای اعضای سایت : ۱٠٠,٠٠٠ ریال
برای دانشجویان عضو انجمن : ۲٠,٠٠٠ ریال
برای اعضای عادی انجمن : ۴٠,٠٠٠ ریال

قیمت