학술논문

A Double Metaphone encoding for Bangla and its application in spelling checker
Document Type
Conference
Source
2005 International Conference on Natural Language Processing and Knowledge Engineering Natural Language Processing and Knowledge Engineering Natural Language Processing and Knowledge Engineering, 2005. IEEE NLP-KE '05. Proceedings of 2005 IEEE International Conference on. :705-710 2005
Subject
Computing and Processing
Encoding
Natural languages
Clustering algorithms
Modems
Bangla
Bengali
Phonetic Encoding
Double Metaphone
Spelling suggestions
Spelling Checker
Language
Abstract
We present a Double Metaphone encoding for Bangla that can be used by spelling checkers to improve the quality of suggestions for misspelled words. The complex rules of Bangla spelling present a significant challenge in producing suggestions for a misspelled word when employing the traditional edit-distance methods; one must take phonetic similarity into account for the suggested alternatives to be reasonably accurate. We propose a Double Metaphone encoding for Bangla, taking into account the various context-sensitive rules, including those involving the large repertoire of consonant clusters in Bangla, and present a comparison with the traditional edit-distance based methods in producing suggestions for misspelled words.