학술논문

Transformer-based Detection of Multiword Expressions in Flower and Plant Names
Document Type
Working Paper
Source
Subject
Computer Science - Computation and Language
Language
Abstract
Multiword expression (MWE) is a sequence of words which collectively present a meaning which is not derived from its individual words. The task of processing MWEs is crucial in many natural language processing (NLP) applications, including machine translation and terminology extraction. Therefore, detecting MWEs in different domains is an important research topic. In this paper, we explore state-of-the-art neural transformers in the task of detecting MWEs in flower and plant names. We evaluate different transformer models on a dataset created from Encyclopedia of Plants and Flower. We empirically show that transformer models outperform the previous neural models based on long short-term memory (LSTM).
Comment: Submitted to The 5th Workshop on Multi-word Units in Machine Translation and Translation Technology at Europhras2022