Academic Paper

TCS_WITM_2021 @FinSim-2: Transformer based Models for Automatic Classification of Financial Terms
Document Type
Conference
Source
Companion Proceedings of the Web Conference 2021. :311-315
Subject
Automatic Classification of Financial Term
Ontology
TFIDF Vectors
Text Classification
Transformers
Language
English
Abstract
Recent advances in neural network architectures have provided several opportunities to develop systems that automatically extract and represent information from domain-specific unstructured text sources. The FinSim-2021 shared task, co-located with the FinNLP workshop, offered the challenge of automatically learning effective and precise semantic models of financial domain concepts. Building such semantic representations of domain concepts requires knowledge about the specific domain. Such knowledge can be obtained from the contextual information available in raw text documents on those domains. In this paper, we propose a transformer-based BERT architecture that captures such contextual information from a set of domain-specific raw documents and then performs a classification task to segregate domain terms into a fixed number of class labels. The proposed model not only considers the contextual BERT embeddings but also incorporates a TF-IDF vectorizer that supplies word-level importance to the model. The performance of the model has been evaluated against several baseline architectures.
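The abstract does not say how the TF-IDF signal is merged with the BERT embeddings. One common reading is simple feature concatenation, and the sketch below illustrates only that assumption, not the authors' implementation. The function `toy_embedding` is a hypothetical stand-in for a contextual BERT encoder (no real model is loaded), and the sample terms are invented for illustration.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return (vocabulary, one TF-IDF vector per document) for tokenized docs."""
    vocab = sorted({tok for doc in docs for tok in doc})
    n_docs = len(docs)
    # Document frequency: number of documents that contain each token.
    df = Counter(tok for doc in docs for tok in set(doc))
    # Smoothed IDF, so every weight is defined and positive.
    idf = {tok: math.log((1 + n_docs) / (1 + df[tok])) + 1.0 for tok in vocab}
    vectors = []
    for doc in docs:
        counts = Counter(doc)
        vectors.append([counts[tok] / len(doc) * idf[tok] for tok in vocab])
    return vocab, vectors


def toy_embedding(tokens, dim=4):
    """Hypothetical placeholder for a BERT embedding (assumption, not a real model)."""
    vec = [0.0] * dim
    for tok in tokens:
        for i, ch in enumerate(tok):
            vec[i % dim] += ord(ch) / 1000.0
    return [v / len(tokens) for v in vec]


def combined_features(docs, dim=4):
    """Concatenate the dense embedding with the TF-IDF vector (assumed fusion step)."""
    _, tfidf = tfidf_vectors(docs)
    return [toy_embedding(doc, dim) + tfidf[i] for i, doc in enumerate(docs)]


# Invented example terms, tokenized on whitespace.
terms = [["green", "bond"], ["convertible", "bond"], ["equity", "index"]]
vocab, _ = tfidf_vectors(terms)
features = combined_features(terms)
```

Each resulting vector could then be passed to any classifier over the fixed set of class labels. In this toy data the rarer token "green" receives a larger TF-IDF weight than the more frequent "bond", which is the word-level importance the abstract refers to.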
