학술논문

Automatic Acquisition of Matching Patterns for Pattern-Based Parsing on Specific Chinese Text
Document Type
Conference
Source
2016 IEEE/WIC/ACM International Conference on Web Intelligence Workshops (WIW) WIW Web Intelligence Workshops (WIW), IEEE/WIC/ACM International Conference on. :17-20 Oct, 2016
Subject
Components, Circuits, Devices and Systems
Computing and Processing
Syntactics
Pattern matching
Measurement
Natural language processing
Lenses
Algorithm design and analysis
Education
Language
Abstract
As a generalized approach in natural language processing, pattern matching is seldom applied in syntactic parsing nowadays. In some applications on short text analysis such as microblog opinion mining, the sentences are characterized by obvious patterns. Thus automatic parsing by pattern matching may be more effective than general syntactic parsing method. This paper puts forward a lightweight algorithm of Matching Pattern (MP) acquisition to achieve the syntactic parsing on some specific Chinese text composed of short clauses. The key points of the algorithm are MP generation based on word/POS sequence and MP selection based on weight ranking of mapping between MP groups and sentence groups. Experiments show that this method performs well on Chinese corpus with the following features: (1) The sentences are mainly composed of short clauses, (2) Most of the clauses can be represented by limited patterns.