학술논문

Memory-Augmented Recurrent Neural Networks Can Learn Generalized Dyck Languages

Document Type

Working Paper

Author

Suzgun, Mirac; Gehrmann, Sebastian; Belinkov, Yonatan; Shieber, Stuart M.

Source

Subject

Computer Science - Computation and Language
Computer Science - Machine Learning
Computer Science - Neural and Evolutionary Computing

Language

Abstract

We introduce three memory-augmented Recurrent Neural Networks (MARNNs) and explore their capabilities on a series of simple language modeling tasks whose solutions require stack-based mechanisms. We provide the first demonstration of neural networks recognizing the generalized Dyck languages, which express the core of what it means to be a language with hierarchical structure. Our memory-augmented architectures are easy to train in an end-to-end fashion and can learn the Dyck languages over as many as six parenthesis-pairs, in addition to two deterministic palindrome languages and the string-reversal transduction task, by emulating pushdown automata. Our experiments highlight the increased modeling capacity of memory-augmented models over simple RNNs, while inflecting our understanding of the limitations of these models.

Online Access

Open Access (Arxiv) Find it@PNU

이메일

부산대학교 도서관

Online Access

메일 발송