학술논문

J-TOCC: Japanese Topic-Oriented Conversation Corpus / 『日本語話題別会話コーパス:J-TOCC』
Document Type
Journal Article
Source
計量国語学 / Mathematical Linguistics. 2021, 33(1):11
Subject
Degree of Familiarity
Education of Japanese Language
Grammatical Items
Speaking
Topic
Topic Syllabus
Vocabulary
文法項目
日本語教育
知悉度
話し言葉
話題
話題シラバス
語彙
Language
Japanese
ISSN
0453-4611
2433-0302
Abstract
We constructed the Japanese Topic-Oriented Conversation Corpus (J-TOCC) in order to study the influence of topics on the vocabulary, grammar, and discourse strategies in conversation. The main feature of this corpus is that the topics are fixed. University students were asked to engage in conversation of 15 topics, and each conversation was recorded for precisely 5 minutes. This means that conditions other than the topic are controlled. Eleven topics are related to daily life, and four topics are related to society. In total, 120 pairs participated, so 10 hours of conversation were recorded for each topic. J-TOCC contains about 1.6 million words in total. The pairs were balanced in terms of gender combination and recording site. In addition, speakers’ degrees of familiarity on each topic were surveyed and the data are attached to the corpus.