학술논문
Creating CorCenCC (Corpws Cenedlaethol Cymraeg Cyfoes - The National Corpus of Contemporary Welsh)
Document Type
TEXT
Author
Knight, Dawn; Fitzpatrick, Tess; Morris, Steve; Evas, Jeremy; Rayson, Paul; Spasić, Irena; Stonelake, Mark; Thomas, Enlli Môn; Neale, Steven; Needs, Jennifer; Piao, Scott; Rees, Mair; Watkins, Gareth; Anthony, Laurence; Cobb, Thomas Michael; Deuchar, Margaret; Donnelly, Kevin; McCarthy, Michael; Scannell, Kevin
Source
Subject
Language
English
Abstract
CorCenCC is an interdisciplinary and multiinstitutional project that is creating a large-scale, open-source corpus of contemporary Welsh. CorCenCC will be the first ever large-scale corpus to represent spoken, written and electronicallymediated Welsh (compiling an initial data set of 10 million Welsh words), with a functional design informed, from the outset, by representatives of all anticipated academic and community user groups.