NRC2011
Persistent Identifierauto
http://hdl.handle.net/21.11114/COLL-0000-000B-D237-A
Description0-1
Newspaper texts taken from printed and …
Newspaper texts taken from printed and and digital versions of the NRC newspaper (edition 2011). The texts cover blogs, hard news, background articles, opinion articles on related topics. Metadata per text are available in CMDI XML files.
The 'NRC2011' corpus has been created for the CLARIAH sponsored ACAD project.
See https://www.clariah.nl/projecten/research-pilots/acad/acad and https://cesar.science.ru.nl/.
Cooperators:
Micha Hulsbosch - Radboud University Nijmegen, Faculty of Arts, Humanities Lab, TSG
Wilbert Spooren - Radboud University Nijmegen, Faculty of arts, Dutch language
Erwin R. Komen - Radboud University Nijmegen, Faculty of Arts, Humanities Lab, TSG
The corpus contains 2225 newspaper texts taken from printed and and digital versions of the NRC newspaper (year 2011).
The texts cover blogs, hard news, background articles, opinion articles on related topics.
FLAT can be used to open and view the folia-files. See https://flat.science.ru.nl/
Metadata per article are available in CMDI XML files.
The File textlist-folia.json contains an overview of all available texts in json format.
The file NRCLicentieovereenkomst.pdf contains the License Agreement with NRC.
LandingPage1
https://easy.dans.knaw.nl/ui/datasets/id/easy-dataset:112988
Genre(s)0-n
newspaper-article
,
interviews
CLARIN centre0-1
Data Archiving and Networked Services (DANS)
Creator(s)0-n
Wilbert Spooren (Radboud University)
Project(s)0-n
ACAD site (Funder: CLARIAH)
Resource(s)1-n
Resource 1
Description0-1
Newspaper texts taken from printed and …
Newspaper texts taken from printed and and digital versions of the NRC newspaper (edition 2011). The texts cover blogs, hard news, background articles, opinion articles on related topics. Metadata per text are available in CMDI XML files.
Annotation0-n
[1]:
[posTagging] [automatic] [text/x-folia+xml],
[2]:
[lemmatization] [automatic] [text/x-folia+xml],
[3]:
[syntacticAnnotation] [automatic] [text/x-folia+xml]
Provenance(s)0-n
Provenance 1
Country0-1
Netherlands (the) NL
Accessibility0-1
Accessibility
Licence URL(s)0-n
https://creativecommons.org/licenses/by-nc-sa/4.0/
Non-commercial usage0-1
no
Website(s)0-n
https://cesar.science.ru.nl/
Contact(s)0-n
Wilbert Spooren: Radboud University, Faculty of, (w.spooren@let.ru.nl)
Editing is disabled, since you are not signed in