NRC2011

Persistent Identifierauto
http://hdl.handle.net/21.11114/COLL-0000-000B-D237-A
Description0-1
Newspaper texts taken from printed and and digital versions of the NRC newspaper (edition 2011). The texts cover blogs, hard news, background articles, opinion articles on related topics. Metadata per text are available in CMDI XML files. The 'NRC2011' corpus has been created for the CLARIAH sponsored ACAD project. See https://www.clariah.nl/projecten/research-pilots/acad/acad and https://cesar.science.ru.nl/. Cooperators: Micha Hulsbosch - Radboud University Nijmegen, Faculty of Arts, Humanities Lab, TSG Wilbert Spooren - Radboud University Nijmegen, Faculty of arts, Dutch language Erwin R. Komen - Radboud University Nijmegen, Faculty of Arts, Humanities Lab, TSG The corpus contains 2225 newspaper texts taken from printed and and digital versions of the NRC newspaper (year 2011). The texts cover blogs, hard news, background articles, opinion articles on related topics. FLAT can be used to open and view the folia-files. See https://flat.science.ru.nl/ Metadata per article are available in CMDI XML files. The File textlist-folia.json contains an overview of all available texts in json format. The file NRCLicentieovereenkomst.pdf contains the License Agreement with NRC.
LandingPage1
https://easy.dans.knaw.nl/ui/datasets/id/easy-dataset:112988
Title(s)1-n
NRC2011
Owner(s)0-n
Radboud University
Genre(s)0-n
newspaper-article , interviews
Language(s)1-n
Dutch (Northern) [nld]
CLARIN centre0-1
Data Archiving and Networked Services (DANS)
Version0-1
1.0
Size(s)0-n
230 MB (compressed)
Creator(s)0-n
Wilbert Spooren (Radboud University)
Project(s)0-n
ACAD site (Funder: CLARIAH)
Resource(s)1-n
Description0-1
Newspaper texts taken from printed and and digital versions of the NRC newspaper (edition 2011). The texts cover blogs, hard news, background articles, opinion articles on related topics. Metadata per text are available in CMDI XML files.
Dublin-Core Type1
Text
subtype0-1
-
Modality1-n
written
Social context0-n
public
Planning type0-n
planned
Interactivity0-n
non-interactive
Audience0-n
large
WC authors0-1
unknown
WC auth. demogr0-1
-
Size0-n
2225 texts
Annotation0-n
[1]: [posTagging] [automatic] [text/x-folia+xml],
[2]: [lemmatization] [automatic] [text/x-folia+xml],
[3]: [syntacticAnnotation] [automatic] [text/x-folia+xml]
Media0-n
text/folia
Provenance(s)0-n
Temporal0-1
2011-2011
Country0-1
Netherlands (the) NL
Linguality0-1
Type0-n
monolingual
Nativeness0-n
native
AgeGroup0-n
adult
Status0-n
normal
Variant0-n
standard
Accessibility0-1
Name1
CC-BY-NC-SA
Availability0-n
academic
License name(s)0-n
CC-BY-NC-SA
Licence URL(s)0-n
https://creativecommons.org/licenses/by-nc-sa/4.0/
Non-commercial usage0-1
no
Website(s)0-n
https://cesar.science.ru.nl/
ISBN0-1
-
ISLRN0-1
-
Contact(s)0-n
Wilbert Spooren: Radboud University, Faculty of, (w.spooren@let.ru.nl)
Medium(s)0-n
internet
Documentation0-1
Language(s)1-n
English [eng]
Type(s)0-n
readme file
File(s)0-n
README.txt
Validation0-1
not specified
 
Editing is disabled, since you are not signed in