Whatsapp corpus Berntzen

Persistent Identifierauto
http://hdl.handle.net/21.11114/COLL-0000-000B-D239-8
Description0-1
Whatsapp conversations collected by master students Communication & Information Studies (2013-2014; 2014-2015). All participants in the conversations are over 18 and have signed consent forms. Metadata per conversation are available in CMDI XML files. The corpus has been made available for the CLARIAH sponsored ACAD project. The 'WhatsAppManon' corpus has been made available for the CLARIAH sponsored ACAD project. See https://www.clariah.nl/projecten/research-pilots/acad/acad and https://cesar.science.ru.nl/. Cooperators: Micha Hulsbosch - Radboud University Nijmegen, Faculty of Arts, Humanities Lab, TSG Wilbert Spooren - Radboud University Nijmegen, Faculty of arts, Dutch language Erwin R. Komen - Radboud University Nijmegen, Faculty of Arts, Humanities Lab, TSG Patrick Sonsma - Radboud University Nijmegen, Faculty of arts, Dutch language Original researcher: Manon Berntzen - Radboud University Nijmegen, Faculty of arts, Dutch language The corpus contains 60 WhatsApp chat sessions that have been collected by Manon Berntzen for the course on "New media-new methods" and then for her Bachelor thesis. The exact date of each chat is included in the <event> tag attributes in the .folia.xml files. FLAT can be used to open and view the folia-files. See https://flat.science.ru.nl/ The participants have all indicated that their chats can be used (in an anonymized form) for research purposes. Metadata per chat are available in CMDI XML files. The File textlist-folia.json contains an overview of all available texts in json format. Note: the files are numbered 001-063 consecutively, but 058-060 (as well as 064) are excluded, because they lack permission.
LandingPage1
https://easy.dans.knaw.nl/ui/datasets/id/easy-dataset:112986
Title(s)1-n
[1]: Whatsapp corpus Berntzen,
[2]: Whatsapp corpus Manon Berntzen
Owner(s)0-n
Radboud University
Genre(s)0-n
social-media-texts
Language(s)1-n
Dutch (Northern) [nld]
CLARIN centre0-1
Data Archiving and Networked Services (DANS)
Version0-1
1.0
Size(s)0-n
70 MB (compressed)
Creator(s)0-n
Wilbert Spooren-Manon Berntzen-Erwin Komen-Micha Hulsbosch (Radboud University)
Project(s)0-n
ACAD site (Funder: CLARIAH)
Resource(s)1-n
Description0-1
Whatsappdata collected for the PhD research of Lieke Verheijen (Radboud University). Informed consent only from contributor and not from conversational partner. Consequently, the subcorpus only contains contributions from the submitter. Metadata per conversation are available in CMDI XML files. Ref: Verheijen, L., & Stoop, W. (2016, September). Collecting facebook posts and whatsapp chats. In International Conference on Text, Speech, and Dialogue (pp. 249-258). Springer, Cham.
Dublin-Core Type1
Text
subtype0-1
-
Modality1-n
written
Social context0-n
private
Planning type0-n
spontaneous
Interactivity0-n
interactive
Involvement0-n
elicited
Audience0-n
small
WC authors0-1
unknown
WC auth. demogr0-1
-
Size0-n
218 texts
Annotation0-n
[1]: [posTagging] [automatic] [text/x-folia+xml],
[2]: [lemmatization] [automatic] [text/x-folia+xml],
[3]: [syntacticAnnotation] [automatic] [text/x-folia+xml]
Media0-n
text/folia
Provenance(s)0-n
Temporal0-1
2013-2015
Country0-1
Netherlands (the) NL
Linguality0-1
Type0-n
monolingual
Nativeness0-n
native
AgeGroup0-n
adult
Status0-n
normal
Variant0-n
standard
Accessibility0-1
Name1
CC-BY-NC-SA
Availability0-n
academic
License name(s)0-n
CC BY-NC-SA
Licence URL(s)0-n
https://creativecommons.org/licenses/by-nc-sa/4.0/
Non-commercial usage0-1
None
Website(s)0-n
https://cesar.science.ru.nl
ISBN0-1
-
ISLRN0-1
-
Contact(s)0-n
Wilbert Spooren: Radboud University, Faculty of, (w.spooren@let.ru.nl)
Medium(s)0-n
internet
Documentation0-1
not specified
Validation0-1
Type0-1
unknown
Method(s)0-n
unknown
 
Editing is disabled, since you are not signed in