Whatsapp corpus Berntzen
Persistent Identifierauto
http://hdl.handle.net/21.11114/COLL-0000-000B-D239-8
Description0-1
WhatsApp conversations collected by master's students in Communication & Information Studies (2013-2014; 2014-2015). All participants in the conversations are over 18 and have signed consent forms. Metadata per conversation are available in CMDI XML files.
The 'WhatsAppManon' corpus has been made available for the CLARIAH-sponsored ACAD project.
See https://www.clariah.nl/projecten/research-pilots/acad/acad and https://cesar.science.ru.nl/.
Cooperators:
Micha Hulsbosch - Radboud University Nijmegen, Faculty of Arts, Humanities Lab, TSG
Wilbert Spooren - Radboud University Nijmegen, Faculty of Arts, Dutch language
Erwin R. Komen - Radboud University Nijmegen, Faculty of Arts, Humanities Lab, TSG
Patrick Sonsma - Radboud University Nijmegen, Faculty of Arts, Dutch language
Original researcher:
Manon Berntzen - Radboud University Nijmegen, Faculty of Arts, Dutch language
The corpus contains 60 WhatsApp chat sessions collected by Manon Berntzen, first for the course "New media-new methods" and then for her Bachelor's thesis.
The exact date of each chat is included in the <event> tag attributes in the .folia.xml files.
FLAT can be used to open and view the FoLiA files. See https://flat.science.ru.nl/
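For readers who prefer to inspect the chat dates programmatically rather than in FLAT, the following is a minimal Python sketch that collects the attributes of every <event> element in a FoLiA document. It is an illustration, not part of the corpus tooling: the function name event_attributes, the sample document, and the attribute names class and begindatetime in that sample are assumptions, so check the actual .folia.xml files for the attributes they carry.

```python
import xml.etree.ElementTree as ET


def event_attributes(folia_xml: str) -> list[dict[str, str]]:
    """Return the attribute dict of every <event> element in an XML string."""
    root = ET.fromstring(folia_xml)
    events = []
    for elem in root.iter():
        # Drop any "{namespace}" prefix so the match works with or without one.
        local_name = elem.tag.rsplit("}", 1)[-1]
        if local_name == "event":
            events.append(dict(elem.attrib))
    return events


# Hypothetical, heavily simplified sample; NOT taken from the corpus.
# The attribute names used here are assumptions for illustration only.
SAMPLE = """<FoLiA xmlns="http://ilk.uvt.nl/folia">
  <text>
    <event class="message" begindatetime="2014-03-01T12:00:00"/>
  </text>
</FoLiA>"""

if __name__ == "__main__":
    for attrs in event_attributes(SAMPLE):
        print(attrs)
```

Because the function returns all attributes as found, it makes no assumption about which attribute holds the date; that choice is left to the reader after looking at a real file.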
The participants have all indicated that their chats can be used (in an anonymized form) for research purposes.
Metadata per chat are available in CMDI XML files.
The file textlist-folia.json contains an overview of all available texts in JSON format.
Note: the files are numbered consecutively from 001 to 063, but 058-060 (as well as 064) are excluded because they lack permission.
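The numbering note can be checked with a short Python sketch. It only enumerates the three-digit numbers described above; the names EXCLUDED and included_numbers are hypothetical, and the sketch does not assume anything about the actual file names beyond the number.

```python
# Numbers 058-060 are excluded because permission is lacking (see the note).
EXCLUDED = set(range(58, 61))


def included_numbers(first: int = 1, last: int = 63) -> list[str]:
    """Return the zero-padded chat numbers that should be present."""
    return [f"{n:03d}" for n in range(first, last + 1) if n not in EXCLUDED]


if __name__ == "__main__":
    numbers = included_numbers()
    print(len(numbers))  # 60, matching the 60 chat sessions in the corpus
    print(numbers[0], numbers[-1])
```

The count of 60 agrees with the number of chat sessions stated in the description.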
LandingPage1
https://easy.dans.knaw.nl/ui/datasets/id/easy-dataset:112986
Title(s)1-n
[1]:
Whatsapp corpus Berntzen,
[2]:
Whatsapp corpus Manon Berntzen
CLARIN centre0-1
Data Archiving and Networked Services (DANS)
Creator(s)0-n
Wilbert Spooren, Manon Berntzen, Erwin Komen, Micha Hulsbosch (Radboud University)
Project(s)0-n
ACAD site (Funder: CLARIAH)
Resource(s)1-n
Resource 1
Description0-1
WhatsApp data collected for the PhD research of Lieke Verheijen (Radboud University). Informed consent was obtained only from the contributor and not from the conversational partner. Consequently, the subcorpus contains only contributions from the submitter. Metadata per conversation are available in CMDI XML files. Ref: Verheijen, L., & Stoop, W. (2016, September). Collecting Facebook posts and WhatsApp chats. In International Conference on Text, Speech, and Dialogue (pp. 249-258). Springer, Cham.
Annotation0-n
[1]:
[posTagging] [automatic] [text/x-folia+xml],
[2]:
[lemmatization] [automatic] [text/x-folia+xml],
[3]:
[syntacticAnnotation] [automatic] [text/x-folia+xml]
Provenance(s)0-n
Provenance 1
Country0-1
Netherlands (the) NL
Accessibility0-1
Accessibility
Licence URL(s)0-n
https://creativecommons.org/licenses/by-nc-sa/4.0/
Non-commercial usage0-1
None
Website(s)0-n
https://cesar.science.ru.nl
Contact(s)0-n
Wilbert Spooren: Radboud University, Faculty of, (w.spooren@let.ru.nl)