Whatsapp corpus Verheijen

Persistent Identifierauto
http://hdl.handle.net/21.11114/COLL-0000-000B-D238-9
Description0-1
Whatsappdata collected for the PhD research of Lieke Verheijen (Radboud University). Informed consent only from contributor and not from conversational partner. Consequently, the subcorpus only contains contributions from the submitter. Metadata per conversation are available in CMDI XML files. Ref: Verheijen, L., & Stoop, W. (2016, September). Collecting facebook posts and whatsapp chats. In International Conference on Text, Speech, and Dialogue (pp. 249-258). Springer, Cham. The corpus has been made available for the CLARIAH sponsored ACAD project. See https://www.clariah.nl/projecten/research-pilots/acad/acad and https://cesar.science.ru.nl/. Cooperators: Micha Hulsbosch - Radboud University Nijmegen, Faculty of Arts, Humanities Lab, TSG Wilbert Spooren - Radboud University Nijmegen, Faculty of arts, Dutch language Erwin R. Komen - Radboud University Nijmegen, Faculty of Arts, Humanities Lab, TSG Patrick Sonsma - Radboud University Nijmegen, Faculty of arts, Dutch language Original researcher: Lieke Verheijen - Radboud University Nijmegen, Faculty of arts, Dutch language The corpus contains 218 WhatsApp chat sessions that have been collected by Lieke Verheijen in 2012-2014 in the Netherlands. The exact date of each chat is included in the <event> tag attributes in the .folia.xml files. FLAT can be used to open and view the folia-files. See https://flat.science.ru.nl/ The participants have all indicated that their chats can be used (in an anonymized form) for research purposes. Metadata per chat are available in CMDI XML files. The File textlist-folia.json contains an overview of all available texts in json format.
LandingPage1
https://easy.dans.knaw.nl/ui/datasets/id/easy-dataset:112987
Title(s)1-n
[1]: Whatsapp corpus Verheijen,
[2]: Whatsapp corpus Lieke Verheijen
Owner(s)0-n
Radboud University
Genre(s)0-n
social-media-texts
Language(s)1-n
Dutch (Northern) [nld]
CLARIN centre0-1
Data Archiving and Networked Services (DANS)
Version0-1
1.0
Size(s)0-n
70 MB (compressed)
Creator(s)0-n
Lieke Verheijen-Wessel Stoop-Wilbert Spooren-Erwin Komen-Micha Hulsbosch (Radboud University)
Project(s)0-n
ACAD site (Funder: CLARIAH)
Resource(s)1-n
Description0-1
Whatsappdata collected for the PhD research of Lieke Verheijen (Radboud University). Informed consent only from contributor and not from conversational partner. Consequently, the subcorpus only contains contributions from the submitter. Metadata per conversation are available in CMDI XML files. Ref: Verheijen, L., & Stoop, W. (2016, September). Collecting facebook posts and whatsapp chats. In International Conference on Text, Speech, and Dialogue (pp. 249-258). Springer, Cham.
Dublin-Core Type1
Text
subtype0-1
-
Modality1-n
written
Social context0-n
private
Planning type0-n
spontaneous
Interactivity0-n
interactive
Involvement0-n
elicited
Audience0-n
small
WC authors0-1
unknown
WC auth. demogr0-1
-
Size0-n
218 texts
Annotation0-n
[1]: [posTagging] [automatic] [text/x-folia+xml],
[2]: [lemmatization] [automatic] [text/x-folia+xml],
[3]: [syntacticAnnotation] [automatic] [text/x-folia+xml]
Media0-n
text/folia
Provenance(s)0-n
Temporal0-1
2013-2015
Country0-1
Netherlands (the) NL
Linguality0-1
Type0-n
monolingual
Nativeness0-n
native
AgeGroup0-n
adult , child
Status0-n
normal
Variant0-n
standard
Accessibility0-1
Name1
CC-BY-NC-SA
Availability0-n
academic
License name(s)0-n
CC BY-NC-SA
Licence URL(s)0-n
https://creativecommons.org/licenses/by-nc-sa/4.0/
Non-commercial usage0-1
None
Website(s)0-n
https://cesar.science.ru.nl
ISBN0-1
-
ISLRN0-1
-
Contact(s)0-n
Wilbert Spooren: Radboud University, Faculty of, (w.spooren@let.ru.nl)
Medium(s)0-n
internet
Documentation0-1
not specified
Validation0-1
Type0-1
unknown
Method(s)0-n
unknown
 
Editing is disabled, since you are not signed in