D-LUCEA
Persistent Identifierauto
http://hdl.handle.net/21.11114/COLL-0000-000B-CAA7-5
Description0-1
The LUCEA corpus (Longitudinal Universi…
The LUCEA corpus (Longitudinal University College utrecht Corpus of English Accents) was collected to study this type of phonetic convergence in a multilingual environment. Students and teachers at University College Utrecht (UCU) come from various countries and native languages, yet they all use English as the lingua franca on campus. Hence, phonetic convergence may result in a unique international version of English, influenced by the speakers’ native languages and accents.
The corpus now contains data from about 850 interviews from 282 unique students. Each interview contains about 20 minutes of speech. The speech corpus is augmented with participants’ responses from entry and exit questionnaires, and supplementary data about the participants and about each recording. When finished in 2016, the total corpus will contain about 3 TB (about 3000 GB) of audio data.
LandingPage1
https://hdl.handle.net/1839/00-C3AD1CEE-985D-42E5-8528-730774C187C1@view
Title(s)1-n
[1]:
D-LUCEA,
[2]:
the LUCEA corpus,
[3]:
the LUCEA database,
[4]:
Database of the Longitudinal Utrecht Collection of English Accents
Owner(s)0-n
University College Utrecht
Genre(s)0-n
interviews
,
conversation
,
academic-nonfiction
,
prompted speech
,
academic-nonfiction
,
academic-nonfiction
,
fiction
Domain(s)0-n
The database is of interest for research and development in linguistics, language education (pronunciation training), speech technology (foreign accent detection, language recognition, speech recognition), and sociophonetics.
CLARIN centre0-1
MPI for Psycholinguistics
Persistent identifier(s)0-n
https://hdl.handle.net/1839/00-58F6586A-55F4-4B45-8341-6E2F7FF0668C
Size(s)0-n
282 stud.
,
850 intrv
,
3000 GB
Creator(s)0-n
Dr Hugo Quené (Max Planck Institute for Psycholinguistics, Nijmeg)
Project(s)0-n
UCU Accent site (Funder: University College Utrecht-Utrecht Institute of Linguistics OTS-CLARIN-NL)
Resource(s)1-n
Resource 1
Description0-1
The LUCEA database is a database of exi…
The LUCEA database is a database of existing speech recordings of L1 and L2 speakers of English. The recorded speakers are students from an international student community where English is used as lingua franca. These students are being recorded longitudinally throughout their 3-year period on campus, using read and spontaneous speech in L1 and in L2 English (or in L1 English only). The database is of interest for research and development in linguistics, language education (pronunciation training), speech technology (foreign accent detection, language recognition, speech recognition), and sociophonetics.
The corpus now contains data from about 850 interviews from 282 unique students. Each interview contains about 20 minutes of speech. The speech corpus is augmented with participants’ responses from entry and exit questionnaires, and supplementary data about the participants and about each recording. When finished in 2016, the total corpus will contain about 3 TB (about 3000 GB) of audio data.
Recording condition0-n
8 microphones were used
,
Microphone 1 is a close-talking headset microphone, 30 cm in front of speaker
Channel0-n
experimental-setting
,
face-to-face
SC duration speech0-1
20 mins per recording
SC duration full0-1
unknown
SC sp. demogr0-1
- 70% female, 30% male
Annotation0-n
[orthographicTranscription] [unknown] [other]
Media0-n
audio/x-wav, text/xml
Provenance(s)0-n
Provenance 1
Country0-1
Netherlands (the) NL
Accessibility0-1
Accessibility
Non-commercial usage0-1
yes
Website(s)0-n
http://lucea.wp.hum.uu.nl/summary/
Contact(s)0-n
Dr Hugo Quené: Utrecht inst of Linguistics OT, (H.Quene@uu.nl)
,
Dr Rosemary Orr: University College Utrecht and, (r.orr@uu.nl)
Documentation0-1
Documentation
URL(s)0-n
https://portal.clarin.nl/node/4183
,
http://lucea.wp.hum.uu.nl/summary/
Editing is disabled, since you are not signed in