IFA speech

Persistent Identifierauto
http://hdl.handle.net/21.11114/COLL-0000-000B-CAB1-9
Description0-1
The IFA Spoken Language corpus is a free (GPL) database of hand-segmented Dutch speech. It was constructed with off-the-shelf software using speech from 8 speakers in a variety of speaking styles. For a total of 50,000 words (41 minutes/speaker), speech acquisition and preparation took around 3 person-weeks per speaker. Hand segmentation took 1,000 hours of labeling altogether. The asymptotic segmentation speed was about one word, or four boundaries, per minute.
LandingPage1
https://hdl.handle.net/1839/00-0000-0000-0003-46DA-E@view
Title(s)1-n
[1]: IFA Corpus,
[2]: IFA speech corpus,
[3]: IFA Spoken Language Corpus
Owner(s)0-n
The Dutch Language Union
Genre(s)0-n
conversation
Language disorder(s)0-n
none
Domain(s)0-n
speaking styles
Language(s)1-n
Dutch (Northern) [nld]
CLARIN centre0-1
the dutch language union
Persistent identifier(s)0-n
https://hdl.handle.net/1839/00-0000-0000-0003-46DA-E
Version0-1
1.0
Size(s)0-n
50000 words
Creator(s)0-n
R.J.J.H. van Son (The Dutch Language Union)
Project(s)0-n
IFA site (Funder: Netherlands Organisation for Scientific Research / NWO)
Resource(s)1-n
Description0-1
IFA speech database The IFA Spoken Language corpus is a free (GPL) database of hand-segmented Dutch speech. It was constructed with off-the-shelf software using speech from 8 speakers in a variety of speaking styles. For a total of 50,000 words (41 minutes/speaker), speech acquisition and preparation took around 3 person-weeks per speaker. Hand segmentation took 1,000 hours of labeling altogether. The asymptotic segmentation speed was about one word, or four boundaries, per minute.
Dublin-Core Type1
Sound
subtype0-1
speech
Modality1-n
speech
Recording environment0-n
unknown
Recording condition0-n
Medium= audio cd microphone= Sennheiser MKH 105 HF condenser and Shure SM10A dynamic noise= unspecified digitisation.recording= 44.1 kHz, 16 bit linear digitisation.
Channel0-n
face-to-face
Social context0-n
unknown
Planning type0-n
semi-spontaneous
Interactivity0-n
non-interactive
Involvement0-n
non-elicited
Audience0-n
no
SC duration speech0-1
41 minutes per speaker
SC duration full0-1
330 minutes
SC speakers0-1
8
SC sp. demogr0-1
4 female; 4 male; 15-66 age
Size0-n
41 minutes per speaker
Annotation0-n
[1]: [orthographicTranscription] [automatic] [text/praat-textgrid],
[2]: [lemmatization] [automatic] [text/praat-textgrid],
[3]: [posTagging] [automatic] [text/praat-textgrid],
[4]: [transliteration] [automatic] [text/praat-textgrid]
Media0-n
audio/x-wav
Provenance(s)0-n
Temporal0-1
2008-2008
Country0-1
Netherlands (the) NL
Linguality0-1
Type0-n
monolingual
Nativeness0-n
native
AgeGroup0-n
adult
Status0-n
normal
Variant0-n
standard
MultiType0-n
unknown
Accessibility0-1
Name1
IFA speech
Availability0-n
public
License name(s)0-n
GNU General Public License
Licence URL(s)0-n
http://www.gnu.org/copyleft/gpl.html
Non-commercial usage0-1
yes
Website(s)0-n
http://www.fon.hum.uva.nl/IFA-SpokenLanguageCorpora/IFAcorpus/
ISBN0-1
-
ISLRN0-1
-
Contact(s)0-n
R.J.J.H. van Son: Institute of Phonetic Sciences, (Rob.van.Son@hum.uva.nl)
Medium(s)0-n
internet
Documentation0-1
Language(s)1-n
English [eng]
Type(s)0-n
manual , website
URL(s)0-n
http://www.fon.hum.uva.nl/IFA-SpokenLanguageCorpora/IFAcorpus/
Validation0-1
Type0-1
unknown
Method(s)0-n
manual
 
Editing is disabled, since you are not signed in