FAME Radio Broadcast Corpus

Persistent Identifierauto
http://hdl.handle.net/21.11114/COLL-0000-000B-D20F-8
Description0-1
A large broadcast database is created by collecting recordings from the archives of the regional broadcaster Omrop Fryslân, and annotating them with various information such as the language switches and speaker details. The collection comprises over 3000 hours and the transcription and speaker annotation have been performed automatically by the speech and speaker recognition technology developed in the NWO FAME! project. Metadata provided on the paper labels of the original audio tapes were digitized by Fryske Hannen under supervision of Omrop Fryslân and Tresoar. The stereo audio data has a sampling frequency of 48 kHz and 16-bit resolution per sample. Transcriptions with time alignments are provided as CTM files. Speaker information is provided in RTTM files.
LandingPage1
https://fame.frl
Title(s)1-n
FAME! Radio Broadcast Corpus
Owner(s)0-n
[1]: Omrop Fryslân,
[2]: Tresoar,
[3]: Radboud University
Genre(s)0-n
radio/TV-broadcast
Language disorder(s)0-n
none
Language(s)1-n
Western Frisian [fry] , Dutch (Northern) [nld]
CLARIN centre0-1
CLST
Version0-1
1.0
Size(s)0-n
3000 hours
Relation(s)0-n
[FAME Radio Broadcast Corpus] isSiblingOf [FAME Speech Corpus]
Creator(s)0-n
[1]: Frederik Kampstra (Omrop Fryslan),
[2]: (),
[3]: Emre Yilmaz-Henk van den Heuvel-David van Leeuwen (CLST, Radboud University)
Project(s)0-n
FAME! site (Funder: NWO Creative Industry)
Resource(s)1-n
Description0-1
A large broadcast database is created by collecting recordings from the archives of the regional broadcaster Omrop Fryslân, and annotating them with various information such as the language switches and speaker details. The collection comprises over 3000 hours and the transcription and speaker annotation have been performed automatically by the speech and speaker recognition technology developed in the NWO FAME! project. Metadata provided on the paper labels of the original audio tapes were digitized by Fryske Hannen under supervision of Omrop Fryslân and Tresoar. The stereo audio data has a sampling frequency of 16 kHz and 16-bit resolution per sample. Transcriptions with time alignments are provided as CTM files. Speaker information is provided in RTTM files. The FAME! Speech Corpus was used to train the speech and speaker recognisers used for transcribing and annotating the corpus.
Dublin-Core Type1
Sound
subtype0-1
speech
Modality1-n
speech , transcribed
Recording environment0-n
home/office , studio , public-place
Channel0-n
broadcasting
Social context0-n
public
Planning type0-n
semi-spontaneous , spontaneous
Interactivity0-n
interactive , semi-interactive
Involvement0-n
unknown
Audience0-n
large
SC duration speech0-1
unknown
SC duration full0-1
over 3000 hours
SC speakers0-1
unknown
SC sp. demogr0-1
-
Size0-n
3000 hours
Annotation0-n
[1]: [orthographicTranscription] [automatic] [text/plain],
[2]: [alignment] [automatic] [text/plain],
[3]: [speakerIdentification] [automatic] [text/plain]
Media0-n
audio/x-wav
Provenance(s)0-n
Temporal0-1
1966-2015
Cities0-n
Frisia
Country0-1
Netherlands (the) NL
Linguality0-1
Status0-n
normal
Variant0-n
dialect , standard
MultiType0-n
codeSwitching
Accessibility0-1
Name1
FAME Radio
Availability0-n
academic , restricted
License name(s)0-n
Upon request
Non-commercial usage0-1
yes
ISBN0-1
-
ISLRN0-1
-
Contact(s)0-n
Henk van den Heuvel: CLST, Radboud Univbersity, Nij, (clst@let.ru.nl) , Frederik Kampstra: Omrop Fryslân, (frederik.kampstra@omropfryslan.nl)
Medium(s)0-n
internet
Documentation0-1
Language(s)1-n
English [eng]
Type(s)0-n
readme file
Validation0-1
not specified
 
Editing is disabled, since you are not signed in