FAME Radio Broadcast Corpus
Persistent Identifierauto
http://hdl.handle.net/21.11114/COLL-0000-000B-D20F-8
Description0-1
A large broadcast database is created b…
A large broadcast database is created by collecting recordings from the archives of the regional broadcaster Omrop Fryslân, and annotating them with various information such as the language switches and speaker details. The collection comprises over 3000 hours and the transcription and speaker annotation have been performed automatically by the speech and speaker recognition technology developed in the NWO FAME! project.
Metadata provided on the paper labels of the original audio tapes were digitized by Fryske Hannen under supervision of Omrop Fryslân and Tresoar.
The stereo audio data has a sampling frequency of 48 kHz and 16-bit resolution per sample.
Transcriptions with time alignments are provided as CTM files.
Speaker information is provided in RTTM files.
LandingPage1
https://fame.frl
Title(s)1-n
FAME! Radio Broadcast Corpus
Owner(s)0-n
[1]:
Omrop Fryslân,
[2]:
Tresoar,
[3]:
Radboud University
Language(s)1-n
Western Frisian [fry]
,
Dutch (Northern) [nld]
Relation(s)0-n
[FAME Radio Broadcast Corpus] isSiblingOf [FAME Speech Corpus]
Creator(s)0-n
[1]:
Frederik Kampstra (Omrop Fryslan),
[2]:
(),
[3]:
Emre Yilmaz-Henk van den Heuvel-David van Leeuwen (CLST, Radboud University)
Project(s)0-n
FAME! site (Funder: NWO Creative Industry)
Resource(s)1-n
Resource 1
Description0-1
A large broadcast database is created b…
A large broadcast database is created by collecting recordings from the archives of the regional broadcaster Omrop Fryslân, and annotating them with various information such as the language switches and speaker details. The collection comprises over 3000 hours and the transcription and speaker annotation have been performed automatically by the speech and speaker recognition technology developed in the NWO FAME! project.
Metadata provided on the paper labels of the original audio tapes were digitized by Fryske Hannen under supervision of Omrop Fryslân and Tresoar.
The stereo audio data has a sampling frequency of 16 kHz and 16-bit resolution per sample.
Transcriptions with time alignments are provided as CTM files.
Speaker information is provided in RTTM files.
The FAME! Speech Corpus was used to train the speech and speaker recognisers used for transcribing and annotating the corpus.
Recording environment0-n
home/office
,
studio
,
public-place
Planning type0-n
semi-spontaneous
,
spontaneous
Interactivity0-n
interactive
,
semi-interactive
SC duration speech0-1
unknown
SC duration full0-1
over 3000 hours
Annotation0-n
[1]:
[orthographicTranscription] [automatic] [text/plain],
[2]:
[alignment] [automatic] [text/plain],
[3]:
[speakerIdentification] [automatic] [text/plain]
Provenance(s)0-n
Provenance 1
Country0-1
Netherlands (the) NL
Accessibility0-1
Accessibility
Non-commercial usage0-1
yes
Contact(s)0-n
Henk van den Heuvel: CLST, Radboud Univbersity, Nij, (clst@let.ru.nl)
,
Frederik Kampstra: Omrop Fryslân, (frederik.kampstra@omropfryslan.nl)
Editing is disabled, since you are not signed in