FAME Radio Broadcast Corpus
  
  
    
    
      
        Persistent Identifierauto
        http://hdl.handle.net/21.11114/COLL-0000-000B-D20F-8
       
    
    
    
      Description0-1
      
        
          A large broadcast database is created b…
          
            A large broadcast database is created by collecting recordings from the archives of the regional broadcaster Omrop Fryslân, and annotating them with various information such as the language switches and speaker details. The collection comprises over 3000 hours and the transcription and speaker annotation have been performed automatically by the speech and speaker recognition technology developed in the NWO FAME! project.
Metadata provided on the paper labels of the original audio tapes were digitized by Fryske Hannen under supervision of Omrop Fryslân and Tresoar. 
 
The stereo audio data has a sampling frequency of 48 kHz and 16-bit resolution per sample. 
Transcriptions with time alignments are provided as CTM files.
Speaker information is provided in RTTM files.
          
         
       
     
    
    
      
  
    LandingPage1
    
      
        https://fame.frl
      
    
   
    
      
  
    Title(s)1-n
    
      
        
        
          
          FAME! Radio Broadcast Corpus
        
        
      
     
   
    
      
  
    Owner(s)0-n
    
      
        
        
          [1]: 
          Omrop Fryslân, 
        
        
        
          [2]: 
          Tresoar, 
        
        
        
          [3]: 
          Radboud University
        
        
      
     
   
    
      
  
    
      
  
    
      
  
    Language(s)1-n
    
      
        
        
          Western Frisian [fry]
          , 
        
          Dutch (Northern) [nld]
          
        
        
      
     
   
    
      
  
    
      
  
    
      
  
    
      
  
    Relation(s)0-n
    
      
        
        
          
          [FAME Radio Broadcast Corpus] isSiblingOf [FAME Speech Corpus]
        
        
      
     
   
    
      
  
    Creator(s)0-n
    
      
        
        
          [1]: 
          Frederik Kampstra (Omrop Fryslan), 
        
        
        
          [2]: 
           (), 
        
        
        
          [3]: 
          Emre Yilmaz-Henk van den Heuvel-David van Leeuwen (CLST, Radboud University)
        
        
      
     
   
    
      
  
    Project(s)0-n
    
      
        
        
          
          FAME! site (Funder: NWO Creative Industry)
         
        
      
     
   
    
    
    
      Resource(s)1-n
      
        
          
          
            Resource 1
            
              
                  
                  
                    Description0-1
                    
                      
                        A large broadcast database is created b…
                        
                          A large broadcast database is created by collecting recordings from the archives of the regional broadcaster Omrop Fryslân, and annotating them with various information such as the language switches and speaker details. The collection comprises over 3000 hours and the transcription and speaker annotation have been performed automatically by the speech and speaker recognition technology developed in the NWO FAME! project.
Metadata provided on the paper labels of the original audio tapes were digitized by Fryske Hannen under supervision of Omrop Fryslân and Tresoar. 
 
The stereo audio data has a sampling frequency of 16 kHz and 16-bit resolution per sample. 
Transcriptions with time alignments are provided as CTM files.
Speaker information is provided in RTTM files.
The FAME! Speech Corpus was used to train the speech and speaker recognisers used for transcribing and annotating the corpus.
                        
                       
                     
                   
                  
                    
  
                  
                    
  
                  
                    
  
                  
                    
  
    Recording environment0-n
    
      
        
        
          home/office
          , 
        
          studio
          , 
        
          public-place
          
        
        
      
     
   
                  
                    
  
                  
                    
  
                  
                    
  
    Planning type0-n
    
      
        
        
          semi-spontaneous
          , 
        
          spontaneous
          
        
        
      
     
   
                  
                    
  
    Interactivity0-n
    
      
        
        
          interactive
          , 
        
          semi-interactive
          
        
        
      
     
   
                  
                    
  
                  
                    
  
                  
                    
  
    SC duration speech0-1
    
      
        unknown
      
    
   
                  
                    
  
    SC duration full0-1
    
      
        over 3000 hours
      
    
   
                  
                    
  
                  
                    
  
                  
                    
  
                  
                    
  
    Annotation0-n
    
      
        
        
          [1]: 
          [orthographicTranscription] [automatic] [text/plain], 
        
        
        
          [2]: 
          [alignment] [automatic] [text/plain], 
        
        
        
          [3]: 
          [speakerIdentification] [automatic] [text/plain]
        
        
      
     
   
                  
                    
  
                  
               
             
           
          
        
       
     
    
    
      Provenance(s)0-n
      
        
          
            Provenance 1
            
              
                  
                    
  
                  
                    
  
                  
                    
  
    Country0-1
    
      
        Netherlands (the) NL
      
    
   
                  
               
             
           
        
       
     
    
    
    
    
      Accessibility0-1
      
        
          Accessibility
          
            
              
                
                  
  
                
                  
  
                
                  
  
                
                  
  
    Non-commercial usage0-1
    
      
        yes
      
    
   
                
                  
  
                
                  
  
                
                  
  
    Contact(s)0-n
    
      
        
        
          Henk van den Heuvel: CLST, Radboud Univbersity, Nij, (clst@let.ru.nl)
          , 
        
          Frederik Kampstra: Omrop Fryslân, (frederik.kampstra@omropfryslan.nl)
          
        
        
      
     
   
                
                  
  
                
              
             
           
         
       
     
    
    
    
    
     
   
  
  
    
  
    Editing is disabled, since you are not signed in