IFA speech
  
  
    
    
      
        Persistent Identifierauto
        http://hdl.handle.net/21.11114/COLL-0000-000B-CAB1-9
       
    
    
    
      Description0-1
      
        
          The IFA Spoken Language corpus is a fre…
          
            The IFA Spoken Language corpus is a free (GPL) database of hand-segmented Dutch speech. It was constructed with off-the-shelf software using speech from 8 speakers in a variety of speaking styles.  For a total of 50,000 words (41 minutes/speaker), speech acquisition and preparation took around 3 person-weeks per speaker. Hand segmentation took 1,000 hours of labeling altogether. The asymptotic segmentation speed was about one word, or four boundaries, per minute.
          
         
       
     
    
    
      
  
    LandingPage1
    
      
        https://hdl.handle.net/1839/00-0000-0000-0003-46DA-E@view
      
    
   
    
      
  
    Title(s)1-n
    
      
        
        
          [1]: 
          IFA Corpus, 
        
        
        
          [2]: 
          IFA speech corpus, 
        
        
        
          [3]: 
          IFA Spoken Language Corpus
        
        
      
     
   
    
      
  
    
      
  
    
      
  
    
      
  
    
      
  
    
      
  
    CLARIN centre0-1
    
      
        the dutch language union
      
    
   
    
      
  
    Persistent identifier(s)0-n
    
      
        
        
          https://hdl.handle.net/1839/00-0000-0000-0003-46DA-E
          
        
        
      
     
   
    
      
  
    
      
  
    
      
  
    Creator(s)0-n
    
      
        
        
          
          R.J.J.H. van Son (The Dutch Language Union)
        
        
      
     
   
    
      
  
    Project(s)0-n
    
      
        
        
          
          IFA site (Funder: Netherlands Organisation for Scientific Research / NWO)
         
        
      
     
   
    
    
    
      Resource(s)1-n
      
        
          
          
            Resource 1
            
              
                  
                  
                    Description0-1
                    
                      
                        IFA speech database
The IFA Spoken L…
                        
                          IFA speech database
The IFA Spoken Language corpus is a free (GPL) database of hand-segmented Dutch speech. It was constructed with off-the-shelf software using speech from 8 speakers in a variety of speaking styles.  For a total of 50,000 words (41 minutes/speaker), speech acquisition and preparation took around 3 person-weeks per speaker. Hand segmentation took 1,000 hours of labeling altogether. The asymptotic segmentation speed was about one word, or four boundaries, per minute.
                        
                       
                     
                   
                  
                    
  
                  
                    
  
                  
                    
  
                  
                    
  
                  
                    
  
    Recording condition0-n
    
      
        
        
          Medium= audio cd microphone= Sennheiser MKH 105 HF condenser and Shure SM10A dynamic noise= unspecified digitisation.recording= 44.1 kHz, 16 bit linear digitisation.
          
        
        
      
     
   
                  
                    
  
                  
                    
  
                  
                    
  
                  
                    
  
                  
                    
  
                  
                    
  
                  
                    
  
    SC duration speech0-1
    
      
        41 minutes per speaker
      
    
   
                  
                    
  
    SC duration full0-1
    
      
        330 minutes
      
    
   
                  
                    
  
                  
                    
  
    SC sp. demogr0-1
    
      
        4 female; 4 male;  15-66 age
      
    
   
                  
                    
  
    Size0-n
    
      
        41 minutes per speaker
      
    
   
                  
                    
  
    Annotation0-n
    
      
        
        
          [1]: 
          [orthographicTranscription] [automatic] [text/praat-textgrid], 
        
        
        
          [2]: 
          [lemmatization] [automatic] [text/praat-textgrid], 
        
        
        
          [3]: 
          [posTagging] [automatic] [text/praat-textgrid], 
        
        
        
          [4]: 
          [transliteration] [automatic] [text/praat-textgrid]
        
        
      
     
   
                  
                    
  
                  
               
             
           
          
        
       
     
    
    
      Provenance(s)0-n
      
        
          
            Provenance 1
            
              
                  
                    
  
                  
                    
  
    Country0-1
    
      
        Netherlands (the) NL
      
    
   
                  
               
             
           
        
       
     
    
    
    
    
      Accessibility0-1
      
        
          Accessibility
          
            
              
                
                  
  
                
                  
  
                
                  
  
    License name(s)0-n
    
      
        
        
          GNU General Public License
          
        
        
      
     
   
                
                  
  
    Licence URL(s)0-n
    
      
        
        
          http://www.gnu.org/copyleft/gpl.html
          
        
        
      
     
   
                
                  
  
    Non-commercial usage0-1
    
      
        yes
      
    
   
                
                  
  
    Website(s)0-n
    
      
        
        
          http://www.fon.hum.uva.nl/IFA-SpokenLanguageCorpora/IFAcorpus/
          
        
        
      
     
   
                
                  
  
                
                  
  
                
                  
  
    Contact(s)0-n
    
      
        
        
          R.J.J.H. van Son: Institute of Phonetic Sciences, (Rob.van.Son@hum.uva.nl)
          
        
        
      
     
   
                
                  
  
                
              
             
           
         
       
     
    
    
      Documentation0-1
      
        
          Documentation
          
            
              
                
                  
  
                
                  
  
                
                  
  
    URL(s)0-n
    
      
        
        
          http://www.fon.hum.uva.nl/IFA-SpokenLanguageCorpora/IFAcorpus/
          
        
        
      
     
   
                
              
             
           
         
       
     
    
    
     
   
  
  
    
  
    Editing is disabled, since you are not signed in