2017-01-20T15:27:16 n442

The QUT-NOISE Databases and Protocols

Viewed: 132

The QUT-NOISE Databases and Protocols

Overview

This distribution contains the QUT-NOISE database and the code required to create the QUT-NOISE-TIMIT database from the QUT-NOISE database and a locally installed copy of the TIMIT database. It also contains code to create the QUT-NOISE-SRE protocol on top of an existing speaker recognition evaluation database (such as NIST evaluations). Further information on the QUT-NOISE and QUT-NOISE-TIMIT databases is available in our paper: 


D. Dean, S. Sridharan, R. Vogt, M. Mason (2010) "The QUT-NOISE-TIMIT corpus for the evaluation of voice activity detection algorithms", in Proceedings of Interspeech 2010, Makuhari Messe International Convention Complex, Makuhari, Japan.

This paper is also available in the file: docs/Dean2010, The QUT-NOISE-TIMIT corpus for the evaluation of voice activity detection algorithm.pdf, distributed with this database.

Further information on the QUT-NOISE-SRE protocol is available in our paper: 
D. Dean, A. Kanagasundaram, H. Ghaemmaghami, M. Hafizur, S. Sridharan (2015) The QUT-NOISE-SRE protocol for the evaluation of noisy speaker recognition". In Proceedings of Interspeech 2015, September, Dresden, Germany.

Licensing

The QUT-NOISE data itself is licensed CC-BY-SA, and the code required to create the QUT-NOISE-TIMIT database and QUT-NOISE-SRE protocols is licensed under the BSD license. Please consult the approriate LICENSE.txt files (in the code and QUT-NOISE directories) for more information. To attribute this database, please include the following citation: 


D. Dean, S. Sridharan, R. Vogt, M. Mason (2010) The QUT-NOISE-TIMIT corpus for the evaluation of voice activity detection algorithms", in Proceedings of Interspeech 2010, Makuhari Messe International Convention Complex, Makuhari, Japan.

If your work is based upon the QUT-NOISE-SRE, please also include this citation: 
D. Dean, A. Kanagasundaram, H. Ghaemmaghami, M. Hafizur, S. Sridharan (2015) The QUT-NOISE-SRE protocol for the evaluation of noisy speaker recognition". In Proceedings of Interspeech 2015, September, Dresden, Germany.

Download and Installation

Download the following QUT-NOISE*.zip files:

Creating QUT-NOISE-TIMIT

Obtaining TIMIT

In order to construct the QUT-NOISE-TIMIT database from the QUT-NOISE data supplied here you will need to obtain a copy of the TIMIT database from the Linguistic Data Consortium. If you just want to use the QUT-NOISE database, or you wish to combine it with different speech data, TIMIT is not required.

Creating QUT-NOISE-TIMIT

  • Once you have obtained TIMIT, download and install a copy of VOICEBOX: Speech Processing Toolbox for MATLAB and install it in your MATLABPATH.
  • Run matlab in the QUT-NOISE/code directory, and run the function: createQUTNOISETIMIT('/location/of/timit-cd/timit'). This will create the QUT-NOISE-TIMIT database in the QUT-NOISE/QUT-NOISE-TIMIT directory.
  • If you wish to verify that the QUT-NOISE-TIMIT database matches that evaluated in our original paper, please check that the md5sums (use md5sum on unix-based OSes) match those in the QUT-NOISE-TIMIT/md5sum.txt file.
  • Using the QUT-NOISE-SRE protocol
    • The code related to the QUT-NOISE-SRE protocol can be used in two ways:
      1. To create a collection of noisy audio files across the scenarios in the QUT-NOISE database at different noise levels, or,
      2. To recreate a list of file names based on the QUT-NOISE-SRE protocl produced by another researcher, having already done (1). This allows existing research to be reproduced without having to send large volumes of audio around.
    • If you are interested in creating your own noisy database from an existing SRE database (1 above), please look at the example script exampleQUTNOISESRE.sh in the QUT-NOISE/code directory. You will need to make some modifications, but it should give you the right idea.
    • If you are interested in creating our QUT-NOISE-NIST2008 database published at Interspeech 2015, you can find the list of created noisy files in the QUT-NOISE-NIST2008.train.short2.list and QUT-NOISE-NIST2008.test.short3.list files in the QUT-NOISE/code directory.
    • These files can be recreated as follows (provided you have access to the NIST2008 SRE data): 

      Run matlab in the QUT-NOISE/code directory, and run the following functions: 

      createQUTNOISESREfiles('NIST2008.train.short2.list', ... 
      'QUT-NOISE-NIST2008.train.short2.list', ... 
      '

Geographical area of data collection

text
Brisbane

Publications

The QUT-NOISE-TIMIT corpus for the evaluation of voice activity detection algorithms http://eprints.qut.edu.au/38144/
The QUT-NOISE-SRE protocol for the evaluation of noisy speaker recognition http://eprints.qut.edu.au/85240/

Research areas

Speech Processing; Speaker Recognition; Biometrics;

Cite this collection

Dean,David; Sridharan,Sridha; Vogt,Robert; Mason,Michael. (2010): The QUT-NOISE Databases and Protocols. [Queensland University of Technology]. https://doi.org/10.4225/09/58819f7a21a21

Licence


Creative Commons Attribution-Share Alike 4.0 (CC-BY-SA)
http://creativecommons.org/licenses/by-sa/4.0/

Copyright

© 2010 QUT and is licensed under the Creative Commons Attribution-ShareAlike 3.0 Australia License.

Contacts

Name: Dr Simon Denman

Other

Date record created:
2016-07-22T11:13:47
Date record modified:
2017-01-20T15:27:16
Record status:
Published - Open Access