The QUT-NOISE Databases and Protocols
The QUT-NOISE Databases and Protocols
Overview
This distribution contains the QUT-NOISE database and the code required to create the QUT-NOISE-TIMIT database from the QUT-NOISE database and a locally installed copy of the TIMIT database. It also contains code to create the QUT-NOISE-SRE protocol on top of an existing speaker recognition evaluation database (such as NIST evaluations). Further information on the QUT-NOISE and QUT-NOISE-TIMIT databases is available in our paper:
D. Dean, S. Sridharan, R. Vogt, M. Mason (2010) "The QUT-NOISE-TIMIT corpus for the evaluation of voice activity detection algorithms", in Proceedings of Interspeech 2010, Makuhari Messe International Convention Complex, Makuhari, Japan.
This paper is also available in the file: docs/Dean2010, The QUT-NOISE-TIMIT corpus for the evaluation of voice activity detection algorithm.pdf, distributed with this database.
Further information on the QUT-NOISE-SRE protocol is available in our paper:
D. Dean, A. Kanagasundaram, H. Ghaemmaghami, M. Hafizur, S. Sridharan (2015) The QUT-NOISE-SRE protocol for the evaluation of noisy speaker recognition". In Proceedings of Interspeech 2015, September, Dresden, Germany.
Licensing
The QUT-NOISE data itself is licensed CC-BY-SA, and the code required to create the QUT-NOISE-TIMIT database and QUT-NOISE-SRE protocols is licensed under the BSD license. Please consult the approriate LICENSE.txt files (in the code and QUT-NOISE directories) for more information. To attribute this database, please include the following citation:
D. Dean, S. Sridharan, R. Vogt, M. Mason (2010) The QUT-NOISE-TIMIT corpus for the evaluation of voice activity detection algorithms", in Proceedings of Interspeech 2010, Makuhari Messe International Convention Complex, Makuhari, Japan.
If your work is based upon the QUT-NOISE-SRE, please also include this citation:
D. Dean, A. Kanagasundaram, H. Ghaemmaghami, M. Hafizur, S. Sridharan (2015) The QUT-NOISE-SRE protocol for the evaluation of noisy speaker recognition". In Proceedings of Interspeech 2015, September, Dresden, Germany.
Download and Installation
Download the following QUT-NOISE*.zip files:
- QUT_NOISE.zip (26.7 MB, md5sum: 672461fd88782e9ea10d5c2cb7a84196)
- QUT_NOISE_CAFE.zip (1.6 GB, md5sum: f87fb213c0e1c439e1b727fb258ef2cd)
- QUT_NOISE_CAR.zip (1.7 GB, md5sum: d680118b4517e1257a9263b99d1ac401)
- QUT_NOISE_HOME.zip (1.4 GB, md5sum: d99572ae1c118b749c1ffdb2e0cf0d2e)
- QUT_NOISE_REVERB.zip (1.4 GB, md5sum: fe107ab341e6bc75de3a32c69344190e)
- QUT_NOISE_STREET.zip (1.6 GB, md5sum: 68d5ebc2e60cb07927cc4d33cdf2f017)
Creating QUT-NOISE-TIMIT
Obtaining TIMIT
In order to construct the QUT-NOISE-TIMIT database from the QUT-NOISE data supplied here you will need to obtain a copy of the TIMIT database from the Linguistic Data Consortium. If you just want to use the QUT-NOISE database, or you wish to combine it with different speech data, TIMIT is not required.
Creating QUT-NOISE-TIMIT
- Once you have obtained TIMIT, download and install a copy of VOICEBOX: Speech Processing Toolbox for MATLAB and install it in your MATLABPATH.
- Run matlab in the QUT-NOISE/code directory, and run the function: createQUTNOISETIMIT('/location/of/timit-cd/timit'). This will create the QUT-NOISE-TIMIT database in the QUT-NOISE/QUT-NOISE-TIMIT directory.
- If you wish to verify that the QUT-NOISE-TIMIT database matches that evaluated in our original paper, please check that the md5sums (use md5sum on unix-based OSes) match those in the QUT-NOISE-TIMIT/md5sum.txt file.
- Using the QUT-NOISE-SRE protocol
- The code related to the QUT-NOISE-SRE protocol can be used in two ways:
- To create a collection of noisy audio files across the scenarios in the QUT-NOISE database at different noise levels, or,
- To recreate a list of file names based on the QUT-NOISE-SRE protocl produced by another researcher, having already done (1). This allows existing research to be reproduced without having to send large volumes of audio around.
- If you are interested in creating your own noisy database from an existing SRE database (1 above), please look at the example script exampleQUTNOISESRE.sh in the QUT-NOISE/code directory. You will need to make some modifications, but it should give you the right idea.
- If you are interested in creating our QUT-NOISE-NIST2008 database published at Interspeech 2015, you can find the list of created noisy files in the QUT-NOISE-NIST2008.train.short2.list and QUT-NOISE-NIST2008.test.short3.list files in the QUT-NOISE/code directory.
- These files can be recreated as follows (provided you have access to the NIST2008 SRE data):
Run matlab in the QUT-NOISE/code directory, and run the following functions:
createQUTNOISESREfiles('NIST2008.train.short2.list', ...
'QUT-NOISE-NIST2008.train.short2.list', ...
'
- The code related to the QUT-NOISE-SRE protocol can be used in two ways:
Geographical area of data collection
Publications
Research areas
Cite this collection
Data file types
Licence
Creative Commons Attribution-Share Alike 4.0 (CC-BY-SA)
http://creativecommons.org/licenses/by-sa/4.0/