QUT SAIVT: Speech, audio, image and video technologies research
Viewed:
4486
The Speech, Audio, Image and Video Technologies (SAIVT) research group conducts world class research, postgraduate training, industrial consultancy and product development in the areas of Speech, Audio, Image and Video Technologies. A major focus of the research is in applying Machine Learning techniques to solve real world problems in Computer Vision and Speech and Language Processing.
The group was established in 1989 and has graduated 45 PhD students and 10 Masters by Research students in the areas of speech, audio, image and video technologies. Currently 28 full-time PhD students are enrolled within the group.
Research areas
Speech recognition
Person tracking
Multi-camera management
Soft biometrics
Speech detection
Video surveillance
Speaker verification and identification
Keyword spotting/spoken term detection
Audio visual speech recognition
Speech enhancement single/multi-microphone
Crowd monitoring
Abnormal event detection
Gait recognition
Computer Vision
Human action recognition
Human identification at distance
Language identification
Speaker indexing/diarisation/segmentation/clustering
Facial expression recognition
Speech emotion detection
Multimodal biometrics
Speaker role detection
Vehicle tracking
Iris recognition at a distance
Multimodal emotion recognition
Video event detection
Anti-spoofing biometrics
Speech quality estimation
Related information
Publications by SAIVT
https://www.qut.edu.au/research/research-projects/speech-audio-image-and-video-technology-saivt
Research Funding
https://wiki.qut.edu.au/display/saivt/Research+Funding
Connections
Has association with
Is managed by
Contacts
Other
Date record created:
2014-06-16T16:40:20
Date record modified:
2016-07-04T14:50:39
Record status:
Published - Open Access