Technical Program

SPE-P13: Speech Analysis & Acoustic Modeling

Session Type: Poster
Time: Friday, April 24, 10:00 - 12:00
Location: Poster Area C, TICC
Session Chair: Tanja Schultz, Carnegie Mellon University
 
SPE-P13.1: MAIN VOWEL DOMAIN TONE MODELING WITH LEXICAL AND PROSODIC ANALYSIS FOR MANDARIN ASR
         ShiLei Zhang; IBM China Research Lab
         Qin Shi; IBM China Research Lab
         Stephen M. Chu; IBM T. J. Watson Research Center
         Yong Qin; IBM China Research Lab
 
SPE-P13.2: AUTOMATIC PROSODIC EVENTS DETECTION USING SYLLABLE-BASED ACOUSTIC AND SYNTACTIC FEATURES
         Je Hun Jeon; The University of Texas at Dallas
         Yang Liu; The University of Texas at Dallas
 
SPE-P13.3: BAYESIAN FEATURE ENHANCEMENT USING A MIXTURE OF UNSCENTED TRANSFORMATIONS FOR UNCERTAINTY DECODING OF NOISY SPEECH
         Yusuke Shinohara; Toshiba Corporation
         Masami Akamine; Toshiba Corporation
 
SPE-P13.4: TEMPORAL CONTRAST NORMALIZATION AND EDGE-PRESERVED SMOOTHING ON TEMPORAL MODULATION STRUCTURE FOR ROBUST SPEECH RECOGNITION
         Xugang Lu; ATR-SLC
         Shigeki Matsuda; ATR-SLC
         M. Unoki; Japan Advanced Institute of Science and Technology
         Tohru Shimizu; ATR-SLC
         Satoshi Nakamura; ATR-SLC
 
SPE-P13.5: STEREO-BASED STOCHASTIC NOISE COMPENSATION BASED ON TRAJECTORY GMMS
         Heiga Zen; Nagoya Institute of Technology
         Yoshihiko Nankaku; Nagoya Institute of Technology
         Keiichi Tokuda; Nagoya Institute of Technology
 
SPE-P13.6: IMPROVEMENTS ON MINIMUM COVARIANCE BASED SPATIAL CORRELATION TRANSFORMATION
         Tengrong Su; Tsinghua University
         Ji Wu; Tsinghua University
         Zuoying Wang; Tsinghua University
         Jie Hao; Toshiba (China) Company
 
SPE-P13.7: EMOTION RECOGNITION FROM SPEECH: PUTTING ASR IN THE LOOP
         Bjoern Schuller; Technische Universitaet Muenchen
         Anton Batliner; Friedrich-Alexander-Universitaet Erlangen-Nuernberg
         Stefan Steidl; Friedrich-Alexander-Universitaet Erlangen-Nuernberg
         Dino Seppi; Polderland Language & Speech Technology
 
SPE-P13.8: DETECTING BANDLIMITED AUDIO IN BROADCAST TELEVISION SHOWS
         Mark Fuhs; Carnegie Mellon University
         Qin Jin; Carnegie Mellon University
         Tanja Schultz; Carnegie Mellon University
 
SPE-P13.9: A SPEECH FRAGMENT APPROACH TO LOCALISING MULTIPLE SPEAKERS IN REVERBERANT ENVIRONMENTS
         Heidi Christensen; University of Sheffield
         Ning Ma; University of Sheffield
         Stuart N. Wrigley; University of Sheffield
         Jon Barker; University of Sheffield
 
SPE-P13.10: ROBUST TWO-CHANNEL TDOA ESTIMATION FOR MULTIPLE SPEAKER LOCALIZATION BY USING RECURSIVE ICA AND A STATE COHERENCE TRANSFORM
         Francesco Nesta; Fondazione Bruno Kessler, Università di Trento
         Piergiorgio Svaizer; Fondazione Bruno Kessler
         Maurizio Omologo; Fondazione Bruno Kessler
 
SPE-P13.11: PERCEPTUAL TIME VARYING LINEAR PREDICTION MODEL FOR SPEECH APPLICATIONS
         Oron Gamliel; Ben Gurion University of the Negev
         Ilan David Shallom; Ben Gurion University of the Negev
 
SPE-P13.12: PHONOLOGICAL FEATURES IN DISCRIMINATIVE CLASSIFICATION OF DYSARTHRIC SPEECH
         Frank Rudzicz; University of Toronto
 

©2016 Conference Management Services, Inc. -||- email: webmaster@icassp09.com -||- Last updated Tuesday, October 13, 2009