| SPE-P13: Speech Analysis & Acoustic Modeling |
| Session Type: Poster |
| Time: Friday, April 24, 10:00 - 12:00 |
| Location: Poster Area C, TICC |
| Session Chair: Tanja Schultz, Carnegie Mellon University |
| SPE-P13.1: MAIN VOWEL DOMAIN TONE MODELING WITH LEXICAL AND PROSODIC ANALYSIS FOR MANDARIN ASR |
| ShiLei Zhang; IBM China Research Lab |
| Qin Shi; IBM China Research Lab |
| Stephen M. Chu; IBM T. J. Watson Research Center |
| Yong Qin; IBM China Research Lab |
| SPE-P13.2: AUTOMATIC PROSODIC EVENTS DETECTION USING SYLLABLE-BASED ACOUSTIC AND SYNTACTIC FEATURES |
| Je Hun Jeon; The University of Texas at Dallas |
| Yang Liu; The University of Texas at Dallas |
| SPE-P13.3: BAYESIAN FEATURE ENHANCEMENT USING A MIXTURE OF UNSCENTED TRANSFORMATIONS FOR UNCERTAINTY DECODING OF NOISY SPEECH |
| Yusuke Shinohara; Toshiba Corporation |
| Masami Akamine; Toshiba Corporation |
| SPE-P13.4: TEMPORAL CONTRAST NORMALIZATION AND EDGE-PRESERVED SMOOTHING ON TEMPORAL MODULATION STRUCTURE FOR ROBUST SPEECH RECOGNITION |
| Xugang Lu; ATR-SLC |
| Shigeki Matsuda; ATR-SLC |
| M. Unoki; Japan Advanced Institute of Science and Technology |
| Tohru Shimizu; ATR-SLC |
| Satoshi Nakamura; ATR-SLC |
| SPE-P13.5: STEREO-BASED STOCHASTIC NOISE COMPENSATION BASED ON TRAJECTORY GMMS |
| Heiga Zen; Nagoya Institute of Technology |
| Yoshihiko Nankaku; Nagoya Institute of Technology |
| Keiichi Tokuda; Nagoya Institute of Technology |
| SPE-P13.6: IMPROVEMENTS ON MINIMUM COVARIANCE BASED SPATIAL CORRELATION TRANSFORMATION |
| Tengrong Su; Tsinghua University |
| Ji Wu; Tsinghua University |
| Zuoying Wang; Tsinghua University |
| Jie Hao; Toshiba (China) Company |
| SPE-P13.7: EMOTION RECOGNITION FROM SPEECH: PUTTING ASR IN THE LOOP |
| Bjoern Schuller; Technische Universitaet Muenchen |
| Anton Batliner; Friedrich-Alexander-Universitaet Erlangen-Nuernberg |
| Stefan Steidl; Friedrich-Alexander-Universitaet Erlangen-Nuernberg |
| Dino Seppi; Polderland Language & Speech Technology |
| SPE-P13.8: DETECTING BANDLIMITED AUDIO IN BROADCAST TELEVISION SHOWS |
| Mark Fuhs; Carnegie Mellon University |
| Qin Jin; Carnegie Mellon University |
| Tanja Schultz; Carnegie Mellon University |
| SPE-P13.9: A SPEECH FRAGMENT APPROACH TO LOCALISING MULTIPLE SPEAKERS IN REVERBERANT ENVIRONMENTS |
| Heidi Christensen; University of Sheffield |
| Ning Ma; University of Sheffield |
| Stuart N. Wrigley; University of Sheffield |
| Jon Barker; University of Sheffield |
| SPE-P13.10: ROBUST TWO-CHANNEL TDOA ESTIMATION FOR MULTIPLE SPEAKER LOCALIZATION BY USING RECURSIVE ICA AND A STATE COHERENCE TRANSFORM |
| Francesco Nesta; Fondazione Bruno Kessler, Università di Trento |
| Piergiorgio Svaizer; Fondazione Bruno Kessler |
| Maurizio Omologo; Fondazione Bruno Kessler |
| SPE-P13.11: PERCEPTUAL TIME VARYING LINEAR PREDICTION MODEL FOR SPEECH APPLICATIONS |
| Oron Gamliel; Ben Gurion University of the Negev |
| Ilan David Shallom; Ben Gurion University of the Negev |
| SPE-P13.12: PHONOLOGICAL FEATURES IN DISCRIMINATIVE CLASSIFICATION OF DYSARTHRIC SPEECH |
| Frank Rudzicz; University of Toronto |