SPE-P3: General Topics in ASR |
| Session Type: Poster |
| Time: Tuesday, April 21, 17:00 - 19:00 |
| Location: Poster Area D, TICC |
| Session Chair: Gerasimos Potamianos, FORTH |
| SPE-P3.1: COSINE - A CORPUS OF MULTI-PARTY CONVERSATIONAL SPEECH IN NOISY ENVIRONMENTS |
| Alex Stupakov; University of Washington |
| Evan Hanusa; University of Washington |
| Jeff Bilmes; University of Washington |
| Dieter Fox; University of Washington |
| SPE-P3.2: EMOTIONAL SPEECH RECOGNITION BASED ON STYLE ESTIMATION AND ADAPTATION WITH MULTIPLE-REGRESSION HMM |
| Yusuke Ijima; Tokyo Institute of Technology |
| Makoto Tachibana; Tokyo Institute of Technology |
| Takashi Nose; Tokyo Institute of Technology |
| Takao Kobayashi; Tokyo Institute of Technology |
| SPE-P3.3: EFFICIENT COMBINATION OF LIKELIHOOD RECYCLING AND BATCH CALCULATION BASED ON CONDITIONAL FAST PROCESSING AND ACOUSTIC BACK-OFF |
| Atsunori Ogawa; NTT Corporation |
| Satoshi Takahashi; NTT Corporation |
| Atsushi Nakamura; NTT Corporation |
| SPE-P3.4: CLASS-DEPENDENT AND DIFFERENTIAL HUFFMAN CODING OF COMPRESSED FEATURE PARAMETERS FOR DISTRIBUTED SPEECH RECOGNITION |
| Young Han Lee; Gwangju Institute of Science and Technology |
| Deok Su Kim; Gwangju Institute of Science and Technology |
| Hong Kook Kim; Gwangju Institute of Science and Technology |
| SPE-P3.5: SPEECH EMOTION RECOGNITION VIA A MAX-MARGIN FRAMEWORK INCORPORATING A LOSS FUNCTION BASED ON THE WATSON AND TELLEGEN'S EMOTION MODEL |
| Sungrack Yun; Korea Advanced Institute of Science and Technology |
| Chang D. Yoo; Korea Advanced Institute of Science and Technology |
| SPE-P3.6: LONG-TIME SPAN ACOUSTIC ACTIVITY ANALYSIS FROM FAR-FIELD SENSORS IN SMART HOMES |
| Jing Huang; IBM Research |
| Xiaodan Zhuang; University of Illinois at Urbana-Champaign |
| Vit Libal; IBM Research |
| Gerasimos Potamianos; IBM Research |
| SPE-P3.7: THE AUSTRALIAN ENGLISH SPEECH CORPUS FOR IN-CAR SPEECH PROCESSING |
| Tristan Kleinschmidt; Queensland University of Technology |
| Michael Mason; Queensland University of Technology |
| Eddie Wong; Queensland University of Technology |
| Sridha Sridharan; Queensland University of Technology |
| SPE-P3.8: A STUDY ON RECOGNIZING DISTORTED SPEECH OVER LOCAL DISTRIBUTED TRANSDUCER NETWORKS |
| Yong Zhao; Georgia Institute of Technology |
| Sunghwan Shin; Georgia Institute of Technology |
| Enrique Robledo-Arnuncio; Georgia Institute of Technology |
| Biing-Hwang (Fred) Juang; Georgia Institute of Technology |
| SPE-P3.9: A CRITERION FOR THE ENHANCEMENT OF TIME-FREQUENCY MASKS IN MISSING DATA RECOGNITION |
| Daniel Pullella; The University of Western Australia |
| Roberto Togneri; The University of Western Australia |
| SPE-P3.10: COMBINING MIXTURE WEIGHT PRUNING AND QUANTIZATION FOR SMALL-FOOTPRINT SPEECH RECOGNITION |
| David Huggins-Daines; Carnegie Mellon University |
| Alexander Rudnicky; Carnegie Mellon University |
| SPE-P3.11: CROSS-LINGUAL SPEECH RECOGNITION UNDER RUNTIME RESOURCE CONSTRAINTS |
| Dong Yu; Microsoft Research |
| Li Deng; Microsoft Research |
| Peng Liu; Microsoft Research |
| Jian Wu; Microsoft Corporation |
| Yifan Gong; Microsoft Corporation |
| Alex Acero; Microsoft Research |
| SPE-P3.12: AUDIO SEGMENTATION FOR SPEECH RECOGNITION USING SEGMENT FEATURES |
| David Rybach; RWTH Aachen University |
| Christian Gollan; RWTH Aachen University |
| Ralf Schlüter; RWTH Aachen University |
| Hermann Ney; RWTH Aachen University |