Recent Publications – Haizhou Li
Patents
1) US Patent number 6311152, Shuanhu Bai, Horng Jyh Paul Wu, Haizhou Li, Gareth Loudon, System
for Chinese tokenization and named entity recognition, Publication date 2001/10/30
2) US Patent Number 6674861, Changsheng Xu, Jiankang Wu, Qibin Sun, Kai Xin, Haizhou Li, Digital
audio watermarking using content-adaptive, multiple echo hopping, Publication date: 2004/1/6
3) US Patent Number 6397181 B1, Haizhou Li, Jiankang Wu, Method and Apparatus for Voice
Annotation and Retrieval of Multimedia Data, Publication date: 2000/8/3
4) US Patent Number 7,917,361 B2, Haizhou Li, Bin Ma, George M. White, Spoken Language
Identification System and Methods for Training and Operating Same, Publication date: March 29, 2011
5) USPTO Application #: #20100299136, Rong Tong, Shuanghu Bai, Haizhou Li, dialogue system and a
method for executing a fully mixed initiative dialogue (fmid) interaction between a human and a
machine, Publish Date: 25 November 2010
6) USPTO Application #: #20150025892, Siu Wa Lee, Ling Cen, Haizhou Li, Yaozhu Paul Chan,
Minghui Dong, Method and system for template-based personalized singing synthesis, Publish Date:
22 January 2015
7) USPTO Application #: #20100198760, Namunu C. Maddage, Haizhou Li, Apparatus and methods for
music signal analysis, Publish Date: 5 August 2010
8) USPTO Application #: #20100004931, Bin Ma, Haizhou Li, Minghui Dong, Apparatus and method for
speech utterance verification , Publish Date: 7 January 2010
Books & Book Chapters
1) Haizhou Li, Kar-Ann Toh, Liyuan Li, Advanced Topics in Biometrics, World Scientific, 2011.
2) Haizhou Li, Bin Ma, and Chin-Hui Lee, Vector-based Spoken Language Classification. In Jacob
Benesty, M. Mohan Sondhi, Arden Huang (editors) Springer Handbook of Speech Processing, Springer,
2007.
3) Chin-Hui Lee, Haizhou Li, Lin-shan Lee, Renhua Wang, and Qiang Huo (editors), Advances in
Chinese Spoken Language Processing, World Scientific, 2007.
4) Shuzhi Sam Ge, Haizhou Li, John-John Cabibihan and Yeow Kee Tan (editors), Social Robotics,
Springer Lecture Notes in Artificial Intelligence 6414, 2010.
5) Qiang Huo, Bin Ma, Eng Siong Chng, and Haizhou Li (editors), Chinese Spoken Language Processing,
Springer Lecture Notes in Artificial Intelligence 4274, 2006.
6) Yinglin Yu and Haizhou Li, Neural Networks and Signal Analysis, South China University of
Technology Press, Guangzhou.
Journal Articles
1) Kaavya Sriskandaraja, Vidhyasaharan Sethu, Eliathamby Ambikairajah, Haizhou Li, Front-End for
Antispoofing Countermeasures in Speaker Verification: Scattering Spectral Decomposition, IEEE
Journal of Selected Topics in Signal Processing 11(4): 632-643, 2017
2) Hongjie Chen, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li, Multitask Feature Learning for Low-
Resource Query-by-Example Spoken Term Detection, IEEE Journal of Selected Topics in Signal
Processing 11(8): 1329-1339, 2017
3) Xiaohai Tian, Siu Wa Lee, Zhizheng Wu, Eng Siong Chng, Haizhou Li, An Exemplar-Based
Approach to Frequency Warping for Voice Conversion, IEEE/ACM Trans. Audio, Speech & Language
Processing 25(10): 1863-1876, 2017
4) Hongjie Chen, Lei Xie, Cheung-Chi Leung, Xiaoming Lu, Bin Ma, Haizhou Li, Modeling Latent
Topics and Temporal Distance for Story Segmentation of Broadcast News, IEEE/ACM Trans. Audio,
Speech & Language Processing 25(1): 108-119, 2017
5) Xiong Xiao, Shengkui Zhao, Duc Hoang Ha Nguyen, Xionghu Zhong, Douglas L. Jones, Eng Siong
Chng, Haizhou Li, Speech dereverberation for enhancement and recognition using dynamic features
constrained deep neural networks and feature adaptation. EURASIP J. Adv. Sig. Proc. 2016.
6) Zhizheng Wu, Haizhou Li, On the study of replay and voice conversion attacks to text-dependent
speaker verification. Multimedia Tools Appl. 75(9) , pp. 5311-5327, 2016.
7) Nancy F. Chen, Darren Wee, Rong Tong, Bin Ma, Haizhou Li, Large-scale characterization of non-
native Mandarin Chinese spoken by speakers of European origin: Analysis on iCALL. Speech
Communication 84, pp. 46-56, 2016.
8) Sven Ewan Shepstone, Kong-Aik Lee, Haizhou Li, Zheng-Hua Tan, Søren Holdt Jensen, Total
Variability Modeling Using Source-Specific Priors. IEEE/ACM Trans. Audio, Speech & Language
Processing 24(3), pp. 504-517, 2016.
9) Duc Hoang Ha Nguyen, Xiong Xiao, Eng Siong Chng, Haizhou Li, Feature Adaptation Using Linear
Spectro-Temporal Transform for Robust Speech Recognition. IEEE/ACM Trans. Audio, Speech &
Language Processing 24(6), pp. 1006-1019, 2016.
10) Qiang Yu, Rui Yan, Huajin Tang, Kay Chen Tan, Haizhou Li, A Spiking Neural Network System for
Robust Sequence Recognition. IEEE Trans. Neural Netw. Learning Syst. 27(3), pp. 621-635, 2016.
11) Yuma Ueda, Longbiao Wang, Atsuhiko Kai, Xiong Xiao, Engsiong Chng, Haizhou Li, Single-channel
Dereverberation for Distant-Talking Speech Recognition by Combining Denoising Autoencoder and
Temporal Structure Normalization. Signal Processing Systems 82(2), pp. 151-161, 2016.
12) Liping Chen, Kong-Aik Lee, Bin Ma, Wu Guo, Haizhou Li, Li-Rong Dai, Exploration of Local
Variability in Text-Independent Speaker Verification. Signal Processing Systems 82(2), pp. 217-228 ,
2016.
13) Jun Hu, Huajin Tang, Kay Chen Tan, Haizhou Li, How the Brain Formulates Memory: A Spatio-
Temporal Model, IEEE Computational Intelligence Magazine, accepted in 2015
14) Qiang Yu, Rui Yan, Huajin Tang, Kay Chen Tan, Haizhou Li, A Spiking Neural Network System for
Robust Sequence Recognition, IEEE Transactions on Neural Networks and Learning Systems,
accepted in 2015 (DOI: 10.1109/TNNLS.2015.2416771)
15) Jonathan Dennis, Huy Dat Tran, Haizhou Li, Generalized Hough Transform for Speech Pattern
Classification, IEEE/ACM Transactions on Audio, Speech and Language Processing, 23(11), pp. 1963-
1972, 2015.
16) Chang Huai You, Haizhou Li, and Kong-Aik Lee, “Relevance factor of maximum a posteriori
adaptation for GMM-NAP-SVM in speaker and language recognition”, Computer Speech and
Language, vol.30, no.1, pp.116-134, 2015.
17) Dau-Cheng Lyu, Tien Ping Tan, Eng siong Chng, Haizhou Li, Mandarin-English code-switching
speech corpus in South-East Asia: SEAME. Language Resources and Evaluation 49(3): 581-600 (2015)
18) Haipeng Wang, Tan Lee, Cheung-Chi Leung, Bin Ma, and Haizhou Li, “Acoustic Segment Modeling
with Spectral Clustering Methods”, IEEE/ACM Transactions on Audio, Speech and Language
Processing, vol.23, no.2, pp.264-277, 2015.
19) Van Hai Do, Xiong Xiao, Eng Siong Chng, and Haizhou Li, “Context-dependent Phone Mapping for
Acoustic Modeling of Under-resourced Languages”, International Journal of Asian Language
Processing, vol.23, no.1, pp.21-33, 2015.
20) Haizhou Li, Marcello Federico, Xiaodong He, Helen M. Meng, and Isabel Trancoso, “Introduction to
the Special Section on Continuous Space and Related Methods in Natural Language Processing”,
IEEE/ACM Transactions on Audio, Speech and Language Processing, vol.23, no.3, pp.427-430, 2015.
21) Tze Yuang Chong, Rafael E. Banchs, Eng siong Chng, Haizhou Li, “Decoupling Word-Pair Distance
and Co-occurrence Information for Effective Long History Context Language Modeling,” IEEE/ACM
Transactions on Audio, Speech and Language Processing, vol 23, no. 7, (7): pp. 1221-1232, 2015
22) Rafael E. Banchs, Luis F. D'Haro, and Haizhou Li, “Adequacy-Fluency Metrics: Evaluating MT in the
Continuous Space Model Framework”, IEEE/ACM Transactions on Audio, Speech and Language
Processing, vol.23, no.3, pp.472-482, 2015.
23) Zhizheng Wu, Nicholas Evans, Tomi Kinnunen, Junichi Yamagishi, Federico Alegre, and Haizhou Li,
"Spoofing and countermeasures for speaker verification: a survey", Speech Communication, vol.66, pp.
130-153, 2015.
24) Haizhou Li, Inaugural editorial: Embracing Opportunities for Growth, IEEE/ACM Transactions on
Audio, Speech and Language Processing, 23(1): 5-6, 2015.
25) Van Hai Do, Xiong Xiao, Eng Siong Chng, and Haizhou Li, “Cross-lingual phone mapping for large
vocabulary speech recognition of under-resourced languages”, IEICE Transactions on Information and
Systems, vol.97-D, no.2, pp. 285-295, 2014.
26) Miaolong Yuan, Huajin Tang, and Haizhou Li, “Real-Time Keypoint Recognition Using Restricted
Boltzmann Machine,” IEEE Transactions on Neural Networks and Learning Systems, vol.25, no.11, pp.
2119-2126, 2014.
27) Zhizheng Wu and Haizhou Li, “Voice conversion versus speaker verification: an overview”, APSIPA
Transactions on Signal and Information Processing, vol.3, 2014.
28) Zhizheng Wu, Eng Siong Chng, and Haizhou Li, “Exemplar-based voice conversion using joint
nonnegative matrix factorization”, Multimedia Tools and Applications, Springer, 2014.
29) Zhizheng Wu, Tuomas Virtanen, Eng Siong Chng, and Haizhou Li, “Exemplar-based sparse
representation with residual compensation for voice conversion”, IEEE/ACM Transactions on Audio,
Speech and Language Processing, vol.22, no.10, pp. 1506-1521, 2014.
30) Anthony Larcher, Kong Aik Lee, Bin Ma, and Haizhou Li, “Text-dependent speaker verification:
Classifiers, databases and RSR2015”, Speech Communication, vol.60, pp. 56-77, 2014.
31) S. J. Wright, D. Kanevsky, Li Deng, Xiaodong He, G. Heigold, and Haizhou Li, “Optimization
Algorithm and Applications for Speech and Language Processing”, IEEE Transactions on Audio,
Speech and Language Processing, vol.21, no.11, pp. 2231-2243, 2013.
32) Raymond W. M. Ng, Tan Lee, Cheung-Chi Leung, Bin Ma, and Haizhou Li, “Spoken Language
Recognition With Prosodic Features”, IEEE Transactions on Audio, Speech and Language Processing,
vol.21, no.9, pp. 1841-1853, April 2013.
33) Ville Hautamäki, Tomi Kinnunen, Filip Sedlak, Kong Aik Lee, Bin Ma, and Haizhou Li, “Sparse
Classifier Fusion for Speaker Verification”, IEEE Transactions on Audio, Speech and Language
Processing, vol.21, no.8, pp. 1622-1631, August 2013.
34) Qiang Yu, Huajin Tang, Kay Chen Tan, and Haizhou Li, “Precise-Spike-Driven Synaptic Plasticity:
Learning Hetero-Association of Spatiotemporal Spike Patterns”, PLoS ONE, vol.8, no.11, November
2013.
35) Haizhou Li, Kong Aik Lee, and Bin Ma, “Spoken Language Recognition: From Fundamentals to
Practice”, Proceedings of the IEEE, vol. 101, no. 5, pp. 1136-1159, May 2013.
36) Douglas D. O'Shaughnessy, Li Deng, and Haizhou Li, “Speech Information Processing: Theory and
Applications”, Proceedings of the IEEE, vol. 101, no. 5, pp. 1034-1037, May 2013.
37) Jiali Yu, Huajin Tang, and Haizhou Li, “Dynamics Analysis of a Population Decoding Model”, IEEE
Transactions on Neural Networks and Learning Systems, vol. 24, no. 3, pp. 498-503, 2013.
38) Qiang Yu, Huajin Tang, Kay Chen Tan, and Haizhou Li, “Rapid Feedforward Computation by
Temporal Encoding and Learning With Spiking Neurons”, IEEE Transactions on Neural Networks and
Learning Systems, vol.24, no.10, pp. 1539-1552, October 2013.
39) Haipeng Wang, Cheung-Chi Leung, Tan Lee, Bin Ma, and Haizhou Li, “Shifted-Delta MLP Features
for Spoken Language Recognition”, IEEE Signal Processing Letters, vol. 20, no. 1, pp. 15-18, January
2013.
40) Andreea Niculescu, Betsy van Dijk, Anton Nijholt, Haizhou Li, and See Swee Lan, “Making Social
Robots More Attractive: The Effects of Voice Pitch, Humor and Empathy”, International Journal of
Social Robotics, vol. 5, no. 2, pp. 171-191, April 2013.
41) Jiali Yu, Huajin Tang, and Haizhou Li, “Continuous attractors of discrete-time recurrent neural
networks”, Neural Computing and Applications, vol. 23, no. 1, pp. 89-96, 2013.
42) Jiali Yu, Huajin Tang, Haizhou Li, and Luping Shi, “Dynamical properties of continuous attractor
neural network with background tuning”, Neurocomputing, vol. 99, pp. 439-447, 2013.
43) Jun Hu, Huajin Tang, Kay Chen Tan, Haizhou Li, and Luping Shi, “A Spike-Timing-Based Integrated
Model for Pattern Recognition”, Neural Computation, vol. 25, no. 2, pp. 450-472, 2013.
44) Sakriani Sakti, Michael Paul, Andrew Finch, Shinsuke Sakai, Thang Tat Vu, Noriyuki Kimura, Chiori
Hori, Eiichiro Sumita, Satoshi Nakamura, Jun Park, Chai Wutiwiwatchai, Bo Xu, Hammam Riza,
Karunesh Arora, Chi Mai Luong, and Haizhou Li, “A-STAR: Toward Translating Asian Spoken
Languages”, Computer Speech and Language, vol. 27, no. 2, pp. 509-527, 2013.
45) Zhizheng Wu, Tomi Kinnunen, Eng Siong Chng, and Haizhou Li, “Mixture of factor analyzers using
priors from non-parallel speech for voice conversion”, IEEE Signal Processing Letters, vol. 19, no. 12,
pp. 914-917, 2012.
46) Omid Dehzangi, Bin Ma, Eng-Siong Chng, and Haizhou Li, “Discriminative Feature Extraction for
Speech Recognition Using Continuous Output Codes”, Pattern Recognition Letters, vol. 33, pp. 1703-
1709, 2012.
47) Liyuan Li, Shuicheng Yan, Xinguo Yu, Yeow Kee Tan, and Haizhou Li, “Robust Multiperson
Detection and Tracking for Mobile Service and Social Robots”, IEEE Transactions on Systems, Man,
and Cybernetics -PART B: CYBERNETICS, vol. 42, no. 5, 2012.
48) Tomi Kinnunen, Rahim Saeidi, Filip Sedlak, Kong Aik Lee, Johan Sandberg, Maria Hansson-Sandsten,
and Haizhou Li, ”Low-Variance Multitaper MFCC Features: a Case Study in Robust Speaker
Verification”, IEEE Transactions on Audio, Speech and Language Processing, vol. 20, no. 7, pp. 1990-
2001, September 2012.
49) Andreea Niculescu, Betsy van Dijk, Anton Nijholt, Haizhou Li, and Swee Lan See, “Making social
robots more attractive: the effects of voice pitch, humor and empathy”, International Journal of Social
Robotics, vol. 5, no. 2, pp. 171-191, April 2013.
50) Wenliang Chen, Jun'ichi Kazama, Min Zhang, Yoshimasa Tsuruoka, Yujie Zhang, Yiou Wang,
Kentaro Torisawa, and Haizhou Li, “Bitext dependency parsing with auto-generated bilingual
treebank”, IEEE Transactions on Audio, Speech and Language Processing, vol. 20, no. 5, pp. 1461-
1472, 2012.
51) Xiaoxuan Wang, Lei Xie, Mimi Lu, Bin Ma, Engsiong Chng, and Haizhou Li, “Broadcast news story
segmentation using conditional random fields and multimodal features”, IEICE Transactions on
Information and Systems, vol. E95-D, no. 5, pp.1206-1215, 2012.
52) Yi Ren Leng, Tran Huy Dat, Norihide Kitaoka, and Haizhou Li, “Selective gammatone envelope
feature for robust sound event recognition”, IEICE Transactions, vol. 95-D, no. 5, pp. 1229-1237, 2012.
53) Rui Yan, Keng Peng Tee, Yuanwei Chua, Haizhou Li, and Huajin Tang, “Gesture Recognition Based
on Localist Attractor Networks with Application to Robot Control”, IEEE Computational Intelligence
Magazine, vol. 7, No. 1, pp. 64-74, 2012.
54) Keng Peng Tee, Rui Yan, Yuanwei Chua, Zhiyong Huang, and Haizhou Li, “Modular IK: a Robust
Inverse Kinematic Algorithm for Gesture Imitation in an Upper-Body Humanoid Robot”, International
Journal of Humanoid Robotics, vol. 9, no. 2, June 2012.
55) Jin-Shea Kuo and Haizhou Li, “Learning regional transliteration variants”, Information Processing
and Management, vol. 48, no. 1, pp. 154-169, 2012.
56) Tin Lay Nwe, Hanwu Sun, Bin Ma, and Haizhou Li, “Speaker Clustering and Cluster Purification
Methods for RT07 and RT09 Evaluation Meeting Data”, IEEE Transactions on Audio, Speech and
Language Processing, vol. 20, no. 2, pp. 461-473, 2012.
57) Haizhou Li, “FOREWORD - Special Section on Recent Advances in Multimedia Signal Processing
Techniques and Applications”, IEICE TRANSACTIONS on Information and Systems, vol. 95-D, no. 5,
pp. 1181-1181, May 2012.
58) Haizhou Li , John-John Cabibihan, and Yeow Kee Tan, “Towards an Effective Design of Social
Robots”, International Journal of Social Robotics, vol. 3, no. 4, pp. 333-335, November 2011.
59) Huajin Tang and Haizhou Li, “Book Review: Information Theoretic Learning: Renyi’s Entropy and
Kernel Perspectives”, IEEE Computational Intelligence Magazine, vol. 6, no. 3, August 2011.
60) Eliathamby Ambikairajah, Haizhou Li, Liang Wang, Bo Yin, and Vidhyasaharan Sethu, “Language
Identification: A Tutorial”, IEEE Circuits and Systems Magazine, vol. 11, no. 2, pp. 82-108, 2011.
61) Huajin Tang Haizhou Li, and Zhang Yi, “Online learning and stimulus-driven responses of neurons in
visual cortex”, Cognitive Neurodynamics, vol. 5, no. 1, pp. 77-85, 2011.
62) Omid Dehzangi, Bin Ma, Eng-Siong Chng, and Haizhou Li, “Error Corrective Fusion of Classifier
Scores for Spoken Language”, IEICE Transactions on Information and Systems, vol. E94-D, no.12, pp.
2503-2512, 2011.
63) Deyi Xiong, Min Zhang, and Haizhou Li, “A Maximum Entropy Segmentation Model for Statistical
Machine Translation”, IEEE Transactions on Audio, Speech and Language Processing, vol. 19, no. 8,
November 2011.
64) Huy Dat Tran and Haizhou Li, “Sound Event Recognition with Probabilistic Distance SVMs”, IEEE
Transactions on Audio, Speech and Language Processing, vol. 19, no. 6, pp. 1556-1568, 2011.
65) Jonathan Dennis, Huy Dat Tran, and Haizhou Li, “Spectrogram Image Feature for Sound Event
Classification in Mismatched Conditions”, IEEE Signal Processing Letters, vol. 18, no. 2, pp. 130-133,
February 2011.
66) Kong Aik Lee, Chang Huai You, Haizhou Li, Tomi Kinnunen, and Khe Chai Sim, “Using Discrete
Probabilities with Bhattacharyya Measure for SVM-based Speaker Verification”, IEEE Transactions
on Audio, Speech and Language Processing, vol. 19, no. 4, pp. 861-870, May 2011.
67) Donglai Zhu, Bin Ma, and Haizhou Li, “Speaker Verification with Feature-Space MAPLR
Parameters”, IEEE Transactions on Audio, Speech and Language Processing, vol. 19, no. 3, pp. 505-
515, March 2011.
68) Namunu C. Maddage and Haizhou Li, “Beat Space Segmentation and Octave Scale Cepstral Feature
for Sung Language Recognition in Pop Music”, ACM Transactions on Multimedia Computing,
Communications and Applications (TOMCCAP), vol. 7, no. 4, November 2011.
69) Haizhou Li and Ma Bin, “TechWare: Speaker and Spoken Language Recognition Resources”, IEEE
Signal Processing Magazine, vol. 27, no. 6, pp. 139-142, November 2010.
70) Deyi Xiong, Min Zhang, Aiti Aw, and Haizhou Li, “Linguistically Annotated Reordering Evaluation
and Analysis”, Computational Linguistics, vol. 36, no. 3, pp. 535-568, 2010.
71) Huajin Tang, Haizhou Li, and Zhang Yi, “A Discrete-Time Neural Network for Optimization
Problems with Hybrid Constraints”, IEEE Transactions on Neural Networks, vol. 21, no. 7, pp. 1184-
1189, 2010.
72) Lei Wang, Eng Siong Chng, and Haizhou Li, “A Tree-Construction Search Approach for Multivariate
Time Series Motifs Discovery”, Pattern Recognition Letters, vol. 31, no. 9, pp. 869-875, 2010.
73) Huajin Tang, Haizhou Li, and Rui Yan, “Memory Dynamics in Attractor Networks with Saliency
Weights”, Neural Computation, vol. 22, no. 7, pp. 1899-1926, July 2010.
74) Chang Huai You, Kong Aik Lee, and Haizhou Li, “GMM-SVM Kernel with a Bhattacharyya-Based
Distance for Speaker Recognition”, IEEE Transactions on Audio, Speech and Language Processing,
vol. 18, no. 6, pp. 1300-1312, 2010.
75) Tomi Kinnunen and Haizhou Li, “An Overview of Text-Independent Speaker Recognition: from
Features to Supervectors”, Speech Communication, vol. 52, no. 1, pp. 12-40, 2010. (Speech
Communication Most Cited Article since 2007)
76) Xiong Xiao, Jinyu Li, Eng Siong Chng, Haizhou Li, and Chin-Hui Lee, “A Study on the
Generalization Capability of Acoustic Models for Robust Speech Recognition”, IEEE Transactions on
Audio, Speech and Language Processing, vol. 18, no. 6, pp. 1158-1169, 2010.
77) Namunu C. Maddage, Khe Chai Sim, and Haizhou Li, “Word Level Automatic Alignment of Music
and Lyrics using Vocal Synthesis”, ACM Transactions on Multimedia Computing, Communications,
and Applications (TOMCCAP), vol. 6, no. 3, 2010.
78) Tee Kiah Chia, Khe Chai Sim, Haizhou Li, and Hwee Tou Ng, “Statistical Lattice-Based Spoken
Document Retrieval”, ACM Transactions on Information Systems, vol. 28, no. 1, 2010.
79) Huy Dat Tran and Haizhou Li, “Jump Function Kolmogorov for Audio Classification in Noise-
mismatch Conditions”, IEEE Transactions on Signal Processing, vol. 57, no. 8, pp. 2908-2918, 2009.
80) Rong Tong, Bin Ma, Haizhou Li, and Eng Siong Chng, “A Target-Oriented Phonotactic Front-end for
Spoken Language Recognition”, IEEE Transactions on Audio, Speech and Language Processing, vol.
17, no. 7, pp. 1335-1347, 2009.
81) Chang Hui You, Kong-Aik Lee, and Haizhou Li, “An SVM Kernel with GMM-Supervector Based on
the Bhattacharyya Distance for Speaker Recognition”, IEEE Signal Processing Letters, vol. 16, no. 1,
pp. 49-52, 2009.
82) Donglai Zhu, Haizhou Li, Bin Ma, and Chin-Hui Lee, “Optimizing the Performance of Spoken
Language Recognition with Discriminative Training”, IEEE Transactions on Audio, Speech and
Language Processing, vol. 16, no. 8, pp. 1642-165, 2008.
83) Xiong Xiao, Eng Siong Chng, and Haizhou Li, “Normalization of the Speech Modulation Spectra for
Robust Speech Recognition”, IEEE Transactions on Audio, Speech and Language Processing, vol. 16,
no. 8, pp. 1662-1674, 2008.
84) Haizhou Li, Jin-Shea Kuo, Jian Su, and Chih-Lung Lin, “Mining Live Transliterations using
Incremental Learning Algorithms”, International Journal of Computer Processing of Languages, vol.
21, no. 2, pp. 183-203, 2008.
85) Khe Chia Sim and Haizhou Li, “On Acoustic Diversification Front-end for Spoken Language
Identification”, IEEE Transactions on Audio, Speech and Language Processing, vol. 16, no. 5, pp.
1029-1037, 2008.
86) Jin-shea Kuo, Haizhou Li, and Ying-Kuei Yang, “Active Learning for Constructing Transliteration
Lexicons from the Web”, Journal of the American Society for Information Science and Technology, vol.
59, no. 1, 2008.
87) Bin Ma, Haizhou Li, and Rong Tong, “Spoken Language Recognition with Ensemble Classifiers”,
IEEE Transactions on Audio, Speech and Language Processing, vol. 15, no. 7, 2007.
88) Xiong Xiao, Eng Siong Chng, and Haizhou Li, “Temporal structure normalization of speech feature
for robust speech recognition”, IEEE Signal Processing Letters, vol. 14, no. 7, 2007.
89) Jin-Shea Kuo, Haizhou Li, and Ying-Kuei Yang, “A Phonetic Similarity Model for Automatic
Extraction of Transliteration Pairs”, ACM Transactions on Asian Language Information Processing,
vol. 6, no. 2, September 2007.
90) Tin Lay Nwe and Haizhou Li, “Exploring Vibrato-Motivated Acoustic Features for Singer
Identification”, IEEE Transactions on Audio, Speech and Language Processing, vol. 15, no. 2, 2007.
91) Haizhou Li, Bin Ma, and Chin-Hui Lee, “A Vector Space Modeling Approach to Spoken Language
Identification”, IEEE Transactions on Audio, Speech and Language Processing, vol. 15, no. 1, 2007.
92) Minghui Dong, Kim-Teng Lua, and Haizhou Li, “A Unit Selection-based Speech Synthesis Approach
for Mandarin Chinese”, Journal of Chinese Language and Computing, vol. 16, no. 1, March 2006.
93) Bin Ma and Haizhou Li, “A Comparative Study of Four Language Identification Systems”,
Computational Linguistics and Chinese Language Processing, vol. 11, no. 2, June 2006.
94) Jian Su, K. T. Ng, Haizhou Li, and Jean-Paul Haton, “Nonparametric distance measures of speaker
verification”, IEE Electronics Letters, vol. 31, no. 9, April 1995.
95) Haizhou Li, Jian Su, Jean-Paul Haton, “Short-timed speech dynamics for speaker recognition”, IEE
Electronics Letters, vol. 31, no. 17, August 1995.
Conference Papers (since 2004)
2017
1) Xiong Xiao, Shengkui Zhao, Douglas L. Jones, Eng Siong Chng, Haizhou Li:On time-frequency mask
estimation for MVDR beamforming with application in robust speech recognition. ICASSP 2017:
3246-3250
2) Liping Chen, Kong-Aik Lee, Bin Ma, Long Ma, Haizhou Li, Li-Rong Dai:Adaptation of PLDA for
multi-source text-independent speaker verification. ICASSP 2017: 5380-5384
3) Yougen Yuan, Cheung-Chi Leung, Lei Xie, Hongjie Chen, Bin Ma, Haizhou Li: Pairwise learning
using multi-lingual bottleneck features for low-resource query-by-example spoken term detection.
ICASSP 2017: 5645-5649
4) Shan Yang, Lei Xie, Xiao Chen, Xiaoyan Lou, Xuan Zhu, Dongyan Huang, Haizhou Li: Statistical
Parametric Speech Synthesis Using Generative Adversarial Networks Under A Multi-task Learning
Framework. CoRR abs/1707.01670 (2017)
5) D.-Y. Huang, Wan Ding, Mingyu Xu, Huaiping Ming, Minghui Dong, Xinguo Yu, Haizhou Li,
Multimodal Prediction of Affective Dimensions via Fusing Multiple Regression Techniques,
INTERSPEECH 2017
6) Kong Aik Lee, Haizhou Li , Gain Compensation for Fast i-Vector Extraction Over Short Duration,
INTERSPEECH 2017
7) Chenglin Xu, Xiong Xiao, Sining Sun, Wei Rao, Eng Siong Chng, Haizhou Li, Weighted Spatial
Covariance Matrix Estimation for MUSIC Based TDOA Estimation of Speech Source,
INTERSPEECH 2017
8) Saad Irtza, Vidhyasaharan Sethu, Eliathamby Ambikairajah, Haizhou Li , Investigating Scalability in
Hierarchical Language Identification System, INTERSPEECH 2017
9) Jie Wu, D.-Y. Huang, Lei Xie, Haizhou Li , Denoising Recurrent Neural Network for Deep
Bidirectional LSTM Based Voice Conversion, INTERSPEECH 2017
10) Berrak Sisman, Haizhou Li, Kay Chen Tan, Transformation of Prosody in Voice Conversion, APSIPA
ASC 2017
11) Chitralekha Gupta, Haizhou Li, Ye Wang, Perceptual Evaluation of Singing Quality, APSIPA ASC
2017
12) Berrak Sisman, Haizhou Li, Kay Chen Tan, Sparse Representation of Phonetic Features for Voice
Conversion with and without parallel data, ASRU 2017
13) Shan Yang, Lei Xie, Xiao Chen, Xiaoyan Lou, Xuan Zhu, Dongyan Huang, Haizhou Li, Statistical
Parametric Speech Synthesis using Generative Adversarial Networks under a Multi-task Learning
Framework, ASRU 2017
14) Hongjie Chen, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li, Multilingual bottle-neck feature
learning from Untranscribed Speech, ASRU 2017
15) Yougen Yuan, Cheung-Chi Leung, Lei Xie, Hongjie Chen, Bin Ma, Haizhou Li, Extracting Bottleneck
Features and Word-like Pairs from Untranscribed Speech from Feature Representation, ASRU 2017
2016
16) Seokhwan Kim, Rafael E. Banchs, Haizhou Li, Exploring Convolutional and Recurrent Neural
Networks in Sequential Labelling for Dialogue Topic Tracking. ACL (1) 2016
17) Nancy F. Chen, Haizhou Li, “Computer-assisted pronunciation training: From pronunciation scoring
towards spoken language learning”, in Proceedings of APSIPA 2016, pp. 1-7
18) Xiaohai Tian, Xiong Xiao, Eng Siong Chng, Haizhou Li, “Spoofing speech detection using temporal
convolutional neural network”, in Proceedings of APSIPA 201, pp. 1-6.
19) Xiong Xiao, Shinji Watanabe, Eng Siong Chng, Haizhou Li, “Beamforming networks using spatial
covariance features for far-field speech recognition”, in Proceedings of APSIPA 2016, pp. 1-6.
20) Haihua Xu, Wei Rao, Xiong Xiao, Hao Huang, Eng Siong Chng, Haizhou Li, “I-vector based deep
neural network acoustic model adaptation using multilingual language resource”, in Proceedings of
APSIPA 2016, pp. 1-5.
21) Xiaohai Tian, Zhizheng Wu, Xiong Xiao, Eng Siong Chng, Haizhou Li, “Spoofing detection from a
feature representation perspective”, in Proceedings of ICASSP 2016, pp. 2119-2123.
22) Huaiping Ming, Dong-Yan Huang, Lei Xie, Shaofei Zhang, Minghui Dong, Haizhou Li, “Exemplar-
based sparse representation of timbre and prosody for voice conversion”, in Proceedings of ICASSP
2016, pp. 5175-5179.
23) Liping Chen, Kong-Aik Lee, Eng Siong Chng, Bin Ma, Haizhou Li, Li-Rong Dai, “Content-aware
local variability vector for speaker verification with short utterance”, in Proceedings of ICASSP 2016,
pp.5485-5489.
24) Saad Irtza, Vidhyasaharan Sethu, Haris Bavattichalil, Eliathamby Ambikairajah, Haizhou Li, “A
hierarchical framework for language identification”, in Proceedings of ICASSP 2016, pp. 5820-5824.
25) Chongjia Ni, Cheung-Chi Leung, Lei Wang, Haibo Liu, Feng Rao, Li Lu, Nancy F. Chen, Bin Ma,
Haizhou Li, “Cross-lingual deep neural network based submodular unbiased data selection for low-
resource keyword search”, in Proceedings of ICASSP 2016, pp. 6015-6019.
26) Haihua Xu, Jingyong Hou, Xiong Xiao, Van Tung Pham, Cheung-Chi Leung, Lei Wang, Van Hai Do,
Hang Lv, Lei Xie, Bin Ma, Eng Siong Chng, Haizhou Li, “Approximate search of audio queries by
using DTW with phone time boundary and data augmentation”, in Proceedings of ICASSP 2016, pp.
6030-6034.
27) Van Tung Pham, Haihua Xu, Xiong Xiao, Nancy F. Chen, Eng Siong Chng, Haizhou Li, “Keyword
search using query expansion for graph-based rescoring of hypothesized detections”, in Proceedings of
ICASSP 2016, pp. 6035-6039.
28) Nancy F. Chen, Van Tung Pharri, Haihua Xu, Xiong Xiao, Van Hai Do, Chongjia Ni, I-Fan Chen,
Sunil Sivadas, Chin-Hui Lee, Eng Siong Chng, Bin Ma, Haizhou Li, “Exemplar-inspired strategies for
low-resource spoken keyword search in Swahili”, in Proceedings of ICASSP 2016, pp. 6040-6044.
29) Xiong Xiao, Shengkui Zhao, Thi Ngoc Tho Nguyen, Douglas L. Jones, Eng Siong Chng, Haizhou Li,
“An expectation-maximization eigenvector clustering approach to direction of arrival estimation of
multiple speech sources”, in Proceedings of ICASSP 2016, pp. 6330-6334.
30) Dong-Yan Huang, Minghui Dong, Haizhou Li, “Combining multiple kernel models for automatic
intelligibility detection of pathological speech”, in Proceedings of ICASSP 2016: 6485-6489.
31) Wan Ding, Mingyu Xu, Dong-Yan Huang, Weisi Lin, Minghui Dong, Xinguo Yu, Haizhou Li, “Audio
and face video emotion recognition in the wild using deep neural networks and small datasets. ”, in
Proceedings of ICMI 2016, pp. 506-513.
32) Yougen Yuan, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li, “Learning Neural Network
Representations Using Cross-Lingual Bottleneck Features with Word-Pair Information”, in
Proceedings of INTERSPEECH 2016, pp. 788-792.
33) Hongjie Chen, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li, “Unsupervised Bottleneck Features
for Low-Resource Query-by-Example Spoken Term Detection”, in Proceedings of INTERSPEECH
2016, pp. 923-927.
34) Van Tung Pham, Haihua Xu, Xiong Xiao, Nancy F. Chen, Eng Siong Chng, Haizhou Li, “Rescoring
Hypothesized Detections of Out-of-Vocabulary Keywords Using Subword Samples”, in Proceedings of
INTERSPEECH 2016, pp. 933-937.
35) Paul Yaozhu Chan, Minghui Dong, Grace Xue Hui Ho, Haizhou Li, “SERAPHIM: A Wavetable
Synthesis System with 3D Lip Animation for Real-Time Speech and Singing Applications on Mobile
Platforms”, in Proceedings of INTERSPEECH 2016, pp. 1225-1229.
36) Haihua Xu, Hang Su, Chongjia Ni, Xiong Xiao, Hao Huang, Eng Siong Chng, Haizhou Li, “Semi-
Supervised and Cross-Lingual Knowledge Transfer Learnings for DNN Hybrid Acoustic Models
Under Low-Resource Conditions”, in Proceedings of INTERSPEECH 2016, pp. 1315-1319.
37) Jia Yu, Xiong Xiao, Lei Xie, Eng Siong Chng, Haizhou Li, “A DNN-HMM Approach to Story
Segmentation”, in Proceedings of INTERSPEECH 2016, pp. 1527-1531.
38) Nancy F. Chen, Rong Tong, Darren Wee, Pei Xuan Lee, Bin Ma, Haizhou Li, “SingaKids-Mandarin:
Speech Corpus of Singaporean Children Speaking Mandarin Chinese”, in Proceedings of
INTERSPEECH 2016, pp. 1545-1549.
39) Xiaohai Tian, Zhizheng Wu, Xiong Xiao, Eng Siong Chng, Haizhou Li, “An Investigation of Spoofing
Speech Detection Under Additive Noise and Reverberant Conditions”, in Proceedings of
INTERSPEECH 2016, pp. 1715-1719.
40) Paul Yaozhu Chan, Minghui Dong, Grace Xue Hui Ho, Haizhou Li, “SERAPHIM Live! - Singing
Synthesis for the Performer, the Composer, and the 3D Game Developer”, in Proceedings of
INTERSPEECH 2016, pp. 1966-1967.
41) Huaiping Ming, Dong-Yan Huang, Lei Xie, Jie Wu, Minghui Dong, Haizhou Li, “Deep Bidirectional
LSTM Modeling of Timbre and Prosody for Emotional Voice Conversion”, in Proceedings of
INTERSPEECH 2016, pp. 2453-2457.
42) Rong Tong, Nancy F. Chen, Bin Ma, Haizhou Li, “Context Aware Mispronunciation Detection for
Mandarin Pronunciation Training”, in Proceedings of INTERSPEECH 2016, pp. 3112-3116.
43) Kong-Aik Lee, Haizhou Li, Li Deng, Ville Hautamäki, Wei Rao, Xiong Xiao, Anthony Larcher,
Hanwu Sun, Trung Hieu Nguyen, Guangsen Wang, Aleksandr Sizov, Jianshu Chen, Ivan Kukanov,
Amir Hossein Poorjam, Trung Ngo Trong, Chenglin Xu, Haihua Xu, Bin Ma, Eng Siong Chng, Sylvain
Meignier, “The 2015 NIST Language Recognition Evaluation: The Shared View of I2R, Fantastic4 and
SingaMS”, in Proceedings of INTERSPEECH 2016, pp. 3211-3215.
44) Saad Irtza, Vidhyasaharan Sethu, Sarith Fernando, Eliathamby Ambikairajah, Haizhou Li, “Out of Set
Language Modelling in Hierarchical Language Identification”, in Proceedings of INTERSPEECH 2016,
pp. 3270-3274.
45) Chongjia Ni, Lei Wang, Cheung-Chi Leung, Feng Rao, Li Lu, Bin Ma, Haizhou Li, “Rapid Update of
Multilingual Deep Neural Network for Low-Resource Keyword Search”, in Proceedings of
INTERSPEECH 2016, pp. 3698-3702.
46) Cheung-Chi Leung, Lei Wang, Haihua Xu, Jingyong Hou, Van Tung Pham, Hang Lv, Lei Xie, Xiong
Xiao, Chongjia Ni, Bin Ma, Eng Siong Chng, Haizhou Li, “Toward High-Performance Language-
Independent Query-by-Example Spoken Term Detection for MediaEval 2015: Post-Evaluation
Analysis”, in Proceedings of INTERSPEECH 2016, pp. 3703-3707.
2015
47) Huaiping Ming, Dong-Yan Huang, Minghui Dong, Haizhou Li, Lei Xie, Shaofei Zhang “Fundamental
frequency modeling using wavelets for emotional voice conversion”, in Proceedings of ACII 2015, pp.
804-809.
48) Van Hai Do, Xiong Xiao, Eng Siong Chng, Haizhou Li “Distance metric learning for kernel density-
based acoustic model under limited training data conditions”, in Proceedings of APSIPA 2015, pp.
54-58.
49) Jia Yu, Lei Xie, Xiong Xiao, Eng Siong Chng, Haizhou Li, “A density peak clustering approach to
unsupervised acoustic subword units discovery”, in Proceedings of APSIPA 2015, pp. 178-183.
50) Shaofei Zhang, Dong-Yan Huang, Lei Xie, Eng Siong Chng, Haizhou Li, Minghui Dong, “Non-
negative matrix factorization using stable alternating direction method of multipliers for source
separation”, in Proceedings of APSIPA 2015, pp. 222-228.
51) Van Tung Pham, Haihua Xu, Van Hai Do, Tze Yuang Chong, Xiong Xiao, Eng Siong Chng, Haizhou
Li, “On the study of very low-resource language keyword search”, in Proceedings of APSIPA 2015, pp.
358-364.
52) Minghui Dong, Chenyu Yang, Yanfeng Lu, Jochen Walter Ehnes, Dong-Yan Huang, Huaiping Ming,
Rong Tong, Siu Wa Lee, Haizhou Li, “Mapping frames with DNN-HMM recognizer for non-parallel
voice conversion” in Proceedings of APSIPA 2015, pp. 488-494.
53) Van Hai Do, Xiong Xiao, Haihua Xu, Eng Siong Chng, Haizhou Li, “Multilingual exemplar-based
acoustic model for the NIST Open KWS 2015 evaluation”, in Proceedings of APSIPA 2015, pp. 594-
98.
54) Shengkui Zhao, Xiong Xiao, Zhaofeng Zhang, Thi Ngoc Tho Nguyen, Xionghu Zhong, Bo Ren,
Longbiao Wang, Douglas L. Jones, Engsiong Chng, Haizhou Li, “Robust speech recognition using
beamforming with adaptive microphone gains and multichannel noise reduction”, in Proceedings of
ASRU 2015, pp. 460-467.
55) Haihua Xu, Xiong Xiao, Engsiong Chng, Haizhou Li “On statistical machine translation method for
lexicon refinement in speech recognition”, in Proceedings of ChinaSIP 2015, pp. 25-29.
56) Xiaohai Tian, Steven Du, Xiong Xiao, Haihua Xu, Engsiong Chng, Haizhou Li, “Detecting synthetic
speech using long term magnitude and phase information”, in Proceedings of ChinaSIP 2015, pp.
611-615.
57) Seokhwan Kim, Rafael E. Banchs, Haizhou Li, “Wikification of Concept Mentions within Spoken
Dialogues Using Domain Constraints from Wikipedia”, in Proceedings of EMNLP 2015, pp. 2225-
2229.
58) Kui Wu, Xuancong Wang, Nina Zhou, AiTi Aw, Haizhou Li, “Joint Chinese word segmentation and
punctuation prediction using deep recurrent neural network for social media data”, in Proceedings of
IALP 2015, pp. 41-44.
59) Gillian Chua, Qian Ci Chang, Ye Won Park, Paul Yaozhu Chan, Minghui Dong, Haizhou Li, “The
expression of singing emotion - contradicting the constraints of song”, in Proceedings of IALP 2015,
pp. 98-102.
60) Yang Yu, Weisi Lin, Dong-Yan Huang, Minghui Dong, Haizhou Li, “Performance scoring of singing
voice”, in Proceedings of IALP 2015, pp. 119-122.
61) Ridong Jiang, Seokhwan Kim, Rafael E. Banchs, Haizhou Li, “Towards improving the performance of
Vector Space Model for Chinese Frequently Asked Question Answering”, in Proceedings of IALP
2015, pp. 136-139.
62) Miaolong Yuan, Bo Tian, Vui Ann Shim, Huajin Tang, and Haizhou Li, “An Entorhinal-Hippocampal
Model for Simultaneous Cognitive Map Building”, in Proceedings of AAAI-15, Austin Texas, USA,
2015, pp.586-592.
63) Jonathan Dennis, Tran Huy Dat, and Haizhou Li, “Combining Robust Spike Coding with Spiking
Neural Networks for Sound Event Classification”, in Proceedings of ICASSP 2015, Brisbane, Australia,
April 2015.
64) Xiong Xiao, Shengkui Zhao, Xionghu Zhong, Douglas L. Jones, Eng Siong Chng, and Haizhou Li, “A
Learning-based Approach to Direction of Arrival Estimation in Noisy and reverberant Environments”,
in Proceedings of ICASSP 2015, Brisbane, Australia, April 2015.
65) Sven Ewan Shepstone, Kong Aik Lee, Haizhou Li, Zheng-Hua Tan, and Søren Holdt Jensen ,
“Source-Specific Informative Prior for i-Vector Extraction”, in Proceedings of ICASSP 2015, Brisbane,
Australia, April 2015.
66) Haihua Xu, Peng Yang, Xiong Xiao, Lei Xie, Cheung-Chi Leung, Hongjie Chen, Jia Yu, Hang Lv, Lei
Wang, Su Jun Leow, Bin Ma, Eng Siong Chng, and Haizhou Li, “Language Independent Query-by-
Example Spoken Term Detection using N-Best Phone Sequences and Partial Matching”, in
Proceedings of ICASSP 2015, Brisbane, Australia, April 2015.
67) Liping Chen, Kong Aik Lee, Bin Ma, Wu Guo, Haizhou Li, and Li Rong Dai, “Channel Adaptation of
PLDA for Text-Independent Speaker Verification”, in Proceedings of ICASSP 2015, Brisbane,
Australia, April 2015.
68) Rong Tong, Nancy F. Chen, Boon Pang Lim, Bin Ma, and Haizhou Li, “Tokenizing Fundamental
Frequency Variation for Mandarin Tone Error Detection”, in Proceedings of ICASSP 2015, Brisbane,
Australia, April 2015.
69) Nancy F. Chen, Chongjia Ni, I-Fan Chen, Sunil Sivadas, Van Tung Pham, Haihua Xu, Xiong Xiao,
Tze Siong Lau, Su Jun Leow, Boon Pang Lim, Cheung-Chi Leung, Lei Wang, Chin-Hui Lee, Alvina
Goh, Eng Siong Chng, Bin Ma, and Haizhou Li, “Low-Resource Keyword Search Strategies for
Tamil”, in Proceedings of ICASSP 2015, Brisbane, Australia, April 2015.
70) Liping Chen, Kong-Aik Lee, Bin Ma, Wu Guo, Haizhou Li, Li-Rong Dai, “Phone-centric local
variability vector for text-constrained speaker verification”, in Proceedings of INTERSPEECH 2015,
pp. 229-233.
71) Nancy F. Chen, Rong Tong, Darren Wee, Pei Xuan Lee, Bin Ma, Haizhou Li, “iCALL corpus:
Mandarin Chinese spoken by non-native speakers of European descent” in Proceedings of
INTERSPEECH 2015, pp. 324-328.
72) Rong Tong, Nancy F. Chen, Bin Ma, Haizhou Li, “Goodness of tone (GOT) for non-native Mandarin
tone recognition”, in Proceedings of INTERSPEECH 2015, pp. 801-805.
73) Saad Irtza, Vidhyasaharan Sethu, Phu Ngoc Le, Eliathamby Ambikairajah, Haizhou Li “Phonemes
frequency based PLLR dimensionality reduction for language recognition”, in Proceedings of
INTERSPEECH 2015, pp. 997-1001.
74) Longting Xu, Kong-Aik Lee, Haizhou Li, Zhen Yang, “Sparse coding of total variability matrix” in
Proceedings of INTERSPEECH 2015, pp. 1022-1026.
75) Tze Yuang Chong, Rafael E. Banchs, Engsiong Chng, Haizhou Li, “TDTO language modeling with
feedforward neural networks” in Proceedings of INTERSPEECH 2015, pp. 1458-1462.
76) Shaofei Zhang, Dong-Yan Huang, Lei Xie, Engsiong Chng, Haizhou Li, Minghui Dong, “Regularized
non-negative matrix factorization using alternating direction method of multipliers and its application
to source separation.”, in Proceedings of INTERSPEECH 2015, pp. 1498-1502.
77) Jonathan William Dennis, Tran Huy Dat, Haizhou Li, “Spiking neural networks and the generalised
hough transform for speech pattern detection”, in Proceedings of INTERSPEECH 2015, pp. 1997-
2001.
78) Xiong Xiao, Xiaohai Tian, Steven Du, Haihua Xu, Engsiong Chng, Haizhou Li, “Spoofing speech
detection using high dimensional magnitude and phase features: the NTU approach for ASVspoof 2015
challenge”, in Proceedings of INTERSPEECH 2015, pp. 2052-2056.
79) Kong-Aik Lee, Guangsen Wang, Kam Pheng Ng, Hanwu Sun, Trung Hieu Nguyen, Ngoc Thuy Huong
Thai, Bin Ma, Haizhou Li, ”The reddots platform for mobile crowd-sourcing of speech data”, in
Proceedings of INTERSPEECH 2015, pp. 2603-2604.
80) Dong-Yan Huang, Minghui Dong, Haizhou Li, ”A real-time variable-q non-stationary Gabor
transform for pitch shifting”, in Proceedings of INTERSPEECH 2015, pp. 2744-2748.
81) Kong-Aik Lee, Anthony Larcher, Guangsen Wang, Patrick Kenny, Niko Brümmer, David A. van
Leeuwen, Hagai Aronowitz, Marcel Kockmann, Carlos Vaquero, Bin Ma, Haizhou Li, Themos
Stafylakis, Md. Jahangir Alam, Albert Swart, Javier Perez, “The reddots data collection for speaker
recognition”, in Proceedings of INTERSPEECH 2015, pp. 2996-3000.
82) Hongjie Chen, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li ,“Parallel inference of dirichlet
process Gaussian mixture models for unsupervised acoustic modeling: a feasibility study” in
Proceedings of INTERSPEECH 2015, pp. 3189-3193.
83) Huaiping Ming, Dong-Yan Huang, Lei Xie, Haizhou Li, Minghui Dong, “An alternating optimization
approach for phase retrieval” in Proceedings of INTERSPEECH 2015, pp. 3426-3430.
84) Xiong Xiao, Shengkui Zhao, Xionghu Zhong, Douglas L. Jones, Engsiong Chng, Haizhou Li,
“Learning to estimate reverberation time in noisy and reverberant rooms”, in Proceedings of
INTERSPEECH 2015, pp. 3431-3435.
85) Sheng Gao, Haizhou Li “Popular song summarization using chorus section detection from audio
signal”, in Proceedings of MMSP 2015, pp. 1-6.
86) Seokhwan Kim, Rafael E. Banchs, Haizhou Li, “Towards Improving Dialogue Topic Tracking
Performances with Wikification of Concept Mentions”, in Proceedings of SIGDIAL Conference 2015,
pp. 124-128.
2014
87) Seokhwan Kim, Rafael E. Banchs, and Haizhou Li, “A Composite Kernel Approach for Dialog Topic
Tracking with Structured Domain Knowledge from Wikipedia”, in Proceedings of ACL-2014, vol.2,
Baltimore, Maryland, USA, 2014, pp.19-13.
88) Dong-Yan Huang, Haizhou Li, and Minghui Dong, “Ensemble Nyström method for predicting conflict
level from speech”, in Proceedings of APSIPA ASC 2014, Cambodia, 2014.
89) Guangpu Huang, Chenglin Xu, Xiong Xiao, Lei Xie, Chng Eng Siong, and Haizhou Li, “Multi-view
features in a DNN-CRF model for improved sentence unit detection on English broadcast news”, in
Proceedings of APSIPA ASC 2014, Cambodia, 2014.
90) Shuojun Liu, Dong-Yan Huang, Weisi Lin, Minghui Dong, Haizhou Li, and Ee Ping Ong, “Emotional
facial expression transfer based on temporal restricted Boltzmann machines”, in Proceedings of
APSIPA ASC 2014, Cambodia, 2014.
91) Zhizheng Wu, Sheng Gao, Eng Siong Chng, and Haizhou Li, “A study on replay attack and anti-
spoofing for text-dependent speaker verification”, in Proceedings of APSIPA ASC 2014, Cambodia,
2014.
92) Haihua Xu, Van Tung Pham, Eng Siong Chng, and Haizhou Li, “Towards better keyword search
performance on Malay broadcast news data”, in Proceedings of APSIPA ASC 2014, Cambodia, 2014.
93) Seokhwan Kim, Rafael E. Banchs, and Haizhou Li, “Wikipedia-based Kernels for dialogue topic
tracking”, in Proceedings of ICASSP 2014, Florence, Italy, May 2014, pp.131-135.
94) Anthony Larcher, Kong-Aik Lee, Bin Ma, and Haizhou Li, “Modelling the alternative hypothesis for
text-dependent speaker verification”, in Proceedings of ICASSP 2014, Florence, Italy, May 2014,
pp.734-738.
95) Anthony Larcher, Kong-Aik Lee, Bin Ma, and Haizhou Li, “Imposture classification for text-
dependent speaker verification”, in Proceedings of ICASSP 2014, Florence, Italy, May 2014, pp.739-
743.
96) Xiong Xiao, Jinyu Li, Eng Siong Chng, and Haizhou Li, “Feature compensation using linear
combination of speaker and environment dependent correction vectors”, in Proceedings of ICASSP
2014, Florence, Italy, May 2014, pp.1720-1724.
97) Duc Hoang Ha Nguyen, Xiong Xiao, Eng Siong Chng, and Haizhou Li, “Generalization of temporal
filter and linear transformation for robust speech recognition”, in Proceedings of ICASSP 2014,
Florence, Italy, May 2014, pp.1730-1734.
98) Jonathan William Dennis, Tran Huy Dat, Haizhou Li, and Eng Siong Chng, “A discriminatively
trained Hough Transform for frame-level phoneme recognition”, in Proceedings of ICASSP 2014,
Florence, Italy, May 2014, pp.2514-2518.
99) Dong-Yan Huang, Minghui Dong, and Haizhou Li, “Intelligibility detection of pathological speech
using asymmetric sparse kernel partial least squares classifier”, in Proceedings of ICASSP 2014,
Florence, Italy, May 2014, pp.3744-3748.
100) Liping Chen, Kong-Aik Lee, Bin Ma, Wu Guo, Haizhou Li, and Li-Rong Dai, “Minimum
divergence estimation of speaker prior in multi-session PLDA scoring”, in Proceedings of ICASSP
2014, Florence, Italy, May 2014, pp.4007-4011.
101) Nancy F. Chen, Sunil Sivadas, Boon Pang Lim, Hoang Gia Ngo, Haihua Xu, Van Tung Pham,
Bin Ma, and Haizhou Li, “Strategies for Vietnamese keyword search”, in Proceedings of ICASSP
2014, Florence, Italy, May 2014, pp.4121-4125.
102) Tze Yuang Chong, Rafael E. Banchs, Eng Siong Chng, and Haizhou Li, “Improving language
modeling by using distance and co-occurrence information of word-pairs and its application to
LVCSR”, in Proceedings of ICASSP 2014, Florence, Italy, May 2014, pp.4883-4887.
103) Rong Tong, Boon Pang Lim, Nancy F. Chen, Bin Ma, and Haizhou Li, “Subspace Gaussian
mixture model for computer-assisted language learning”, in Proceedings of ICASSP 2014, Florence,
Italy, May 2014, pp.5347-5351.
104) Van Tung Pham, Haihua Xu, Nancy F. Chen, Sunil Sivadas, Boon Pang Lim, Eng Siong Chng,
and Haizhou Li, “Discriminative score normalization for keyword search decision”, in Proceedings of
ICASSP 2014, Florence, Italy, May 2014, pp.7078-7082.
105) Van Hai Do, Xiong Xiao, Eng Siong Chng, and Haizhou Li, “Kernel density-based acoustic
model with cross-lingual bottleneck features for resource limited LVCSR”, in Proceedings of
INTERSPEECH 2014, Singapore, September 2014, pp.6-10.
106) Haipeng Wang, Tan Lee, Cheung-Chi Leung, Bin Ma, and Haizhou Li, “A graph-based
Gaussian component clustering approach to unsupervised acoustic modeling”, in Proceedings of
INTERSPEECH 2014, Singapore, September 2014, pp.875-879.
107) Anthony Larcher, Kong-Aik Lee, Pablo Luis Sordo Martinez, Trung Hieu Nguyen, Bin Ma,
and Haizhou Li, “Extended RSR2015 for text-dependent speaker verification over VHF channel”, in
Proceedings of INTERSPEECH 2014, Singapore, September 2014, pp.1322-1326.
108) Hoang Gia Ngo, Nancy F. Chen, Sunil Sivadas, Bin Ma, and Haizhou Li, “A minimal-
resource transliteration framework for Vietnamese”, in Proceedings of INTERSPEECH 2014,
Singapore, September 2014, pp.1410-1414.
109) Peng Yang, Cheung-Chi Leung, Lei Xie, Bin Ma, and Haizhou Li, “Intrinsic spectral analysis
based on temporal context features for query-by-example spoken term detection”, in Proceedings of
INTERSPEECH 2014, Singapore, September 2014, pp.1722-1726.
110) Haihua Xu, Hang Su, Eng Siong Chng, and Haizhou Li, “Semi-supervised training for bottle-
neck feature based DNN-HMM hybrid systems”, in Proceedings of INTERSPEECH 2014, Singapore,
September 2014, pp.2078-2082.
111) Minghui Dong, Siu Wa Lee, Haizhou Li, Paul Y. Chan, Xuejian Peng, Jochen Walter Ehnes,
and Dong-Yan Huang, “I2R speech2singing perfects everyone's singing”, in Proceedings of
INTERSPEECH 2014, Singapore, September 2014, pp.2148-2149.
112) Siu Wa Lee, Zhizheng Wu, Minghui Dong, Xiaohai Tian, and Haizhou Li, “A comparative
study of spectral transformation techniques for singing voice synthesis”, in Proceedings of
INTERSPEECH 2014, Singapore, September 2014, pp.2499-2503.
113) Zhizheng Wu, Eng Siong Chng, and Haizhou Li, “Joint nonnegative matrix factorization for
exemplar-based voice conversion”, in Proceedings of INTERSPEECH 2014, Singapore, September
2014, pp.2509-2513.
114) Chenglin Xu, Lei Xie, Guangpu Huang, Xiong Xiao, Eng Siong Chng, and Haizhou Li, “A
deep neural network approach for sentence boundary detection in broadcast news”, in Proceedings of
INTERSPEECH 2014, Singapore, September 2014, pp.2887-2891.
115) Rong Tong, Bin Ma, and Haizhou Li, “Virtual example for phonotactic language recognition”,
in Proceedings of INTERSPEECH 2014, Singapore, September 2014, pp.3017-3021.
116) Vui Ann Shim, Bo Tian, Miaolong Yuan, Huajin Tang, and Haizhou Li, “Direction-driven
navigation using cognitive map for mobile robots”, in Proceedings of the IEEE/RSJ International
Conference on Intelligent Robots and Systems (IROS 2014), Chicago, Illinois, USA, pp.2639-2646.
117) Liping Chen, Kong-Aik Lee, Bin Ma, Wu Guo, Haizhou Li, and Li-Rong Dai, “Local
variability vector for text-independent speaker verification”, in Proceedings of ISCSLP 2014,
Singapore, September 2014, pp.54-58.
118) Yuma Ueda, Longbiao Wang, Atsuhiko Kai, Xiong Xiao, Eng Siong Chng, and Haizhou Li,
“Single-channel dereverberation for distant-talking speech recognition by combining denoising
autoencoder and temporal structure normalization”, in Proceedings of ISCSLP 2014, Singapore,
September 2014, pp.379-383.
119) Kelvin Poon-Feng, Dong-Yan Huang, Minghui Dong, and Haizhou Li, “Acoustic emotion
recognition based on fusion of multiple feature-dependent deep Boltzmann machines”, in Proceedings
of ISCSLP 2014, Singapore, September 2014, pp.584-588.
120) Nicole Mirnig, Yeow Kee Tan, Tai Wen Chang, Yuanwei Chua, Tran Anh Dung, Haizhou Li,
and Manfred Tscheligi, “Screen feedback in human-robot interaction: How to enhance robot
expressiveness”, in Proceedings of IEEE International Symposium on Robot and Human Interactive
Communication (RO-MAN 2014), Edinburgh, UK, 2014, pp.224-230.
121) Van Tung Pham, Nancy F. Chen, Sunil Sivadas, Haihua Xu, I-Fan Chen, Chongjia Ni, Eng
Siong Chng, and Haizhou Li, “System and keyword dependent fusion for spoken term detection”, in
Proceedings of IEEE Spoken Language Technology Workshop (SLT 2014), South Lake Tahoe, Nevada,
USA, 2014, pp.430-435.
122) Andreea I. Niculescu, Rafael E. Banchs, and Haizhou Li, “Why Industrial Robots Should
Become More Social - On the Design of a Natural Language Interface for an Interactive Robot Welder”,
in Proceedings of ICSR 2014, Sydney, Australia, 2014, pp.276-278.
2013
123) Zhizheng Wu and Haizhou Li, “Voice conversion and spoofing attack on speaker verification
systems”, in Proceedings of APSIPA ASC 2013, Kaohsiung, Taiwan, 2013. (Invited paper)
124) Duc Hoang Ha Nguyen, Aleem Mushtaq, Xiong Xiao, Eng Siong Chng, Haizhou Li, and
Chin Hui Lee, “A Particle Filter Compensation Approach to Robust LVCSR”, in Proceedings of
APSIPA ASC 2013, Kaohsiung, Taiwan, 2013.
125) Tze Yuang Chong, Rafael E. Banchs, Eng Siong Chng, and Haizhou Li, “Modeling of term-
distance and term-occurrence information for improving n-gram language model performance”, in
Proceedings of ACL-2013, Sofia, Bulgaria, 2013, pp.233-237.
126) Xiaoming Lu, Lei Xie, Cheung-Chi Leung, Bin Ma, and Haizhou Li, “Broadcast news story
segmentation using manifold learning on latent topic distributions”, in Proceedings of ACL-2013,
Sofia, Bulgaria, 2013, pp. 190-195.
127) Zhizheng Wu, Eng Siong Chng, and Haizhou Li, "Conditional restricted boltzmann machine
for voice conversion", in Proceedings of ChinaSIP 2013, Beijing, China, 2013.
128) Vidhyasaharan Sethu, Julien Epps, Eliathamby Ambikairajah, and Haizhou Li, “GMM Based
Speaker Variability Compensated System for Interspeech 2013 ComParE Emotion Challenge”, in
Proceedings of INTERSPEECH 2013, Lyon, France, August 2013.
129) Van Hai Do, Xiong Xiao, Eng Siong Chng, and Haizhou Li, “Context-Dependent Phone
Mapping for LVCSR of Under-Resourced Languages”, in Proceedings of INTERSPEECH 2013, Lyon,
France, August 2013.
130) Xiong Xiao, Eng Siong Chng, and Haizhou Li, “Attribute-Based Histogram Equalization
(HEQ) and its Adaptation for Robust Speech Recognition”, in Proceedings of INTERSPEECH 2013,
Lyon, France, August 2013.
131) Zhizheng Wu, Anthony Larcher, Kong Aik Lee, Eng Siong Chng, Tomi Kinnunen, and
Haizhou Li, “Vulnerability Evaluation of Speaker Verification Under Voice Conversion Spoofing:
The Effect of Text Constraints”, in Proceedings of INTERSPEECH 2013, Lyon, France, August 2013.
132) R. Saeidi, Kong Aik Lee, Tomi Kinnunen, Taufiq Hasan, Benoit Fauve, P.-M. Bousquet, Elie
Khoury, P.L. Sordo Martinez, J. M. K. Kua, Chang Huai You, Hanwu Sun, Anthony Larcher,
Padmanabhan Rajan, Ville Hautamäki, Cemal Hanilçi, B. Braithwaite, Rosa González Hautamäki,
Seyed Omid Sadjadi, Gang Liu, Hynek Boril, N. Shokouhi, D. Matrouf, L. El Shafey, Pejman
Mowlaee, Julien Epps, T. Thiruvaran, David A. van Leeuwen, Bin Ma, Haizhou Li, John H.L. Hansen,
and Jean-Francois Bonastre, “I4U Submission to NIST SRE 2012: A Large-Scale Collaborative Effort
for Noise-Robust Speaker Verification”, in Proceedings of INTERSPEECH 2013, Lyon, France,
August 2013.
133) Haipeng Wang, Tan Lee, Cheung-Chi Leung, Bin Ma, and Haizhou Li, “Unsupervised
Mining of Acoustic Subword Units with Segment-Level Gaussian Posteriorgrams”, in Proceedings of
INTERSPEECH 2013, Lyon, France, August 2013.
134) Nancy F. Chen, Vivaek Shivakumar, Mahesh Harikumar, Bin Ma, and Haizhou Li, “Large-
Scale Characterization of Mandarin Pronunciation Errors Made by Native Speakers of European
Languages”, in Proceedings of INTERSPEECH 2013, Lyon, France, August 2013.
135) Anthony Larcher, Jean-Francois Bonastre, Benoit Fauve, Kong Aik Lee, Christophe Lévy,
Haizhou Li, John S. D. Mason, and Jean-Yves Parfait, “ALIZE 3.0 — Open Source Toolkit for State-
of-the-Art Speaker Recognition”, in Proceedings of INTERSPEECH 2013, Lyon, France, August 2013.
136) Zhizheng Wu, Tuomas Virtanen, Tomi Kinnunen, Eng Siong Chng, and Haizhou Li,
“Exemplar-Based Unit Selection for Voice Conversion Utilizing Temporal Information”, in
Proceedings of INTERSPEECH 2013, Lyon, France, August 2013.
137) Kong Aik Lee, Anthony Larcher, Chang Huai You, Bin Ma, and Haizhou Li, “Multi-Session
PLDA Scoring of i-Vector for Partially Open-Set Speaker Detection”, in Proceedings of
INTERSPEECH 2013, Lyon, France, August 2013.
138) Zhizheng Wu, Xiong Xiao, Eng Siong Chng, and Haizhou Li, “Synthetic Speech Detection
using Temporal Modulation Feature”, in Proceedings of ICASSP 2013, Vancouver, Canada, May 2013.
139) Dau-Cheng Lyu, Eng-Siong Chng, and Haizhou Li, “Language Diarization for Code-Switch
Conversational Speech”, in Proceedings of ICASSP 2013, Vancouver, Canada, May 2013.
140) Nancy F. Chen, Bin Ma, and Haizhou Li, “Minimal-Resource Phonetic Language Models to
Summarize Untranscribed Speech”, in Proceedings of ICASSP 2013, Vancouver, Canada, May 2013.
141) Anthony Larcher, Kong Aik Lee, Bin Ma, and Haizhou Li, “Phonetically-Constrained PLDA
Modeling for Text-Dependent Speaker Verification with Multiple Short Utterances”, in Proceedings of
ICASSP 2013, Vancouver, Canada, May 2013.
142) Chang Huai You, Haizhou Li, Bin Ma, and Kong Aik Lee, “A Study on GMM-SVM with
Adaptive Relevance Factor and Its Comparison with i-Vector and JFA for Speaker Recognition”, in
Proceedings of ICASSP 2013, Vancouver, Canada, May 2013.
143) Heike Adel, Ngoc Thang Vu, Franziska Kraus, Tim Schlippe, Haizhou Li, and Tanja Schultz,
“Recurrent Neural Network Language Modeling for Code Switching Conversational Speech”, in
Proceedings of ICASSP 2013, Vancouver, Canada, May 2013.
144) Xiaoming Lu, Cheung-Chi Leung, Lei Xie, Bin Ma, and Haizhou Li, “Broadcast News Story
Segmentation using Latent Topics on Data Manifold”, in Proceedings of ICASSP 2013, Vancouver,
Canada, May 2013.
145) Jonathan Dennis, Yu Qiang, Tang Huajin, Tran Huy Dat, and Li Haizhou, “Temporal Coding
of Local Spectrogram Features for Robust Sound Recognition”, in Proceedings of ICASSP 2013,
Vancouver, Canada, May 2013.
146) Xiong Xiao, Eng Siong Chng, and Haizhou Li, “Temporal Filter Design by Minimum KL
Divergence Criterion for Robust Speech Recognition”, in Proceedings of ICASSP 2013, Vancouver,
Canada, May 2013.
147) Haipeng Wang, Tan Lee, Cheung-Chi Leung, Bin Ma, and Haizhou Li, “Using Parallel
Tokenizers with DTW Matrix Combination for Low-Resource Spoken Term Detection”, in
Proceedings of ICASSP 2013, Vancouver, Canada, May 2013.
148) Yanan Li, Keng Peng Tee, Shuzhi Sam Ge, and Haizhou Li, “Building Companionship
through Human-Robot Collaboration”, in Proceedings of ICSR 2013, Bristol, UK, October, 2013.
2012
149) Zhizheng Wu, Tomi Kinnunen, Eng Siong Chng, Haizhou Li, and Eliathamby Ambikairajah,
“A study on spoofing attack in state-of-the-art speaker verification: the telephone speech case”, in
Proceedings of APSIPA ASC 2012, California, USA, 2012. (Best Paper Award)
150) Tze Yuang Chong, Xiong Xiao, Tien-Ping Tan, Eng Siong Chng, and Haizhou Li,
“Collection and annotation of Malay conversational speech corpus”, in Proceedings of O-COCOSDA
2012, Macau, China, December 2012.
151) Deyi Xiong, Min Zhang, and Haizhou Li, “Modeling the Translation of Predicate-Argument
Structure for SMT”, in Proceedings of ACL-2012, Jeju, Korea, July 2012.
152) Wenliang Chen, Min Zhang, and Haizhou Li, “Utilizing Dependency Language Models for
Graph-based Dependency Parsing Models”, in Proceedings of ACL-2012, Jeju, Korea, July 2012.
153) Rafael E. Banchs and Haizhou Li, “IRIS: a Chat-oriented Dialogue System based on the
Vector Space Model”, in Proceedings of ACL-2012 (System Demonstrations), Jeju, Korea, July 2012.
154) Xiong Xiao, Jinyu Li, Eng Siong Chng, and Haizhou Li, “Lasso Environment Model
Combination for Robust Speech Recognition”, in Proceedings of ICASSP 2012, Kyoto, Japan, March
2012.
155) Xiong Xiao, Eng Siong Chng, and Haizhou Li, “Joint Spectral and Temporal Normalization
of Features for Robust Recognition of Noisy and Reverberated Speech”, in Proceedings of ICASSP
2012, Kyoto, Japan, March 2012.
156) Siu Wa Lee, Shen Ting Ang, Minghui Dong, and Haizhou Li, “Generalized F0 modelling
with absolute and relative pitch features for singing voice synthesis”, in Proceedings of ICASSP 2012,
Kyoto, Japan, March 2012.
157) Lilei Zheng, Cheung-Chi Leung, Lei Xie, Bin Ma, and Haizhou Li, “Acoustic texttiling for
story segmentation of spoken documents”, in Proceedings of ICASSP 2012, Kyoto, Japan, March 2012.
158) Haipeng Wang, Cheung-Chi Leung, Tan Lee, Bin Ma, and Haizhou Li, “An acoustic segment
modeling approach to query-by-example spoken term detection”, in Proceedings of ICASSP 2012,
Kyoto, Japan, March 2012.
159) Anthony Larcher, Pierre-Michel Bousquet, Kong Aik Lee, Driss Matrouf, Haizhou Li, and
Jean-Francois Bonastre, “I-vectors in the context of phonetically-constrained short utterances for
speaker verification”, in Proceedings of ICASSP 2012, Kyoto, Japan, March 2012.
160) Tomi Kinnunen, Zhi-Zheng Wu, Kong Aik Lee, Filip Sedlak, Eng Siong Chng, and Haizhou
Li, “Vulnerability of speaker verification systems against voice conversion spoofing attacks: the case
of telephone speech”, in Proceedings of ICASSP 2012, Kyoto, Japan, March 2012.
161) Ye Jiang, Kong Aik Lee, Zhenmin Tang, Bin Ma, Anthony Larcher, and Haizhou Li, “PLDA
Modeling in I-Vector and Supervector Space for Speaker Verification”, in Proceedings of
INTERSPEECH 2012, Portland, Oregon, September 2012.
162) Anthony Larcher, Kong Aik Lee, Bin Ma, and Haizhou Li, “RSR2015: Database for Text-
Dependent Speaker Verification using Multiple Pass-Phrases”, in Proceedings of INTERSPEECH 2012,
Portland, Oregon, September 2012.
163) You Changhuai, Li Haizhou, Ma Bin, and Lee Kong Aik, “Effect of Relevance Factor of
Maximum a posteriori Adaptation for GMM-SVM in Speaker and Language Recognition”, in
Proceedings of INTERSPEECH 2012, Portland, Oregon, September 2012.
164) Van Hai Do, Xiong Xiao, Engsiong Chng, and Haizhou Li, “Context dependant phone
mapping for cross-lingual acoustic modelling”, in Proceedings of ISCSLP 2012, Hong Kong,
December 2012, pp. 16-20.
165) Cheung-Chi Leung, Bin Ma, and Haizhou Li, “Phonotactic spoken language recognition:
Using diversely adapted acoustic models in parallel phone recognizers”, in Proceedings of ISCSLP
2012, Hong Kong, December 2012, pp. 108-111.
166) Duc Hoang Ha Nguyen, Xiong Xiao, Chng Eng Siong, and Haizhou Li, “An analysis of
vector Taylor series model compensation for non-stationary noise in speech recognition”, in
Proceedings of ISCSLP 2012, Hong Kong, December 2012, pp. 131-135.
167) Siu Wa Lee, Minghui Dong, and Haizhou Li, “A study of F0 modelling and generation with
lyrics and shape characterization for singing voice synthesis”, in Proceedings of ISCSLP 2012, Hong
Kong, December 2012, pp. 150-154.
168) Van Hai Do, Xiong Xiao, Engsiong Chng, and Haizhou Li, “A Phone Mapping Technique for
Acoustic Modeling of Under-Resourced Languages”, in Proceedings of the International Conference
on Asian Language Processing 2012 (IALP 2012), Hanoi, Vietnam, November 2012, pp. 233-236.
169) Liyuan Li, Xinguo Yu, Jun Li, Gang Wang, Ji Yu Shi, Yeow Kee Tan, and Haizhou Li,
“Vision-based attention estimation and selection for social robot to perform natural interaction in the
open world”, in Proceedings of the Seventh Annual Conference on Human-Robot Interaction (HRI
2012), Boston, Massachusetts, USA, March 2012, pp. 183-184.
170) Keng Peng Tee, Shuzhi Sam Ge, Rui Yan, and Haizhou Li, “Adaptive control for robot
manipulators under ellipsoidal task space constraints”, in Proceedings of the IEEE/RSJ International
Conference on Intelligent Robots and Systems (IROS 2012), Vilamoura, Algarve, Portugal, October
2012, pp. 1167-1172.
2011
171) Deyi Xiong, Min Zhang, and Haizhou Li, “Enhancing Language Models in Statistical
Machine Translation with Backward N-grams and Mutual Information Triggers”, in Proceedings of
ACL-2011: HLT, Portland, Oregon, June 2011.
172) Rafael E. Banchs and Haizhou Li, “AM-FM: A Semantic Framework for Translation Quality
Assessment”, in Proceedings of ACL-2011: HLT, Portland, Oregon, June 2011, pp. 153-158.
173) Wenliang Chen, Junichi Kazama, Min Zhang, Yoshimasa Tsuruoka, Yujie Zhang, Yiou Wang,
Kentaro Torisaws, and Haizhou Li, “SMT Helps Bitext Dependency Parsing”, in Proceedings of
EMNLP 2011, Edinburgh, UK, July 2011.
174) Zhenghua Li, Min Zhang, Wanxiang Che, Ting Liu, Wenliang Chen, and Haizhou Li, “Joint
Models for Chinese POS Tagging and Dependency Parsing”, in Proceedings of EMNLP 2011,
Edinburgh, UK, July 2011.
175) Min Zhang, Xiangyu Duan, Ming Liu, Yunqing Xia, and Haizhou Li, “Joint Alignment and
Artificial Data Generation: An Empirical Study of Pivot-based Machine Transliteration”, in
Proceedings of IJCNLP 2011, Chiang Mai, Thailand, November 2011.
176) Guoyu Tang, Yunqing Xia, Min Zhang, Haizhou Li, and Fang Zhang, “CLGVSM: Adapting
Generalized Vector Space Model to Cross-lingual Document Clustering”, in Proceedings of IJCNLP
2011, Chiang Mai, Thailand, November 2011.
177) Huy Dat Tran and Haizhou Li, “Probabilistic Distance SVM With Hellinger-Exponential
Kernel for Sound Event Classification”, in Proceedings of ICASSP 2011, Prague, Czech, May 2011.
178) Huy Dat Tran and Haizhou Li, “Jump Function Kolmogorov for Overlapping Audio Event
Classification”, in Proceedings of ICASSP 2011, Prague, Czech, May 2011.
179) Raymond W. M. Ng, Cheung-Chi Leung, Tan Lee, Bin Ma, and Haizhou Li, “Score Fusion
and Calibration in Multiple Language Detectors With Large Performance Variation”, in Proceedings of
ICASSP 2011, Prague, Czech, May 2011.
180) Filip Sedlak, Tomi Kinnunen, Ville Hautamäki, Kong Aik Lee, Haizhou Li, “Classifier
Subset Selection and Fusion for Speaker Verification”, in Proceedings of ICASSP 2011, Prague, Czech,
May 2011.
181) Eryu Wang, Kong Aik Lee, Bin Ma, Haizhou Li, Wu Guo, Li-Rong Dai, “Factored
Covariance Modeling for Text-Independent Speaker Verification”, in Proceedings of ICASSP 2011,
Prague, Czech, May 2011.
182) Xiong Xiao, Jinyu Li, Eng Siong Chng, Haizhou Li, “Maximum Likelihood Adaptation of
Histogram Equalization With Constraint for Robust Speech Recognition”, in Proceedings of ICASSP
2011, Prague, Czech, May 2011.
183) Kong Aik Lee, Chang Huai You, Ville Hautamäki, Anthony Larcher, and Haizhou Li,
“Spoken Language Recognition in the Latent Topic Simplex”, in Proceedings of INTERSPEECH 2011,
Florence, Italy, August 2011.
184) Chang Huai You, Haizhou Li, and Kong Aik Lee, “Study on the Relevance Factor of
Maximum a Posteriori with GMM for Language Recognition”, in Proceedings of INTERSPEECH 2011,
Florence, Italy, August 2011.
185) Rong Tong, Bin Ma, Haizhou Li, and Eng Siong Chng, “Target-aware Lattice Rescoring for
Dialect Recognition”, in Proceedings of INTERSPEECH 2011, Florence, Italy, August 2011.
186) Yiren Leng, Huy Dat Tran, Norihide Kitaoka, and Haizhou Li, “Alternative Frequency Scale
Cepstral Coefficient for Robust Sound Event Recognition”, in Proceedings of INTERSPEECH 2011,
Florence, Italy, August 2011.
187) Kong Aik Lee, Anthony Larcher, Helen Thai, Bin Ma, and Haizhou Li, “Joint Application of
Speech and Speaker Recognition for Automation and Security in Smart Home”, in Proceedings of
INTERSPEECH 2011, Florence, Italy, August 2011.
188) Chien-Lin Huang, Bin Ma, Haizhou Li, and Chung-Hsien Wu, “Speech Indexing Using
Semantic Context Inference”, in Proceedings of INTERSPEECH 2011, Florence, Italy, August 2011.
189) Xiong Xiao, Jinyu Li, Eng Siong Chng, and Haizhou Li, “Feature Normalization Using
Structured Full Transforms for Robust Speech Recognition”, in Proceedings of INTERSPEECH 2011,
Florence, Italy, August 2011.
190) Sethserey Sam, Xiong Xiao, Laurent Besacier, Eric Castelli, and Haizhou Li, and Eng Siong
Chng, “Speech Modulation Features for Robust Nonnative Speech Accent Detection”, in Proceedings
of INTERSPEECH 2011, Florence, Italy, August 2011.
191) Jonathan William Dennis, Huy Dat Tran, and Haizhou Li, “Image Representation of the
Subband Power Distribution for Robust Sound Classification”, in Proceedings of INTERSPEECH 2011,
Florence, Italy, August 2011.
192) Mimi Lu, Cheung-Chi Leung, Lei Xie, Bin Ma, and Haizhou Li, “Probabilistic Latent
Semantic Analysis for Broadcast News Story Segmentation”, in Proceedings of INTERSPEECH 2011,
Florence, Italy, August 2011.
2010
193) Min Zhang, Hui Zhang, and Haizhou Li, “Convolution Kernel over Packed Parse Forest”, in
Proceedings of ACL 2010, Uppsala, Sweden, July 2010. (Full paper)
194) Deyi Xiong, Min Zhang, and Haizhou Li, “Error Detection for Statistical Machine
Translation Using Linguistic Features”, in Proceedings of ACL 2010, Uppsala, Sweden, July 2010.
(Full paper)
195) Xiangyu Duan, Min Zhang, and Haizhou Li. “Pseudo-word for Phrase-based Machine
Translation”, in Proceedings of ACL 2010, Uppsala, Sweden, July 2010. (Full paper)
196) Deyi Xiong, Min Zhang, and Haizhou Li, “Learning Translation Boundaries for Phrase-
Based Decoding”, in Proceedings of NAACL-HLT 2010, Los Angeles, CA, June 2010.
197) Lianhau Lee, Aiti Aw, Min Zhang, and Haizhou Li, “EM-based Hybrid Model for Bilingual
Terminology Extraction from Comparable Corpora”, in Proceedings of COLING 2010, Beijing, China,
August 2010.
198) Vladimir Pervouchine, Min Zhang, Ming Liu, and Haizhou Li, “Improving Name Origin
Recognition with Context Features and Unlabelled Data”, in Proceedings of COLING 2010, Beijing,
China, August 2010.
199) Min Zhang, Xiangyu Duan, Vladimir Pervouchine, and Haizhou Li, “Machine Transliteration:
Leveraging on Third Languages”, in Proceedings of COLING 2010, Beijing, China, August 2010.
200) Raymond W. M. Ng, Cheung-Chi Leung, Ville Hautamaki, Tan Lee, Bin Ma, and Haizhou Li,
“Towards Long-Range Prosodic Attribute Modeling For Language Recognition”, in Proceedings of
INTERSPEECH 2010, Makuhari, Japan, September 2010.
201) Tin Lay Nwe, Hanwu Sun, Bin Ma, and Haizhou Li, “Speaker Diarization in Meeting Audio
for Single Distant Microphone”, in Proceedings of INTERSPEECH 2010, Makuhari, Japan, September
2010.
202) Rong Tong, Bin Ma, Haizhou Li, and Eng Siong Chng, “Selecting Phonotactic Features for
Language Recognition”, in Proceedings of INTERSPEECH 2010, Makuhari, Japan, September 2010.
203) Omid Dehzangi, Bin Ma, Eng Siong Chng, and Haizhou Li, “A Discriminative Performance
Metric for GMM-UBM Speaker Identification”, in Proceedings of INTERSPEECH 2010, Makuhari,
Japan, September 2010.
204) Cheung-Chi Leung, Donglai Zhu, Kong-Aik Lee, Bin Ma, and Haizhou Li, “Incorporating
MAP Estimation and Covariance Transform for SVM based Speaker Recognition”, in Proceedings of
INTERSPEECH 2010, Makuhari, Japan, September 2010.
205) Chien-Lin Huang, Hanwu Sun, Bin Ma, and Haizhou Li, “Speaker Characterization Using
Long-Term and Temporal Information”, in Proceedings of INTERSPEECH 2010, Makuhari, Japan,
September 2010.
206) Xiaoxuan Wang, Lei Xie, Bin Ma, Eng Siong Chng, and Haizhou Li, “Phoneme Lattice based
TextTiling towards Multilingual Story Segmentation”, in Proceedings of INTERSPEECH 2010,
Makuhari, Japan, September 2010.
207) Eryu Wang, Kong-Aik Lee, Bin Ma, Haizhou Li, Wu Guo, and Lirong Dai, “The Estimation
and Kernel Metric of Spectral Correlation for Text-Independent Speaker Verification”, in Proceedings
of INTERSPEECH 2010, Makuhari, Japan, September 2010.
208) Donglai Zhu, Bin Ma, Kong-Aik Lee, Cheung-Chi Leung, and Haizhou Li, “MAP Estimation
of Subspace Transform for Speaker Recognition”, in Proceedings of INTERSPEECH 2010, Makuhari,
Japan, September 2010.
209) Hanwu Sun, Bin Ma, Chien-Lin Huang, Trung Hieu Nguyen, and Haizhou Li, “The IIR NIST
SRE 2008 and 2010 Summed Channel Speaker Recognition Systems”, in Proceedings of
INTERSPEECH 2010, Makuhari, Japan, September 2010.
210) Ville Hautamaki, Tomi Kinnunen, Mohaddeseh Nosratighods, Kong-Aik Lee, Bin Ma, and
Haizhou Li, “Approaching Human Listener Accuracy with Modern Speaker Verification”, in
Proceedings of INTERSPEECH 2010, Makuhari, Japan, September 2010.
211) Minghui Dong, Paul Chan, Ling Cen, Haizhou Li, Jason Teo, and Ping Jen Kua, “Phonetic
Segmentation of Singing Voice using MIDI and Parallel Speech”, in Proceedings of INTERSPEECH
2010, Makuhari, Japan, September 2010.
212) You Changhuai, Li Haizhou, and Kong-Aik Lee, “A Hybrid Modeling Strategy for GMM-
SVM Speaker Recognition System with Adaptive Relevance factor”, in Proceedings of
INTERSPEECH 2010, Makuhari, Japan, September 2010.
213) Leng Yi Ren, Tran Huy Dat, Norihide Kitaoka, and Li Haizhou, “Selective Gammatone
Filterbank Feature for Robust Sound Event Recognition”, in Proceedings of INTERSPEECH 2010,
Makuhari, Japan, September 2010.
214) Zhi-Zheng Wu, Tomi Kinnunen, Eng Siong Chng, and Haizhou Li, “Text-Independent F0
Transformation with Non-Parallel Data for Voice Conversion”, in Proceedings of INTERSPEECH
2010, Makuhari, Japan, September 2010.
215) Dau-Cheng Lyu, Tien-Ping Tan, Eng-Siong Chng, and Haizhou Li, “SEAME: a Mandarin-
English Code-switching Speech Corpus in South-East Asia”, in Proceedings of INTERSPEECH 2010,
Makuhari, Japan, September 2010.
216) Dat Tran Huy, Yi Ren Leng, and Haizhou Li, “Feature Integration for Heart Sound
Biometrics”, in Proceedings of ICASSP 2010, Dallas, USA, March 2010.
217) Omid Dehzangi, Bin Ma, Eng Siong Chng, and Haizhou Li, “Error Corrective Classifier
Fusion for Spoken Language Recognition”, in Proceedings of ICASSP 2010, Dallas, USA, March
2010.
218) C. P. Santhosh Kumar, Haizhou Li, Rong Tong, Pavel Matejka, Lukas Burget, and Jan
Cernocky, “Tuning Phone Decoders for Language Identification”, in Proceedings of ICASSP 2010,
Dallas, USA, March 2010.
219) Hanwu Sun, Bin Ma, Swe Zin Kalayar Khine, and Haizhou Li, “Speaker Diarization System
for RT07 and RT09 Meeting Room Audio”, in Proceedings of ICASSP 2010, Dallas, USA, March
2010.
220) Yu Tsao, Hanwu Sun, Haizhou Li, and Chin-Hui Lee, “An Acoustic Segment Model
Approach to Incorporating Temporal Information into Speaker Modeling for Text-Independent Speaker
Recognition”, in Proceedings of ICASSP 2010, Dallas, USA, March 2010.
221) Donglai Zhu, Bin Ma, and Haizhou Li, “Soft Margin Estimation of Gaussian Mixture Model
Parameters for Spoken Language Recognition”, in Proceedings of ICASSP 2010, Dallas, USA, March
2010.
222) Shuanhu Bai, Chien-Lin Huang, Bin Ma, and Haizhou Li, “Semi-Supervised Learning of
Language Model using Unsupervised Topic Model”, in Proceedings of ICASSP 2010, Dallas, USA,
March 2010.
223) Raymond W. M. Ng, Cheung-Chi Leung, Tan Lee, Bin Ma, and Haizhou Li, “Prosodic
Attribute Model for Spoken Language Identification”, in Proceedings of ICASSP 2010, Dallas, USA,
March 2010.
2009
224) Vladimir Pervouchine, Haizhou Li, and Bo Lin, “Transliteration Alignment”, in Proceedings
of the 47th Annual Meeting of Association for Computational Linguistics and the 4th International
Joint Conference of Natural Language Processing (ACL-IJCNLP 2009), Singapore, August 2009. (Full
paper)
225) Deyi Xiong, Min Zhang, Aiti Aw and Haizhou Li, “A Syntax-Driven Bracketing Model for
Phrase-Based Translation”, in Proceedings of the 47th Annual Meeting of Association for
Computational Linguistics and the 4th International Joint Conference of Natural Language Processing
(ACL-IJCNLP 2009), Singapore, August 2009. (Full paper)
226) Hendra Setiawan, Min Yen Kan, Haizhou Li, and Philip Resnik, “Topological Ordering of
Function Words in Hierarchical Phrase-based Translation”, in Proceedings of the 47th Annual Meeting
of Association for Computational Linguistics and the 4th International Joint Conference of Natural
Language Processing (ACL-IJCNLP 2009), Singapore, August 2009. (Full paper)
227) Hui Zhang, Min Zhang, Haizhou Li, Aiti Aw, and Chew Lim Tan, “Forest-based Tree
Sequence to String Translation Model”, in Proceedings of the 47th Annual Meeting of Association for
Computational Linguistics and the 4th International Joint Conference of Natural Language Processing
(ACL-IJCNLP 2009), Singapore, August 2009. (Full paper)
228) Boxing Chen, Min Zhang, Haizhou Li, and Aiti Aw, “A Comparative Study of Hypothesis
Alignment and its Improvement for Machine Translation System Combination”, in Proceedings of the
47th Annual Meeting of Association for Computational Linguistics and the 4th International Joint
Conference of Natural Language Processing (ACL-IJCNLP 2009), Singapore, August 2009. (Full
paper)
229) Min Zhang and Haizhou Li, “Tree Kernel-based SVM with Structured Syntactic Knowledge
for BTG-based Phrase Reordering”, in Proceedings of EMNLP 2009, Singapore, August 2009.
230) Hui Zhang, Min Zhang, Haizhou Li, and Chew Lim Tan, “Fast Translation Rule Matching for
Syntax-based Statistical Machine Translation”, in Proceedings of EMNLP 2009, Singapore, August
2009.
231) Hui Zhang, Min Zhang, Chew Lim Tan, and Haizhou Li, “K-Best Combination of Syntactic
Parsers”, in Proceedings of EMNLP 2009, Singapore, August 2009.
232) Rong Tong, Bin Ma, Haizhou Li, Eng Siong Chng, and Kong-Aik Lee, “Target-Aware
Language Models for Spoken Language Recognition”, in Proceedings of INTERSPEECH 2009,
Brighton, UK, September 2009, pp. 200-203.
233) Hanwu Sun, Tin Lay Nwe, Bin Ma, and Haizhou Li, “Speaker Diarization for Meeting Room
Audio”, in Proceedings of INTERSPEECH 2009, Brighton, UK, September 2009, pp. 900-903.
234) Ling Cen, Minghui Dong, Paul Chan, and Haizhou Li, “Unit Selection Based Speech
Synthesis for Poor Channel Condition”, in Proceedings of INTERSPEECH 2009, Brighton, UK,
September 2009, pp. 2075-2078.
235) Donglai Zhu, Bin Ma, and Haizhou Li, “Large Margin Estimation of Gaussian Mixture
Model Parameters with Extended Baum-Welch for Spoken Language Recognition”, in Proceedings of
INTERSPEECH 2009, Brighton, UK, September 2009, pp. 2179-2182.
236) Omid Dehzangi, Bin Ma, Eng Siong Chng, and Haizhou Li, “Discriminative Feature
Transformation Using Output Coding for Speech Recognition”, in Proceedings of INTERSPEECH
2009, Brighton, UK, September 2009, pp. 2979-2982.
237) Khe Chai Sim and Haizhou Li, “Stream-Based Context-Sensitive Phone Mapping for Cross-
Lingual Speech Recognition”, in Proceedings of INTERSPEECH 2009, Brighton, UK, September 2009,
pp. 3019-3022.
238) Yanhua Long, Bin Ma, Haizhou Li, Wu Guo, Eng Siong Chng, and Lirong Dai, “Exploiting
Prosodic Information for Speaker Recognition”, in Proceedings of ICASSP 2009, Taipei, Taiwan, April
2009.
239) Chang Huai You, Kong Aik Lee, and Haizhou Li, “A GMM Supervector Kernel with the
Bhattacharyya Distance for SVM based Speaker Recognition”, in Proceedings of ICASSP 2009, Taipei,
Taiwan, April 2009.
240) Mohaddeseh Nosratighods, Tharmarajah Thiruvaran, Julien Epps, Eliathamby Ambikairajah,
Bin Ma, and Haizhou Li, “Evaluation of a Fused FM and Cepstral-Based Speaker Recognition System
on the NIST 2008 SRE”, in Proceedings of ICASSP 2009, Taipei, Taiwan, April 2009.
241) Hanwu Sun, Bin Ma, and Haizhou Li, “Cross-Validation of Multiple Language Recognition
Systems using Pseudo Keys”, in Proceedings of ICASSP 2009, Taipei, Taiwan, April 2009.
242) Haizhou Li, Bin Ma, Kong-Aik Lee, Hanwu Sun, Donglai Zhu, Khe Chai Sim, Changhuai
You, Rong Tong, Ismo Karkkainen, Chien-Lin Huang, Vladimir Pervouchine, Wu Guo, Yijie Li,
Lirong Dai, Mohaddeseh Nosratighods, Thiruvaran Tharmarajah, Julien Epps, Eliathamby
Ambikairajah, Eng-Siong Chng, Tanja Schultz, and Qin Jin, “The I4U System in NIST 2008 Speaker
Recognition Evaluation”, in Proceedings of ICASSP 2009, Taipei, Taiwan, April 2009.
243) Donglai Zhu, Bin Ma, and Haizhou Li, “Joint MAP Adaptation of Feature Transformation
and Gaussian Mixture Model for Speaker Recognition”, in Proceedings of ICASSP 2009, Taipei,
Taiwan, April 2009.
244) Tran Huy Dat and Haizhou Li, “Sound Event Classification based on Feature Integration,
Recursive Feature elimination and Structured Classification”, in Proceedings of ICASSP 2009, Taipei,
Taiwan, April 2009.
245) Trung Hieu Nguyen, Eng Siong Chng, and Haizhou Li, “Clustering Criterion Functions in
Spectral Subspace and Their Application in Speaker Clustering”, in Proceedings of ICASSP 2009,
Taipei, Taiwan, April 2009.
246) Tin Lay Nwe, Hanwu Sun, Haizhou Li, and Susanto Rahardja, “Speaker Diarization in
Meeting Audio”, in Proceedings of ICASSP 2009, Taipei, Taiwan, April 2009.
2008
247) Min Zhang, Hongfei Jiang, Aiti Aw, Haizhou Li, Chew Lim Tan, and Sheng Li, “A Tree
Sequence Alignment-based Tree-to-Tree Translation Model”, in Proceedings of ACL-08: HLT,
Columbus, Ohio, June 2008. (Full paper)
248) Deyi Xiong, Min Zhang Aiti Aw, and Haizhou Li, “A Linguistically Annotated Reordering
Model for BTG-based Statistical Machine Translation”, in Proceedings of ACL-08: HLT, Columbus,
Ohio, June 2008. (Short paper)
249) Boxing Chen, Min Zhang Aiti Aw, and Haizhou Li, “Exploiting N-best Hypotheses for SMT
Self-Enhancement”, in Proceedings of ACL-08: HLT, Columbus, Ohio, June 2008. (Short paper)
250) Jin-Shea Kuo and Haizhou Li, “Multi-View Co-Training of Transliteration Model”, in
Proceedings of IJCNLP 2008, Hyderabad, India, January 2008.
251) Min Zhang, Chengjie Sun, Haizhou Li, Aiti Aw, and Chew Lim Tan, “Name Origin
Recognition Using Maximum Entropy Model and Diverse Features”, in Proceedings of IJCNLP 2008,
Hyderabad, India, January 2008.
252) Jin-Shea Kuo, Haizhou Li, and Chih-Lung Lin, “Mining Transliterations from Web Query
Results: An Incremental Approach,” in Proceedings of the 6th SIGHAN Workshop, Hyderabad, India,
January 2008.
253) Min Zhang, Hongfei Jiang, Haizhou Li, Aiti Aw, and Sheng Li, “Grammar Comparison
Study for Translational Equivalence Modeling and Statistical Machine Translation”, in Proceedings of
COLING2008, Manchester, UK, August 2008.
254) Boxing Chen, Min Zhang, Aiti Aw, and Haizhou Li, “Regenerating Hypotheses for Statistical
Machine Translation”, in Proceedings of COLING2008, Manchester, UK, August 2008.
255) Deyi Xiong, Min Zhang, Aiti Aw, and Haizhou Li, “Linguistically Annotated BTG for
Statistical Machine Translation”, in Proceedings of COLING2008, Manchester, UK, August 2008.
256) Tee Kiah Chia, Khe Chai Sim, Haizhou Li, and Hwee Tou Ng, “A Lattice-Based Approach to
Query-by-Example Spoken Document Retrieval”, in Proceedings of the 31st Annual International
ACM SIGIR Conference on Research & Development on Information Retrieval, Singapore, July 2008.
(Full paper)
257) Rong Tong, Bin Ma, Haizhou Li, and Eng-Siong Chng, “Target-Oriented Phone Selection
from Universal Phone Set for Spoken Language Recognition”, in Proceedings of INTERSPEECH 2008,
Brisbane, Australia, September 2008.
258) Donglai Zhu, Bin Ma, and Haizhou Li, “Using MAP Estimation of Feature Transformation
For Speaker Recognition”, in Proceedings of INTERSPEECH 2008, Brisbane, Australia, September
2008.
259) Chien-Lin Huang, Bin Ma, Chung-Hsien Wu, Brian Mak, and Haizhou Li, “Robust Speaker
Verification Using Short-Time Frequency with Long-Time Window and Fusion of Multi-Resolutions”,
in Proceedings of INTERSPEECH 2008, Brisbane, Australia, September 2008.
260) Tin Lay Nwe, Minghui Dong, Swe Zin Kalayar Khine, and Haizhou Li, “Multi-Speaker
Meeting Audio Segmentation”, in Proceedings of INTERSPEECH 2008, Brisbane, Australia,
September 2008.
261) Swe Zin Kalayar Khine, Tin Lay Nwe, and Haizhou Li, “Speech/Laughter Classification in
Meeting Audio”, in Proceedings of INTERSPEECH 2008, Brisbane, Australia, September 2008.
262) Tran Huy Dat and Haizhou Li, “Speaker Identification in Noise Mismatch Conditions based
on Jump Function Kolmogorov Analysis in Wavelet Domain”, in Proceedings of INTERSPEECH 2008,
Brisbane, Australia, September 2008.
263) Kong-Aik Lee, Changhuai You, Haizhou Li, Tomi Kinnunen, and Donglai Zhu,
“Characterizing Speech Utterances for Speaker Verification with Sequence Kernel SVM”, in
Proceedings of INTERSPEECH 2008, Brisbane, Australia, September 2008.
264) Namunu Maddage and Haizhou Li, “Rhythm Based Music Segmentation and Octave Scale
Cepstral Features for Sung Language Recognition”, in Proceedings of INTERSPEECH 2008, Brisbane,
Australia, September 2008.
265) Tran Hieu Nguyen , Eng Siong Chng, and Haizhou Li, “T-Test Distance and Clustering
Criterion for Speaker Diarization”, in Proceedings of INTERSPEECH 2008, Brisbane, Australia,
September 2008.
266) Khe Chai Sim and Haizhou Li, “Context-sensitive Probabilistic Phone Mapping Model for
Cross-lingual Speech Recognition”, in Proceedings of INTERSPEECH 2008, Brisbane, Australia,
September 2008.
267) Rong Tong, Bin Ma, Haizhou Li, and Eng Siong Chng, “Target-Oriented Phone Tokenizers
For Spoken Language Recognition”, in Proceedings of ICASSP 2008, Las Vegas, Nevada, March-
April 2008.
268) Donglai Zhu, Haizhou Li, Bin Ma, and Chin-Hui Lee, “Discriminative Learning For
Optimizing Detection Performance In Spoken Language Recognition”, in Proceedings of ICASSP 2008,
Las Vegas, Nevada, March- April 2008.
269) Tin Lay Nwe and Haizhou Li, “On Fusion Of Timbre-Motivated Features For Singing Voice
Detection And Singer Identification”, in Proceedings of ICASSP 2008, Las Vegas, Nevada, March-
April 2008.
270) Swe Zin Kalayar Khine, Tin Lay Nwe, and Haizhou Li, “Singing Voice Detection In Pop
Songs Using Co-Training Algorithm”, in Proceedings of ICASSP 2008, Las Vegas, Nevada, March-
April 2008.
271) Khe Chai Sim and Haizhou Li, “Robust Phone Set Mapping Using Decision Tree Clustering
For Cross-Lingual Phone Recognition”, in Proceedings of ICASSP 2008, Las Vegas, Nevada, March-
April 2008.
272) Kong-Aik Lee, Changhuai You, and Haizhou Li, “Spoken Language Recognition Using
Support Vector Machines With Generative Front-End”, in Proceedings of ICASSP 2008, Las Vegas,
Nevada, March- April 2008.
273) Tran Huy Dat and Haizhou Li, “Jump Function Komogorov And Its Application For Audio
Stream”, in Proceedings of ICASSP 2008, Las Vegas, Nevada, March- April 2008.
274) Chien-Lin Huang, Chung-Hsien Wu, Chia-Hsin Hsieh, Haizhou Li, and Bin Ma,
“Unsupervised Pronunciation Grammar Growing using Knowledge-based and Data-Driven
Approaches”, in Proceedings of ICME 2008, Hannover, Germany, June 2008.
275) Chang Huai You, Susanto Rahardja, and Haizhou Li, “Speech Enhancement for Telephony
Name Speech Recognition”, in Proceedings of ICME 2008, Hannover, Germany, June 2008.
276) Boxing Chen, Deyi Xiong, Min Zhang, Aiti Aw, and Haizhou Li, “I2R Multi-Pass Machine
Translation System for IWSLT 2008”, in Proceedings of IWSLT 2008, Hawaii, USA, 2008, pp.46-51.
277) Omid Dehzangi, Bin Ma, Eng Siong Chng, and Haizhou Li, “Fuzzy Rule Selection using
Iterative Rule Learning for Speech Data Classification”, in Proceedings of the International
Conference on Pattern Recognition 2008 (ICPR 2008), Tampa, Florida, December 2008.
278) Eugene Chin Wei Koh, Hanwu Sun, Tin Lay Nwe, Trung Hieu Nguyen, Bin Ma, Eng-
Siong Chng, Haizhou Li, and Susanto Rahardja, “Speaker Diarization Using Direction of Arrival
Estimate and Acoustic Feature Information: The I2R-NTU Submission for the NIST RT 2007
Evaluation”, in Lecture Notes of Computer Science Vol. 4625, Multimodal Technologies for Perception
of Humans, Springer 2008, pp.484-496.
2007
279) Haizhou Li, Khe Chai Sim, Jin-Shea Kuo, and Minghui Dong, “Semantic Transliteration of
Personal Names”, in Proceedings of ACL 2007, Prague, Czech Republic, June 2007, pp. 120-127.
280) Hendra Setiawan, Min-Yen Kan, and Haizhou Li, “Ordering Phrases with Function Words”,
The in Proceedings of ACL 2007, Prague, Czech Republic, June 2007, pp. 712-719.
281) Tee Kiah Chia, Haizhou Li, and Hwee Tou Ng, “A Statistical Language Modeling Approach
to Lattice-based Spoken Document Retrieval”, in Proceedings of the Joint Meeting Conference on
Empirical Methods in Natural Language Processing, and Conference on Computational Natural
Language Learning(EMNLP-CoNLL 2007), Prague, Czech Republic, June 2007.
282) Bin Ma, Rong Tong, and Haizhou Li, “Discriminative Vector for Spoken Language
Recognition”, in Proceedings of ICASSP 2007, Hawaii, USA, April 2007.
283) Rong Tong, Haizhou Li, Bin Ma, and Eng Siong Chng, “Spoken Language Recognition with
Relevance Feedback”, in Proceedings of ICASSP 2007, Hawaii, USA, April 2007.
284) Donglai Zhu, Bin Ma, Haizhou Li, and Qiang Huo, “A Generalized Feature Transformation
Approach for Channel Robust Speaker Verification”, in Proceedings of ICASSP 2007, Hawaii, USA,
April 2007.
285) Xiong Xiao, Eng Siong Chng, and Haizhou Li, “Normalizing the Speech Modulation
Spectrum for Robust Speech Recognition”, in Proceedings of ICASSP 2007, Hawaii, USA, April 2007.
286) Kong Aik Kee, Changhuai You, Haizhou Li, and Tomi Kinnunen, “A GMM-based
Probabilistic Sequence Kernel for Speaker Verification”, in Proceedings of INTERSPEECH 2007,
Antwerp, Belgium, August 2007.
287) Eugene Chin Wei Koh, Hanwu Sun, Tin Lay Nwe, Trung Hieu Nguyen, Bin Ma, Eng-Siong
Chng, Haizhou Li, and Susanto Rahardja, “Using Direction of Arrival Estimate and Acoustic Feature
Information in Speaker Diarization”, in Proceedings of INTERSPEECH 2007, Antwerp, Belgium,
August 2007.
288) Khe Chai Sim and Haizhou Li, “Fusion of Contrastive Acoustic Models for Parallel
Phonotactic Spoken Language Identification”, in Proceedings of INTERSPEECH 2007, Antwerp,
Belgium, August 2007.
289) Xiong Xiao, Eng Siong Chng, and Haizhou Li, “Evaluating the Temporal Structure
Normalisation Technique on the Aurora-4 Task”, in Proceedings of INTERSPEECH 2007, Antwerp,
Belgium, August 2007.
290) Tin Lay Nwe and Haizhou Li, “Singing Voice Detection using Perceptually-Motivated
Features”, in Proceedings of ACM Multimedia Conference 2007, Augsburg, Germany, September 2007.
291) Lei Wang, Eng Siong Chng, and Haizhou Li, “A vector-based approach to broadcast audio
database indexing and retrieval”, in Proceedings of ICME 2007, Beijing, China, July 2007.
2006
292) Jin-Shea Kuo, Haizhou Li, and Ying-Kuei Yang, “Learning Transliteration Lexicons from the
Web”, in Proceedings of the 44th Annual Meeting of Association for Computational Linguistics
(COLING-ACL 2006), Sydney, Australia, July 2006, pp. 1129 – 1136.
293) Namunu Maddage, Haizhou Li, and Mohan Kankanhalli, “Music Structure based Vector
Space Retrieval”, in Proceedings of the 29th Annual International ACM SIGIR Conference on
Research & Development on Information Retrieval (SIGIR 2006), Seattle, Washington, August 2006,
pp. 67-74. (Full paper)
294) Shuanhu Bai and Haizhou Li, “Bayesian Learning of N-gram statistical Language Modeling”,
in Proceedings of ICASSP 2006, Toulouse, France, May 2006.
295) Haizhou Li and Tin Lay Nwe, “Vibrato-Motivated Acoustic Features for Singer
Identification”, in Proceedings of ICASSP 2006, Toulouse, France, May 2006.
296) Rong Tong, Bin Ma, Donglai Zhu, Haizhou Li, and Eng Siong Chng, “Integrating Acoustic,
Prosodic and Phonotactic features for Spoken language identification”, in Proceedings of ICASSP 2006,
Toulouse, France, May 2006.
297) Tin Lay Nwe, Haizhou Li, and Minghui Dong, “Analysis and Detection of Speech under
Sleep Deprivation”, in Proceedings of INTERSPEECH 2006, Pittsburgh, USA, September 2006.
298) Haizhou Li, Bin Ma, and Rong Tong, “Vector-Based Spoken Language Recognition using
Output Coding”, in Proceedings of INTERSPEECH 2006, Pittsburgh, USA, September 2006.
299) Minghui Dong, Haizhou Li, and Tin Lay Nwe, “Evaluating Prosody of Mandarin Speech for
Language Learning”, in Proceedings of INTERSPEECH 2006, Pittsburgh, USA, September 2006.
300) Ma Bin, Donglai Zhu, Rong Tong, and Haizhou Li, “Speaker Cluster based GMM
Tokenization for Speaker Recognition”, in Proceedings of INTERSPEECH 2006, Pittsburgh, USA,
September 2006.
301) Denny Iskandar, Ye Wang, Min -Yen Kan, and Haizhou Li, “Syllabic Level Automatic
Synchronization of Music Signals and Text Lyrics”, in Proceedings of the ACM Multimedia
Conference 2006, Santa Barbara, USA, October 2006.
302) Namunu C Maddage, Mohan S. Kankanhalli, and Haizhou Li, “A Hirarchical Approach for
Music Chord Modeling based on the Analysis of Tonal Characteristics”, in Proceedings of ICME 2006,
Toronto, Canada, July 2006.
303) Jinyu Li, Sibel Yaman, Chin-Hui Lee, Bin Ma, Rong Tong, Donglai Zhu, and Haizhou Li,
“Language Recognition Based on Score Distribution Feature Vectors and Discriminative Classifier
Fusion”, in Proceedings of the IEEE Odyssey 2006 - The Speaker and Language Recognition
Workshop, San Juan, Puerto Rico, June 2006.
2005
304) Min Zhang, Haizhou Li, Jian Su, and Hendra Setiawan, “A Phrase-based Context-dependent
Joint Probability”, in Proceedings of IJCNLP 2005, Jeju, South Korea, October 2005.
305) Hendra Setiawan, Haizhou Li, Min Zhang, and Beng Chin Ooi, “Phrase-based Statistical
Machine Translation: A Level of Detail Approach”, in Proceedings of IJCNLP 2005, Jeju, South Korea,
October 2005.
306) Haizhou Li and Bin Ma, “A Phonotactic Language Model for Spoken Language
Identification”, in Proceedings of ACL 2005, Ann Arbor, USA, June 2005, pp. 515-522.
307) Bin Ma and Haizhou Li, “A Phonotactic-Semantic Paradigm for Automatic Spoken
Document Classification”, in Proceedings of the 28th Annual International ACM SIGIR Conference
(SIGIR 2005), Salvador, Brazil, August 2005, pp. 369-376. (Full paper)
308) Tin Lay Nwe and Haizhou Li, “Broadcast News Segmentation by Audio Type Analysis”, in
Proceedings of ICASSP 2005, Philadelphia, PA, March 2005.
309) Boon Pang Lim, Haizhou Li, and Bin Ma, “Using Local and Global Phonotactical Features in
Chinese Dialect Identification”, in Proceedings of ICASSP 2005, Philadelphia, PA, March 2005.
310) Santhosh C. Kumar, V.P. Mohandas, and Haizhou Li, “Multilingual Speech Recognition: A
Unified Approach”, in Proceedings of INTERSPEECH 2005 - Eurospeech - 9th European Conference
on Speech Communication and Technology, Lisboa, Portugal, September 2005.
311) Tin Lay Nwe and Haizhou Li, “Identifying Singers of Popular Songs”, in Proceedings of
INTERSPEECH 2005 - Eurospeech - 9th European Conference on Speech Communication and
Technology, Lisboa, Portugal, September 2005.
312) Minghui Dong, Kim-Teng Lua, and Haizhou Li, “A Probabilistic Approach to Prosodic Word
Prediction for Mandarin Chinese TTS”, in Proceedings of INTERSPEECH 2005 - Eurospeech - 9th
European Conference on Speech Communication and Technology, Lisboa, Portugal, September 2005.
313) Sheng Gao, Bin Ma, Haizhou Li, and Chin-Hui Lee, “A Text Categorization Approach to
Automatic Language Identification”, in Proceedings of INTERSPEECH 2005 - Eurospeech - 9th
European Conference on Speech Communication and Technology, Lisboa, Portugal, September 2005.
314) Bin Ma, Haizhou Li, and Chin-Hui Lee, “An Acoustic Segment Modeling Approach to
Automatic Language Identification”, in Proceedings of INTERSPEECH 2005 - Eurospeech - 9th
European Conference on Speech Communication and Technology, Lisboa, Portugal, September 2005.
315) Minghui Dong, Kim Teng Lua, and Haizhou Li, “A Unit Selection based Speech Synthesis
Approach for Chinese Mandarin Text-to-Speech”, in Proceedings of the International Conference on
Chinese Computing 2005 (ICCC 2005), Singapore, March 2005.
316) Bin Ma and Haizhou Li, “Spoken Language Identification Using Bag-of-Sounds”, in
Proceedings of the International Conference on Chinese Computing 2005 (ICCC 2005), Singapore,
March 2005.
317) Manickam K and Haizhou Li, “Complexity Analysis of Normal and Deaf Infant Cry Acoustic
Waves”, in Proceedings of the 4th International Workshop on Model and Analysis of Vocal Emission
for Biomedical Applications (MAVEBA 2005), Florence, Italy, 2005.
318) Boon Pang Lim, Bin Ma, and Haizhou Li, “Using Semantic Context to Improve Voice
Keyword Mining”, in Proceedings of the International Conference on Chinese Computing 2005 (ICCC
2005), Singapore, March 2005.
2004
319) Haizhou Li, Min Zhang, and Jian Su, “A Joint Source-Channel Model for Machine
Transliteration”, in Proceedings of ACL 2004, Barcelona, Spain, July 2004, pp. 160-167.
320) Min Zhang, Haizhou Li, and Jian Su, “Direct Orthographical Mapping for Machine
Transliteration”, in Proceedings of the 20th International Conference on Computational Linguistics
(COLING2004), Geneva, Switzerland, August 2004.
321) Jun Xu, Guohong Fu, and Haizhou Li, “Grapheme-to-Phoneme Conversion for Chinese Text-
to-Speech Session Code”, in Proceedings of INTERSPEECH-ICSLP 2004, Jeju Island, Korea, October
2004.
322) Boon Pang Lim, Haizhou Li, and Yu Chen, “Language Identification through Large
Vocabulary Continuous Speech Recognition”, in Proceedings of ISCSLP 2004, Hong Kong, December
2004.
323) Yeow Kee Tan, Boon Seong Teoh, and Haizhou Li, “A Grapheme to Phoneme Conversion
for Standard Malay”, in Proceedings of ICSLT-O-COCOSDA 2004, New Delhi, India, November 2004.
324) C. S. Kumar and Haizhou Li, “Language identification System for Multilingual Speech
Recognition Systems”, in Proceedings of the 9th International Conference Speech and Computer
(SPECOM 2004), St. Petersburg, Russia, September 2004.