(1) Responsible for, or a key participant in, the research and development of more than 30 national key projects and international cooperation projects; won more than 10 awards from the Ministry of Education, the Ministry of Science and Technology, and Beijing.
(2) Published more than 310 academic papers in well-known domestic and international journals and academic conferences, including 13 papers (3 as first author) that won excellent paper awards; published 14 monographs.
Representative publications are as follows:
[1] Tongxu Li, Hui Zhang, Thomas Fang Zheng, “The Voiceprint Recognition Technology and Its Applications in Unsupervised Identity Authentication,” Chinese Association for Artificial Intelligence Transactions, 8(9): 46-54, 2018 (in Chinese)
[2]Lantian Li, Dong Wang, Chenhao Zhang, and Thomas Fang Zheng, "Improving short utterance speaker recognition by modeling speech unit classes," IEEE/ACM Trans. on Audio, Speech, and Language Processing, pp. 1129-1139, vol. 24, no. 6, June 2016
[3]Linlin Wang, Jun Wang, Lantian Li, Thomas Fang Zheng, Frank K. Soong, “Improving speaker verification performance against long-term speaker variability,” Speech Communication, 79 (2016), 14-29, Mar. 2016
[4] Miao Fan, Qiang Zhou, Thomas Fang Zheng, Ralph Grishman, “Distributed Representation Learning for Knowledge Bases with Entity Descriptions,” Pattern Recognition Letters, Elsevier, DOI: 10.1016/j.patrec.2016.09.005
[5]Miao Fan, Qiang Zhou, Andrew Abel, Thomas Fang Zheng, Ralph Grishman, “Probabilistic Belief Embedding for Large-Scale Knowledge Population,” Cognitive Computation, December 2016, Volume 8, Issue 6, pp. 1087-1102
[6]Meng Sun, Xiongwei Zhang, Hugo Van hamme, and Thomas Fang Zheng, "Unseen noise estimation using separable deep auto encoder for speech enhancement," IEEE/ACM Transactions on Audio, Speech, and Language Processing, pp. 93-104, Vol. 24, No. 1, Jan. 2016 (DOI 10.1109/TASLP.2015.2498101)
[7] Guoyu Tang, Yunqing Xia, Erik Cambria, Peng Jin, Thomas Fang Zheng, “Document representation with statistical word senses in cross-lingual document clustering,” International Journal of Pattern Recognition and Artificial Intelligence, Vol. 29, No. 2 (2015), World Scientific Publishing Company
[8]Shi Yin, Chao Liu, Zhiyong Zhang, Yiye Lin, Dong Wang, Javier Tejedor, Thomas Fang Zheng and Yingguo Li, “Noisy Training for Deep Neural Networks in Speech Recognition,” EURASIP Journal on Audio, Speech, and Music Processing, 2015, 2015:2
[9]Dong Wang, Ravichander Vipperla, Nicholas Evans, Thomas Fang Zheng, “Online Non-Negative Convolutive Pattern Learning for Speech Signals,” IEEE Trans. on Signal Processing, 61(1): 44-56, Jan. 1, 2013
[10]Mijit Ablimit, Sardar Parhat, Askar Hamdulla, Thomas Fang Zheng, “Multilingual Stemming and Term Extraction for Uyghur, Kazak and Kirghiz,” the 10th APSIPA Annual Summit and Conference (APSIPA ASC 2018), November 12-15, 2018, 587-590, Hawaii, USA
[11]Thomas Fang Zheng, “Speech Signal for Unsupervised Identity Authentication,” APSIPA 10th Anniversary Magazine, pp. 26-28, Nov. 2018, Hawaii, USA
[12]Lantian Li, Zhiyuan Tang, Dong Wang, Thomas Fang Zheng, “Full-Info Training for Deep Speaker Feature Learning,” International Conference on Acoustics, Speech and Signal Processing (ICASSP’18), pp. 5369-5373, Apr. 15-20, 2018, Calgary, Alberta, Canada
[13]Lantian Li, Dong Wang, Yixiang Chen, Ying Shi, Zhiyuan Tang, Thomas Fang Zheng, “Deep Factorization for Speech Signal,” International Conference on Acoustics, Speech and Signal Processing (ICASSP’18), pp. 5094-5098, Apr. 15-20, 2018, Calgary, Alberta, Canada
[14]Xingliang Cheng, Xiaotong Zhang, Mingxing Xu, and Thomas Fang Zheng, “MMANN: Multimodal Multilevel Attention Neural Network for Horror Clip Detection,” the 10th APSIPA Annual Summit and Conference (APSIPA ASC 2018), November 12-15, 2018, 329-334, Hawaii, USA
[15] Xiaotong Zhang, Xingliang Cheng, Mingxing Xu, Thomas Fang Zheng, “Imbalance Learning-based Framework for Fear Recognition in the MediaEval Emotional Impact of Movies Task,” Interspeech 2018, pp. 3678-3682, 2-6 September 2018, Hyderabad, India, DOI: 10.21437/Interspeech.2018-1744
[16] Replay Detection using CQT-based Modified Group Delay Feature and ResNeWt Network in ASVspoof 2019
[17] Xiaolong Wu, Chang Feng, Mingxing Xu, Thomas Fang Zheng, Askar Hamdulla, “DialoguePCN: Perception and Cognition Network for Emotion Recognition in Conversations,” IEEE Access, Vol. 11, pp. 141251-141260, 2023, DOI: 10.1109/ACCESS.2023.3342456
Book: “Robustness-Related Issues in Speaker Recognition”
(3) Holds 16 invention patents (including one international invention patent) and one utility model patent.
Representative patents obtained in recent years are as follows:
[1] A training method and system for a language model based on a distributed neural network, 201410067916, 2014.02.27, China
[2] A method and system for voice password authentication, 201710052098, 2017.01.22, China
[3] A system and method for voice identity confirmation based on dynamic password voice, ZL 201310123555.0, 2013.10.12, China
[4] A voice access system based on dynamic digital verification code, ZL 201620119381.X, 2016, China
[5] A method and device for automatic reconstruction of voiceprint models, ZL 201510061721.8, 2015.1.06, China
[6] A method for dual authentication of fingerprints and voiceprints, ZL 2015100479665, 2015.10.04, China
[7] A feature extraction method and device for voice replay detection, ZL 20180191512.9, China
(4) The "Unsupervised Identity Authentication System Based on Dynamic Password Voice" has passed the scientific and technological achievement appraisal of the Chinese Electronics Society, and the appraisal conclusion is "overall at the international leading level".