何亮
|
新疆大学信息半岛体育在线(中国)有限公司官网常务副院长(援疆),清华大学电子系,教授
研究领域:语音识别、知识图谱、归因分析与稳定学习
办公室&实验室:新疆大学博达校区计算机科学与技术楼A335
电子邮件:heliang@mail.tsinghua.edu.cn
联系电话:(+86)0991-8582510
|
教育背景
2006.9-2011.6 清华大学 电子工程系 博士 信息与通信工程
2004.9-2006.6 浙江大学 信电系 硕士 信息与通信工程
2000.9-2004.6 中国民航大学 空管半岛体育在线(中国)有限公司官网 本科 通信工程
社会工作
工作简历
2006.9-2011.6 清华大学 电子工程系 博士 信息与通信工程
2004.9-2006.6 浙江大学 信电系 硕士 信息与通信工程
2000.9-2004.6 中国民航大学 空管半岛体育在线(中国)有限公司官网 本科 通信工程
学术兼职
2021.5-今:中国中文信息学会开源情报技术专委会 秘书长
2022.4-今:中国刑事科学技术协会声纹检验技术专业委员会 副主任
2019-今:中国计算机学会语音对话与听觉专委会 委员
2019-2020:全国声纹识别技术与应用研讨会会议主席;2022:Odyssey程序委员会主席;2020-2022:INTERSPEECH、ICASSP分会主席;2020-2023:ICME领域主席、分会主席;IEEE ASLP、IEEE SP、IEEE SPL、PR、CSL、EURASIP ASMP、IET SP、ICASSP、ICME、INTERSPEECH等国际期刊或会议审稿人。
科研项目
国家自然科学基金,大规模高精度可解释声纹识别关键技术研究(主持),2023
科技创新2030课题,智能农事决策和管理调度系统(主持),2022
自治区重点研发课题,区域特色肿瘤防诊治疗(主持),2022
横向项目,上海海思,声纹识别和模糊命令词识别(主持),2022
横向项目,新疆国电,电网设备机器声纹识别(主持),2022
国家自然科学基金,肝癌精准治疗的智能化外科决策与手术规划(参与),2021
横向项目,腾讯,语音防伪方法研究(主持),2020
国家自然科学基金,复杂环境下语音数据的说话人识别及关键词检索(参与),2018
横向项目,淘宝中国,说话人识别和数字串内容识别项目(主持),2017
横向项目,华为,说话人标记(主持),2016
国家自然科学基金,基于信息几何的说话人标记方法研究(主持),2014
学术成果
作为项目负责人承担:科技部2030“新一代人工智能“重大项目课题1项,自治区重点研发课题1项,国家自然科学基金1项,专项3项;华为、腾讯和上海海思等企事业合作;以第一作者或通讯作者(含学生一作)发表高水平学术论文(Nature Communication、IEEE TASLP、IEEE SPL、JASA、ICASSP和Interspeech等)33篇,Web of Science引用379次,Google引用超千次。参加美国国家标准技术署举办的说话人识别评测(国际权威评测),NIST SRE 2021,清华联队(THUEE)系统性能指标获世界第1。任ICASSP、ICME、Interspeech、Odyssey和CCBR程序主席等;创办全国声纹识别技术与应用研讨会;中文信息学会开源情报专委会秘书长;全国刑事技术标准化技术委员会副主任。
期刊论文
Fangjing Niu, Tengfei Cao, Ying Hu, Hao Huang and Liang He, "Speech Topic Classification Based on Pre-trained and Graph Networks," in 2023 IEEE International Conference on Multimedia and Expo (ICME), 2023, pp. 1721-1726. (Oral)
Zhida Song, Liang He, Baowei Zhao, Minqiang Xu and Yu Zheng, "Dynamic Fully Connected Layer for Large-Scale Speaker Verification," INTERSPEECH 2023, 2023, pp. 2003–2007.
Zhihua Fang, Liang He, Hanhan Ma, Xiaochen Guo and Lin Li, "Robust Training for Speaker Verification against Noisy Labels," INTERSPEECH 2023, 2023, pp. 3192–3196.
Jian Zhang, Liang He, Ciaochen Guo and Jing Ma, "A Study on Visualization of Voiceprint Feature," INTERSPEECH 2023, 2023, pp. 2233–2237. (Oral)
Jiaming Li, Liang He, Lei Wang, Shaolei Wang, Hanhan Ma and Kan Feng, "Makbqa: Multi-hop knowledge base question answering system based on sensors and internet agricultural data," 2023 20th Annual IEEE International Conference on Sensing, Communication, and Networking (Secon), 2023.
Zuoer Chen and Liang He, "A quick and effective speaker diarization systemm," The Speaker and Language Recognition Workshop (Odyssey 2022), 2022, pp. 170–177.
Mengqi Niu, Liang He, Zhihua Fang, Baowei Zhao and Kai Wang, "Pseudo-phoneme label loss for text-independent speaker verification," Applied Sciences, 2022.
Weiwei Hu, Liang He, Hanhan Ma, Kai Wang and Jingfeng Xiao, "Kgner: Improving chinese named entity recognition by bert infused with the knowledge graph," Applied Sciences, 2022.
Tengfei Cao, Liang He and Fangjing Niu, "End-to-end speech topic classification based on pre-trained model wavlm," 2022 13th International Symposium on Chinese Spoken Language Processing (ISCSLP), 2022, pp. 369–373.
Zhida Song, Liang He, Zhihua Fang, Ying Hu and Hao Huang, "Virtual fully-connected layer for a large-scale speaker verification dataset," Biometric Recognition: 16th Chinese Conference (CCBR 2022), 2022, pp. 382–390.
Xinyue Ma, Tianyu Liang, Shanshan Zhang, Shen Huang and Liang He, "Improved Lightcnn with Attention Modules for Asv Spoofing Detection," 2021 IEEE International Conference on Multimedia and Expo (ICME), 2021, pp. 1-6.
Xianwei Zhang and Liang He. (2021) End-to-End Cross-Lingual Spoken Language Understanding Model with Multilingual Pretraining. Proc. Interspeech 2021, 4728-4732, doi: 10.21437/Interspeech.2021-818.
Wenhao Ding, Liang He, “Adaptive Multi-Scale Detection of Acoustic Events,” IEEE/ACM Transactions on Audio, Speech and Language Processing, vol. 28, no. 1, p. 294-306, Dec. 2020. (SCI)
Keming Zhang, Yuanwen Cai, Yuan Ren, Ruida Ye and Liang He, "MTF-CRNN: Multiscale Time-Frequency Convolutional Recurrent Neural Network for Sound Event Detection," IEEE Access, vol. 8, pp. 147337-147348, 2020, doi: 10.1109/ACCESS.2020.3015047.
Ruyun Li, Tianyu Liang, Dandan Song, Yi Liu, Yangcheng Wu, Can Xu, Peng Ouyang, Xianwei Zhang, Xianhong Chen, Weiqiang Zhang, Shouyi Yin and Liang He, "THUEE System for NIST SRE19 CTS Challenge," Interspeech 2020, pp. 2232-2236.
Liang He, Xianhong Chen, Can Xu, Liu Yi, Jia Liu and Michael T. Johnson, “Latent class model with application to speaker diarization,” EURASIP Journal on Audio, Speech, and Music Processing, vol. 2019, no. 1, p. 12, Jul. 2019. (SCI)
Xianhong chen, Liang He, Can Xu and Jia Liu, “Distance-Dependent Metric Learning,” IEEE Signal Processing Letters, Feb. 2019, 26(2), 357-361. (SCI)
Yi Liu, Liang He, Jia Liu, Michael T. Johnson, “Introducing phonetic information to speaker embedding for speaker verification,” EURASIP Journal on Audio, Speech, and Music Processing, vol. 2019, no. 1, p. 19, Dec. 2019. (SCI)
Liang He, Xianhong Chen, Can Xu, and Jia Liu, “Multi-objective Optimization Training of PLDA for Speaker Verification,” ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, United Kingdom, 2019, pp. 6026-6030.
Yi Liu, Liang He and Jia Liu, “Large Margin Softmax Loss for Speaker Verification,” INTERSPEECH 2019, 20th Annual Conference of the International Speech Communication Association.
Liang He, Xianhong Chen, Can Xu, Jia Liu and Michael T. Johnson, “Local Pairwise Linear Discriminant Analysis for Speaker Verification,” IEEE Signal Processing Letters, Oct. 2018, 25(10), 1575-1579. [code]. (SCI)
Xukui Yang, Liang He, Dan Qu and Weiqiang Zhang, “Semi-supervised minimum redundancy maximum relevance feature selection for audio classification,” Multimedia Tools and Applications 77(1), 713-739. (SCI)
Xukui Yang, Liang He, Dan Qu, Weiqiang Zhang and Michael T. Johnson, “Semi-supervised feature selection for audio classification based on constraint compensated Laplacian score”, EURASIP Journal on Audio, Speech, and Music Processing. (SCI)
Xukui Yang, Liang He, Dan Qu and Weiqiang Zhang, “Voice activity detection algorithm based on long-term pitch information”, EURASIP Journal on Audio, Speech, and Music Processing. (SCI)
专利
何亮、陈仙红、徐灿、梁天宇、刘加,一种基于深度混合模型的说话人确认方法,2021-10-01,ZL 201810465602.2
刘加、刘艺、何亮、张卫强,身份验证的方法、装置、计算机设备及存储介质,2021-10-08,ZL 2019 10711306.0
陈仙红、何亮、徐灿、刘加,一种说话人标记方法,2020-07-07,ZL 201710817534.7
何亮、徐灿、陈仙红、刘艺、田垚、刘巍巍、刘加,基于DNN模型和支持向量机模型的说话人个数估计方法,2020-05-19,ZL 201710123753.5
何亮、陈仙红、徐灿、刘艺、田垚、刘加,一种基于二次建模的说话人识别方法,2020-04-14,ZL 201710031899.7
刘艺、何亮、田垚、陈仙红、刘加,一种基于数字口令与声纹联合确认的用户身份验证方法,2020-01-07,ZL 201710208226.4
何亮, 徐灿, 田垚, 刘艺, 刘加; 基于密度峰值聚类和变分贝叶斯的说话人方法与系统, 2020-01-07, 中国, ZL201710035673.4.
刘加, 赵军红, 袁桦, 张卫强, 何亮, 赵峰, 邵颖; 特征提取方法、装置及重音检测的方法、装置,2018-12-25,中国,ZL201310488434.6.
刘加, 赵军红, 袁桦, 张卫强, 何亮, 赵峰, 邵颖; 韵律事件检测方法和装置,2018-10-02,中国,ZL201310487945.6.
何亮,张卫强,刘加;一种用于语种识别的建模方法及装置, 2012-07-04,中国,ZL201010207237.9.
|