杨佳沛,王宇,彭广建,白清,刘昕,靳宝全.基于DTW-GMM的光纤传感系统声纹识别方法[J].电子测量与仪器学报,2024,38(4):176-186
基于DTW-GMM的光纤传感系统声纹识别方法
Voiceprint recognition method of optical fiber sensingsystem based on DTW-GMM
  
DOI:
中文关键词:  光纤传感  萨格奈克干涉  声纹识别  高斯混合模型
英文关键词:optical fiber sensing  Sagnac interference  voiceprint recognition  Gaussian mixture model
基金项目:山西省重点研发计划项目(202102130501021)、山西省水利科学技术研究与推广项目(2024GM18)、中央引导地方科技发展资金项目(YDZJSX20231B004)、山西省科技创新团队项目(201805D131003)资助
作者单位
杨佳沛 太原理工大学电子信息与光学工程学院太原030024 
王宇 1.太原理工大学电子信息与光学工程学院太原030024;2.太原理工大学新型传感器与智能控制教育部重点实验室 太原030024;3.山西省交通科技研发有限公司太原030024 
彭广建 太原理工大学电子信息与光学工程学院太原030024 
白清 1.太原理工大学电子信息与光学工程学院太原030024;2.太原理工大学新型传感器与智能控制教育部重点实验室 太原030024;3.山西省交通科技研发有限公司太原030024 
刘昕 1.太原理工大学电子信息与光学工程学院太原030024;2.太原理工大学新型传感器与智能控制教育部重点实验室 太原030024;3.山西省交通科技研发有限公司太原030024 
靳宝全 1.太原理工大学电子信息与光学工程学院太原030024;2.太原理工大学新型传感器与智能控制教育部重点实验室 太原030024 
AuthorInstitution
Yang Jiapei College of Electronic Information and Optical Engineering, Taiyuan University of Technology, Taiyuan 030024, China 
Wang Yu 1.College of Electronic Information and Optical Engineering, Taiyuan University of Technology, Taiyuan 030024, China; 2.Key Laboratory of Advanced Transducers and Intelligent Control System, Ministry of Education, Taiyuan University of Technology, Taiyuan 030024, China; 3.Shanxi Transportation Technology Research & Development Company Limited, Taiyuan 030024, China 
Peng Guangjian College of Electronic Information and Optical Engineering, Taiyuan University of Technology, Taiyuan 030024, China 
Bai Qing 1.College of Electronic Information and Optical Engineering, Taiyuan University of Technology, Taiyuan 030024, China; 2.Key Laboratory of Advanced Transducers and Intelligent Control System, Ministry of Education, Taiyuan University of Technology, Taiyuan 030024, China; 3.Shanxi Transportation Technology Research & Development Company Limited, Taiyuan 030024, China 
Liu Xin 1.College of Electronic Information and Optical Engineering, Taiyuan University of Technology, Taiyuan 030024, China; 2.Key Laboratory of Advanced Transducers and Intelligent Control System, Ministry of Education, Taiyuan University of Technology, Taiyuan 030024, China; 3.Shanxi Transportation Technology Research & Development Company Limited, Taiyuan 030024, China 
Jin Baoquan 1.College of Electronic Information and Optical Engineering, Taiyuan University of Technology, Taiyuan 030024, China; 2.Key Laboratory of Advanced Transducers and Intelligent Control System, Ministry of Education, Taiyuan University of Technology, Taiyuan 030024, China 
摘要点击次数: 41
全文下载次数: 2553
中文摘要:
      为了满足易燃易爆环境的声纹识别需求,设计了直线型萨格奈克干涉光纤声音传感系统,利用维纳滤波算法对语音数据进行了降噪,通过三电平削波法获取了基音周期特征,采用动态时间规整算法筛选了说话人样本,并提取了梅尔频率倒谱系数特征,运用高斯混合模型 期望最大化算法开展了声纹识别实验研究,同时探究了光纤声音传感系统的频率响应特性与声纹特征,研究了采集语音幅值对声纹识别结果的影响。实验结果表明,系统可实现300~3 500 Hz频率段的声音信号感知,声音幅值从0.9 V降至0.15 V时最大与次大对数似然值之差由35.5降至10.9,识别结果从成功变为失败。重复性实验表明,在10 km的传感光纤上,距声源2 m位置处,传感系统可对400段时长为3~5 s之间的文本无关语音段实现准确检测,且综合识别准确率为94.75%。本系统有望为易燃易爆环境中的设备故障、应急救援、渗漏监测等领域提供声纹识别的解决方案。
英文摘要:
      In order to meet the demand of voiceprint recognition in flammable and explosive environment. A linear Sagnac interference optical fiber acoustic sensor system has been designed. Speech data was denoised using the Wiener filtering algorithm, and pitch features were extracted through three-level clipping. Speaker samples were screened using dynamic time warping, and Mel-frequency cepstral coefficients were extracted as features. Voiceprint recognition experiments were conducted utilizing the Gaussian mixture model-expectation maximization algorithm, concurrently investigating the frequency response characteristics of the optical fiber acoustic sensor system and their relationship with voiceprint features. The influence of the amplitude of acquired speech on voiceprint recognition outcomes was studied. Experimental results demonstrate that the system can realize the sound signal perception in the frequency range of 300~3 500 Hz. When the sound amplitude decreases from 0.9 to 0.15 V, the difference between the maximum and second-largest log-likelihood values drops from 35.5 to 10.9, the recognition result changed from success to failure. Repetition experiments show that, at a distance of 2 meters from the sound source along a 10-kilometer sensing fiber, the system accurately recognizes 400 speech segments of 3 to 5 seconds duration, unrelated to any specific text, achieving an overall identification accuracy rate of 94.75%. This system holds promise as a solution for voiceprint recognition in applications such as equipment fault diagnosis and emergency response within flammable and explosive environments.
查看全文  查看/发表评论  下载PDF阅读器