基于cnn的音频分割工具包。做语音活动检测,语音检测,音乐检测,说话人性别识别。
inaSpeechSegmenter的Python项目详细描述
将音频信号分割成语音和音乐的均匀区域,并检测说话人的性别。
InAspeechSegmenter在加拿大卡尔加里举行的IEEE2018国际声学、语音和信号处理会议(ICASSP)上作了介绍。如果您在研究中使用此工具箱,您可以在出版物中引用以下工作:
@inproceedings{ddoukhanicassp2018,author={Doukhan, David and Carrive, Jean and Vallet, Félicien and Larcher, Anthony and Meignier, Sylvain},title={An Open-Source Speaker Gender Detection Framework for Monitoring Gender Equality},year={2018},organization={IEEE},booktitle={Acoustics Speech and Signal Processing (ICASSP), 2018 IEEE International Conference on}}
InAspeechSegmenter赢得了Mirex 2018语音检测挑战赛。
http://www.music-ir.org/mirex/wiki/2018:Music_and_or_Speech_Detection_Results
有关语音检测子模块的详细信息,请参见以下内容:
@inproceedings{ddoukhanmirex2018,author={Doukhan, David and Lechapt, Eliott and Evrard, Marc and Carrive, Jean},title={INA’S MIREX 2018 MUSIC AND SPEECH DETECTION SYSTEM},year={2018},booktitle={Music Information Retrieval Evaluation eXchange (MIREX 2018)}}