Audio Split 基于双门限法的语音端点检测及语音分割
☆135May 26, 2020Updated 5 years ago
Alternatives and similar repositories for voice_activity_detection
Users that are interested in voice_activity_detection are comparing it to the libraries listed below
Sorting:
- Acoustic feature extraction using Librosa library and openSMILE toolkit.使用Librosa音频处理库和openSMILE工具包,进行简单的声学特征提取☆217May 26, 2020Updated 5 years ago
- Minerva是一个便捷的音频工具,支持快速进行录音(PCM/MP3/WAV)和VAD端点检测识别,并保存活动语音。☆10May 23, 2024Updated last year
- the algorithm of Apnea detection by breath☆14Jan 3, 2021Updated 5 years ago
- Simple DNN based Voice Activity Detection (VAD) using Pytorch☆42Feb 8, 2020Updated 6 years ago
- 语音切割,python ,webrtc☆11Sep 28, 2018Updated 7 years ago
- Pytorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021☆159Oct 26, 2021Updated 4 years ago
- 🔈 Use python to achieve voice activity detection, this little program may be helpful for voice application☆169Dec 28, 2017Updated 8 years ago
- ☆16Dec 12, 2023Updated 2 years ago
- A qt capture window that can be resize(with resize marker), move , and transparent.☆18Sep 4, 2019Updated 6 years ago
- Automatic Speech Recognition with TensorFlow(CNN+BLSTM+CTC)☆12Aug 9, 2018Updated 7 years ago
- ☆49Feb 12, 2026Updated 3 weeks ago
- Human ear perception scales and feature(mel、bark、ERB、gammatone)☆27May 19, 2024Updated last year
- json是一个C++语言的Json解析程序,提供json解析及构造json数据功能。☆23Jul 26, 2015Updated 10 years ago
- an Audio-Visual Voice Activity Detection using Deep Learning☆50Apr 7, 2019Updated 6 years ago
- Extract frequency, power, width and dissonance of formants from wav files☆28Jun 3, 2022Updated 3 years ago
- Robust Speech Activity Detection (SAD) in movie audio☆26Jan 27, 2021Updated 5 years ago
- PyTorch Implementation of Generalized End-to-End Loss for Speaker Verification☆28Jan 23, 2021Updated 5 years ago
- DiTTo-TTS: Diffusion Transformers for Scalable Text-to-Speech without Domain-Specific Factors☆37Feb 11, 2025Updated last year
- distributed data parallel, apex, and horovod tutorial example codes☆26Jan 16, 2021Updated 5 years ago
- ☆33Jan 14, 2023Updated 3 years ago
- Manage audio and video datasets☆34Updated this week
- This is the repository for the work "BridgeVoC: Revitalizing Neural Vocoder from a Restoration Perspective".☆64Nov 5, 2025Updated 4 months ago
- Prediction of sound event bounding boxes (SEBBs)☆32Aug 2, 2024Updated last year
- Vocal Synthesis Through MIDI and Vocal Transformation Using RVC (KO, EN, JA, ZH)☆33Sep 12, 2023Updated 2 years ago
- End-To-End Speaker Verification based on X-vector and Neural PLDA - A PyTorch implementation☆23Feb 17, 2022Updated 4 years ago
- 音频特征提取程序,MFCC,HFCC,MFCC_WALSH,Philips☆31Mar 31, 2019Updated 6 years ago
- Elucidating The Design Space of Classifier-Guided Diffusion Generation☆32Jan 20, 2024Updated 2 years ago
- A toolkit to implement segmentation on speech based on BIC and nerual network, such as BiLSTM☆123Aug 7, 2019Updated 6 years ago
- A statistical model-based Voice Activity Detection☆194Nov 30, 2018Updated 7 years ago
- dl4j for beginner☆30May 12, 2021Updated 4 years ago
- Voice Activity Detection (VAD) using deep learning.☆204Oct 14, 2019Updated 6 years ago
- ☆32Aug 10, 2022Updated 3 years ago
- [ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion☆91Jul 23, 2025Updated 7 months ago
- Voice conversion training with 109 speakers with limited training samples☆35Dec 21, 2020Updated 5 years ago
- Finetuning & extending DiffusionDet to video & pedestrian multi-object-tracking☆13Apr 12, 2023Updated 2 years ago
- 파파고 비공식 번역 자동화 도구 (Unofficial Papago API using reverse-engineered web endpoints)☆10Jul 4, 2025Updated 8 months ago
- 新词发现/新词挖掘/自由度/凝固度/python3☆10May 28, 2019Updated 6 years ago
- Image reconstruction using Bézier diffusion curves and color diffusion constrained by those curves.☆35Jan 22, 2026Updated last month
- 单纯形法,运输问题的Python简单实现。最优化方法课程作业,设计一个可以运行的平台软件,可以选择单纯形、运输问题求解。☆14Jan 5, 2021Updated 5 years ago