几种VAD算法的测评
☆25Jul 31, 2020Updated 5 years ago
Alternatives and similar repositories for VAD_campare
Users that are interested in VAD_campare are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Codes and MIDI demos of ISMIR 2022 paper: Domain Adversarial Training on Conditional Variational Auto-Encoder for Controllable Music Gene…☆21Mar 28, 2023Updated 3 years ago
- speech enhancement algorithms for microphone arrays☆15May 12, 2020Updated 6 years ago
- Ambisonic Blind Reverberation Time Estimation☆12Jun 14, 2020Updated 5 years ago
- Audio signals noise reduction☆13Dec 27, 2021Updated 4 years ago
- A tutorial on the delay and sum beamformer for microphone arrays☆18Jun 9, 2017Updated 8 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Various algorithms for voice activity detection☆22Jan 31, 2017Updated 9 years ago
- 使用keras&tensorflow框架,GTZAN数据集。☆15Feb 20, 2019Updated 7 years ago
- ☆69Jul 17, 2024Updated last year
- Python implementation of OMLSA+IMCRA algorithm for speech enhancement.☆69Jun 29, 2021Updated 4 years ago
- ☆11Dec 28, 2023Updated 2 years ago
- A fourier-based audio-synthesiser wrote in MATLAB as a university project.☆12Jan 19, 2019Updated 7 years ago
- Silero VAD(ncnn): pre-trained enterprise-grade Voice Activity Detector.☆26Aug 21, 2024Updated last year
- context-aware Unet based on transformer for speech denoising☆24Feb 6, 2021Updated 5 years ago
- Hugging Face Audio Course中文版,帮助学习者快速入门音频模态☆37May 25, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- This is the microphone array generalization investigation based on previous Narrow Band Deep Filtering methods.☆38Mar 12, 2024Updated 2 years ago
- This is the implementation of the paper "Physiological-Physical Feature Fusion for Automatic Voice Spoofing Detection"☆13Jun 5, 2023Updated 2 years ago
- ☆11Mar 28, 2021Updated 5 years ago
- 碧树西风经典文章☆11Dec 17, 2021Updated 4 years ago
- LogMMSE speech enhancement/noise reduction☆89Apr 1, 2020Updated 6 years ago
- A personal toolkit for single/multi-channel speech recognition & enhancement & separation.☆146Jul 6, 2023Updated 2 years ago
- Deepfake cross-lingual evaluation dataset (DECRO) is constructed to evaluate the influence of language differences on deepfake detection.…☆16Sep 14, 2023Updated 2 years ago
- A fully and partially fake speech dataset for evaluation☆15Nov 11, 2025Updated 6 months ago
- Demonstrate Yolov9 model with Qualcomm Hexagon NPU and DirectML☆12Nov 27, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- An unofficial implementation of Lite-RTSE, a cost-effective lite model for real-time speech enhancement☆14Nov 19, 2023Updated 2 years ago
- 记录关于AEC的论文和代码、博客以及相关资料☆15Jul 26, 2022Updated 3 years ago
- Basic Tools☆13Dec 18, 2021Updated 4 years ago
- E2E-SincNet: Toward fully end-to-end speech recognition☆30Feb 1, 2020Updated 6 years ago
- C Library for computing MFCC☆13Apr 30, 2019Updated 7 years ago
- BC-ResNet for Keyword Spotting☆44Jan 11, 2022Updated 4 years ago
- InfAnFace: Bridging the infant-adult domain gap in facial landmark estimation in the wild (ICPR2022)☆15Jan 9, 2026Updated 4 months ago
- Python的音频工具☆16Dec 5, 2025Updated 5 months ago
- LibriVoc is a new open-source, large-scale dataset for vocoder artifact detection. LibriVoc is derived from the LibriTTS speech corpus, w…☆16Nov 6, 2025Updated 6 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- An audio steganalysis method based on CNN in the time domain.☆12Feb 25, 2021Updated 5 years ago
- This repository collects papers related to Speech Tokenizer.☆18Oct 16, 2024Updated last year
- [WIP]Direction based Multi-Channel Speech Separation☆14Jan 25, 2024Updated 2 years ago
- ☆19Dec 29, 2024Updated last year
- tdnn (time delay neural network) tensorflow implementation☆10Mar 6, 2020Updated 6 years ago
- An implementation of Jasper, QuartzNet, Citrinet and pipeline for training CTC-based ASR models☆12Nov 13, 2021Updated 4 years ago
- MVDR beamformer written in python☆10Jul 2, 2021Updated 4 years ago