☆27Oct 25, 2024Updated last year
Alternatives and similar repositories for NAS_VAD
Users that are interested in NAS_VAD are comparing it to the libraries listed below
Sorting:
- This is the public repository for SALSA-Lite features for polyphonic sound event localization and detection using microphone arrays.☆12Dec 3, 2021Updated 4 years ago
- ☆15Aug 25, 2022Updated 3 years ago
- ☆20Apr 27, 2024Updated last year
- ☆21Jul 29, 2024Updated last year
- Learning an Interpretable End-to-End Network for Real-Time Acoustic Beamforming☆15Aug 20, 2024Updated last year
- speex aec kalman filter☆15Mar 17, 2024Updated last year
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder☆12Mar 11, 2025Updated 11 months ago
- ☆24Aug 29, 2025Updated 6 months ago
- ☆11Nov 7, 2024Updated last year
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …☆28Nov 7, 2025Updated 3 months ago
- ☆13Nov 22, 2022Updated 3 years ago
- ⚡ Blazing fast audio augmentation in Python, powered by GPU for high-efficiency processing in machine learning and audio analysis tasks.☆35Jan 19, 2024Updated 2 years ago
- ☆57Apr 24, 2024Updated last year
- An upgrade framework for train and validate compare with icefall using Lightning.☆15Mar 26, 2025Updated 11 months ago
- kaldi cnn-tdnnf baseline☆13Aug 31, 2021Updated 4 years ago
- Speech recognition module for Python, supporting several engines and APIs, online and offline.☆13Mar 9, 2022Updated 3 years ago
- ☆14Jun 16, 2023Updated 2 years ago
- ☆15Nov 11, 2024Updated last year
- Evaluation of STT models for german language☆15Jan 22, 2022Updated 4 years ago
- Pytorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021☆159Oct 26, 2021Updated 4 years ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆18Oct 2, 2024Updated last year
- Ecr-helper is a tool for call recording☆25Apr 18, 2025Updated 10 months ago
- Implementation of CGMM-MVDR beamforming used for Clarity challenge☆14Jan 14, 2022Updated 4 years ago
- ☆13Jan 12, 2024Updated 2 years ago
- a repository for trainabale tts multi speaker☆14Nov 28, 2021Updated 4 years ago
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation☆36Jan 17, 2024Updated 2 years ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆15May 16, 2025Updated 9 months ago
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆21Jun 7, 2025Updated 8 months ago
- ☆14Jun 12, 2015Updated 10 years ago
- ☆30Jan 22, 2026Updated last month
- Voice activity detection and speaker gender segmentation audiovisual corpus☆16Jan 20, 2025Updated last year
- ☆57Jul 5, 2022Updated 3 years ago
- phone inventory library☆17May 15, 2023Updated 2 years ago
- [WIP]Trying to implement "Ultra Low Complexity Deep Learning Based Noise Suppression." arXiv preprint arXiv:2312.08132 (2023).☆26May 29, 2024Updated last year
- ☆23Jan 29, 2026Updated last month
- Matlab and Python libraries for an unsupervised method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised …☆138Jan 20, 2024Updated 2 years ago
- ☆33Aug 6, 2021Updated 4 years ago
- A STFT/iSTFT written up in PyTorch using 1D Convolutions☆32Jul 9, 2024Updated last year
- This repository contains the training code from paper "SpidR Learning Fast and Stable Linguistic Units for Spoken Language Models Without…☆50Feb 4, 2026Updated 3 weeks ago