Efficient voice activity detection algorithm using long-term spectral flatness measurement
☆15Feb 21, 2017Updated 9 years ago
Alternatives and similar repositories for vad_lsfm
Users that are interested in vad_lsfm are comparing it to the libraries listed below
Sorting:
- The code about “LABNet: A Lightweight Attentive Beamforming Network for Ad-hoc Multichannel Microphone Invariant Real-Time Speech Enhance…☆38Oct 10, 2025Updated 4 months ago
- ASLP Summer Inter@NPU☆12Jul 30, 2024Updated last year
- ☆20Aug 25, 2025Updated 6 months ago
- General utility functions for golang☆13Sep 28, 2020Updated 5 years ago
- PHP Template Engine using nothing more than HTML5 tags.☆11Aug 7, 2015Updated 10 years ago
- The implementation of "End-to-End Neural Speaker Diarization with an Iterative Adaptive Attractor Estimation", which is accepted by Neura…☆11Aug 27, 2023Updated 2 years ago
- Power-Guided Grouped SRU for Real-Time Causal Audio-Visual Speech Separation☆23Nov 4, 2025Updated 4 months ago
- This is the official implementation of PGUSE☆34Jun 7, 2025Updated 9 months ago
- ☆15Jan 24, 2017Updated 9 years ago
- ☆21Jul 16, 2025Updated 7 months ago
- Official baseline, dataset and evaluation scripts for the ICASSP 2026 URGENT challenge.☆33Nov 12, 2025Updated 3 months ago
- Official Implementation of LauraTSE: Target Speaker Extraction using Auto-Regressive Decoder-Only Language Models.☆32Nov 9, 2025Updated 3 months ago
- An End-to-End Pipeline for Enhanced French Text-to-Speech with SSML Prosody Control☆31Jan 13, 2026Updated last month
- Official code of SenSE.☆74Oct 30, 2025Updated 4 months ago
- This repository is based on the Voice Acitivyt Detectors (VAD) implemented on "Analysis of the use of noise removal techniques as preproc…☆19Dec 19, 2017Updated 8 years ago
- extract "agc" and "ns" part from webrtc☆21Mar 2, 2016Updated 10 years ago
- cgo interface to WebRTC Voice Activity Dectection☆68Jan 21, 2021Updated 5 years ago
- Filtering and Noise Adding Tool☆29May 27, 2022Updated 3 years ago
- The official SpeakerVid-5M data curation code.☆68Jul 23, 2025Updated 7 months ago
- Fast algorithm for determined blind source separation with update of demixing filters with joint adjustment of the remaining sources.☆35Mar 22, 2021Updated 4 years ago
- TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings☆38Oct 27, 2025Updated 4 months ago
- Speech Recognition implementation using Artificial Neural Networks☆10Sep 7, 2015Updated 10 years ago
- Continual Resilient (CoRe) Optimizer for PyTorch☆11Jun 10, 2024Updated last year
- Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha…☆38Aug 7, 2024Updated last year
- ☆15Mar 15, 2022Updated 3 years ago
- AdvSV stands as the first dataset developed specifically for evaluating Speaker Verification (SV) systems against adversarial attacks. I…☆11Nov 21, 2023Updated 2 years ago
- [INTERSPEECH 2025] Official code for "SEED: Speaker Embedding Enhancement Diffusion Model"☆57Nov 3, 2025Updated 4 months ago
- A news based stock scalper using LLM and quant approach☆15Jan 16, 2025Updated last year
- scripts to align a given wave to its transcription using trained models by Kaldi☆36Aug 15, 2019Updated 6 years ago
- Simple implementation of TDOA localization algorithm.☆13Oct 12, 2016Updated 9 years ago
- VOICOR: A Residual Iterative Voice Correction Framework for Monaural Speech Enhancement☆46Sep 12, 2024Updated last year
- Research_speech_speaker_verification_nist_sre2010☆12Mar 1, 2016Updated 10 years ago
- ☆12Jun 17, 2019Updated 6 years ago
- A Flow Matching-based Text-to-Speech Model with Emoji-driven Style Control☆28Feb 27, 2026Updated last week
- This is a repository of Swagger documents representing the HL7 FHIR REST API Implementation.☆11Aug 4, 2017Updated 8 years ago
- Using acceleration and heart rate data to classify awake, deep, and light sleep☆10Dec 21, 2017Updated 8 years ago
- CEX.IO API integration. PHP sources.☆21Oct 16, 2017Updated 8 years ago
- ☆15Sep 16, 2024Updated last year
- A signal processing library, currently sufficient for basic speech recognition stuff like mel frequency cepstrum☆19Mar 15, 2012Updated 13 years ago