Benchmarking different VAD models on AVA-Speech dataset
☆18May 21, 2023Updated 2 years ago
Alternatives and similar repositories for VAD_Benchmark
Users who are interested in VAD_Benchmark are comparing it to the libraries listed below
- Implementation of Sheffield entry for Clarity enhancement challenge.☆18Apr 19, 2022Updated 3 years ago
- Implementation code for the paper: Multimodal Sentiment Analysis with Mutual Information-based Disentangled Representation Learning☆19May 8, 2025Updated 10 months ago
- Official implementation of the paper - GD-Retriever: Controllable generative text-music retrieval with diffusion models (Accepted at ISMI…☆17Sep 25, 2025Updated 5 months ago
- SPEAR Challenge scripts and tools.☆24Mar 17, 2023Updated 3 years ago
- Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model"☆15Jun 28, 2024Updated last year
- A set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition.☆48Oct 15, 2021Updated 4 years ago
- Utilizes ONNX Runtime for speech activity detection.☆42Dec 10, 2025Updated 3 months ago
- Implementation of the Generalized Side Lobe Canceller with fixed beamformer, parallel blocking matrix and adaptive interference canceller achie…☆29Oct 15, 2019Updated 6 years ago
- Official implementation of WildFX Dataset Generating pipeline.☆15Oct 21, 2025Updated 5 months ago
- An LSTM for voice activity detection. In fact, this is a homework assignment that I didn't expect.☆13Dec 3, 2020Updated 5 years ago
- Perceived Music Quality Dataset☆12Jul 1, 2024Updated last year
- ☆10Jan 26, 2021Updated 5 years ago
- Official Repository for paper "Ambisonizer: Neural Upmixing as Spherical Harmonics Generation"☆15May 27, 2024Updated last year
- Implementation of CGMM-MVDR beamforming used for Clarity challenge☆14Jan 14, 2022Updated 4 years ago
- Tiny reusable framework for building microservices with messaging and REST☆11Dec 15, 2023Updated 2 years ago
- Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations"☆15Dec 22, 2022Updated 3 years ago
- Zoom Audio Transcription offline☆33Sep 30, 2020Updated 5 years ago
- Feedforward Sequential Memory Networks☆16Aug 2, 2022Updated 3 years ago
- An extension of thu-spmi/CAT which contains a full-fledged implementation of CTC-CRF for TensorFlow.☆12Jul 5, 2021Updated 4 years ago
- PyTorch implementation of "spectro-temporal attention-based voice activity detection"☆13Jun 4, 2024Updated last year
- Python bindings for minimp3☆17Sep 11, 2023Updated 2 years ago
- A Mongo REPL with the full power of Mongoose☆16Jun 24, 2022Updated 3 years ago
- ☆16Mar 29, 2022Updated 3 years ago
- ☆24Jun 26, 2024Updated last year
- ☆16Dec 18, 2023Updated 2 years ago
- ☆21Apr 27, 2024Updated last year
- ☆51Jun 14, 2022Updated 3 years ago
- ☆16Apr 2, 2025Updated 11 months ago
- a grunt task plugin to run preen☆13Feb 21, 2016Updated 10 years ago
- WARNING: This connector is deprecated, please use an API Client☆22Mar 7, 2017Updated 9 years ago
- The source code for the paper titled "Sentiment Knowledge Enhanced Attention Fusion Network (SKEAFN)".☆31Aug 17, 2023Updated 2 years ago
- Description and usage of secret-loader into a real project and in relation with the medium article☆13Mar 20, 2020Updated 6 years ago
- http://jcgt.org/published/0002/02/06/☆16Dec 23, 2020Updated 5 years ago
- Unofficial Implementation of "Liu, W., Li, A., Wang, X., Yuan, M., Chen, Y., Zheng, C., & Li, X. (2022). A Neural Beamspace-Domain Filter…☆18Oct 21, 2022Updated 3 years ago
- ConvLSTM-AE_VAD_ICME2017 (code reimplementation)☆21Oct 10, 2020Updated 5 years ago
- Homework on numerical solutions of partial differential equations☆14Aug 10, 2020Updated 5 years ago
- Single Channel Speech Enhancement Methods and Toolbox☆48Feb 26, 2026Updated 3 weeks ago
- Lunary AI Python Client (Analytics, monitoring and evaluations for GenAI apps)☆20Apr 15, 2025Updated 11 months ago
- Support tools for punctuation and boundary detection for ASR output.☆55Dec 8, 2022Updated 3 years ago