Tr-VAD: An Efficient Transformer based Voice Activity Detection Model
☆17 · Aug 1, 2024 · Updated last year
Alternatives and similar repositories for Tr-VAD
Users that are interested in Tr-VAD are comparing it to the libraries listed below.
- Pytorch implementation of "spectro-temporal attention-based voice activity detection" ☆13 · Jun 4, 2024 · Updated last year
- A repository for code used to produce the results of the ICASSP 2024 paper: "SELF-SUPERVISED PRETRAINING FOR ROBUST PERSONALIZED VOICE ACTIV… ☆21 · Nov 25, 2024 · Updated last year
- This is the public repository for SALSA-Lite features for polyphonic sound event localization and detection using microphone arrays. ☆14 · Dec 3, 2021 · Updated 4 years ago
- Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation… ☆76 · Jul 29, 2024 · Updated last year
- Implementation of CGMM-MVDR beamforming used for Clarity challenge ☆14 · Jan 14, 2022 · Updated 4 years ago
- Code for ICASSP 2024 paper WhisperSeg: Positive Transfer of the Whisper Speech Transformer to Human and Animal Voice Activity Detection ☆41 · Jul 25, 2025 · Updated 8 months ago
- ASLP Summer Inter@NPU ☆12 · Jul 30, 2024 · Updated last year
- Codebase of the submitted work in ICASSP 2023 ☆14 · Nov 30, 2022 · Updated 3 years ago
- Efficient Personalized Speech Enhancement through Self-Supervised Learning ☆23 · Mar 12, 2023 · Updated 3 years ago
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv… ☆152 · Jun 5, 2025 · Updated 9 months ago
- An unofficial implementation of the Personal VAD speaker-conditioned voice activity detection method. Bachelor's thesis project. ☆80 · Sep 22, 2022 · Updated 3 years ago
- Universal differential equations for ecologists ☆14 · Mar 2, 2026 · Updated 3 weeks ago
- 3D Sound Source Localization using Masked Autoencoders ☆19 · Feb 12, 2025 · Updated last year
- Landing Page for Divide and Remaster v3 ☆26 · Jul 29, 2025 · Updated 8 months ago
- Exploring Binary Classification Loss for Speaker Verification ☆18 · Jul 18, 2023 · Updated 2 years ago
- Fault Injection Automatic Test Equipment ☆16 · Nov 22, 2021 · Updated 4 years ago
- ☆22 · Jul 10, 2025 · Updated 8 months ago
- ☆11 · Sep 22, 2022 · Updated 3 years ago
- A toolkit dedicated to speech evaluation. ☆23 · Sep 26, 2024 · Updated last year
- audio, NLP, ML with huggingface, nvidia/nemo, speechbrain ☆11 · Sep 4, 2023 · Updated 2 years ago
- A scalable solution that simplifies the integration of ComfyUI for developers ☆11 · Jul 15, 2024 · Updated last year
- Silero VAD(ncnn): pre-trained enterprise-grade Voice Activity Detector. ☆25 · Aug 21, 2024 · Updated last year
- ☆13 · May 23, 2024 · Updated last year
- Welcome to my project. OpenPyVision is a real time videoMixer based on opencv and pyqt6. ☆14 · Aug 22, 2024 · Updated last year
- including compiler to encode DGL GNN model to instructions, runtime software to transfer data and control the accelerator, and hardware v… ☆14 · Nov 19, 2023 · Updated 2 years ago
- Reproduction of the paper SFSRNet: Super-resolution for single-channel Audio Source Separation by me (@arda-num) and @dritx16. Navigate P… ☆11 · Jul 7, 2022 · Updated 3 years ago
- state-of-the-art models for diacritics restoration for Arabic language ☆17 · Feb 23, 2025 · Updated last year
- ☆25 · Aug 29, 2025 · Updated 7 months ago
- Official repository for the paper "xLSTM-SENet: xLSTM for Single-Channel Speech Enhancement" (Accepted to INTERSPEECH 2025) ☆58 · Aug 28, 2025 · Updated 7 months ago
- Simple PyTorch Denoisers for Waveform Audio ☆41 · Mar 18, 2026 · Updated last week
- Spherical residual vector quantization (SRVQ) ☆31 · Aug 25, 2024 · Updated last year
- ☆80 · Mar 3, 2026 · Updated 3 weeks ago
- [TMLR 2024] Revisiting Random Weight Perturbation for Efficiently Improving Generalization ☆12 · Oct 18, 2024 · Updated last year
- Transcribe desktop audio/computer audio in real-time and locally (Streaming ASR), using TorchAudio and Emformer-RNNT model for inference,… ☆14 · May 7, 2024 · Updated last year
- steps to perform text-based speaker diarization with kaldi toolkit ☆12 · Nov 2, 2018 · Updated 7 years ago
- TCP tunnel powered by epoll ☆15 · Dec 16, 2021 · Updated 4 years ago
- A framework for evaluating the effectiveness of chain-of-thought reasoning in language models. ☆19 · Feb 6, 2025 · Updated last year
- ☆12 · Mar 11, 2024 · Updated 2 years ago
- multi-scale time domain speaker extraction ☆73 · Jun 7, 2021 · Updated 4 years ago