☆21Mar 7, 2025Updated 11 months ago
Alternatives and similar repositories for turndetection
Users who are interested in turndetection are comparing it to the libraries listed below
- Llama-Mimi is a speech language model that uses a unified tokenizer (Mimi) and a single Transformer decoder (Llama) to jointly model sequ…☆28Sep 20, 2025Updated 5 months ago
- Russian phonetic transcription☆11Nov 19, 2025Updated 3 months ago
- 🎵 muse: Music Separation☆11Feb 14, 2024Updated 2 years ago
- Testing sets for semanticVAD☆20Feb 18, 2025Updated last year
- Neural model for prediction of stress position in Russian words☆13Jun 22, 2025Updated 8 months ago
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription.☆13Sep 27, 2024Updated last year
- StyleTTS2 + Vocos as a Decoder☆13Mar 24, 2025Updated 11 months ago
- Official code for the paper "Speaking Clearly: A Simplified Whisper-Based Codec for Low-Bitrate Speech Coding"☆33Jan 28, 2026Updated last month
- C++ version of pyannote audio overlapped speech detection pipeline☆13Feb 14, 2024Updated 2 years ago
- ☆13Dec 7, 2022Updated 3 years ago
- ☆35Feb 10, 2026Updated 3 weeks ago
- ☆19Jan 8, 2025Updated last year
- This repository contains the training code from paper "SpidR Learning Fast and Stable Linguistic Units for Spoken Language Models Without…☆50Feb 4, 2026Updated last month
- ☆57Feb 8, 2026Updated 3 weeks ago
- A lightweight end-of-utterance detection model fine-tuned on SmolLM2-135M, optimized for Raspberry Pi and low-power devices.☆45Nov 8, 2025Updated 3 months ago
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆22Jan 10, 2025Updated last year
- Toolbox for Evaluation of AEC/AES Systems☆33Feb 18, 2026Updated 2 weeks ago
- Kanade is a single-layer disentangled speech tokenizer that extracts compact tokens suitable for both generative and discriminative model…☆78Feb 3, 2026Updated last month
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆30May 27, 2023Updated 2 years ago
- T5-based (Russian) text normalization☆25Jan 25, 2024Updated 2 years ago
- Reproduction of the new paper by the Wav2Lip authors☆20Jun 20, 2023Updated 2 years ago
- Tool to make high quality text to speech (tts) corpus from audio + text books.☆28Jul 31, 2025Updated 7 months ago
- Streaming source separation for music and speech files, using the Open-Unmix LSTM architecture.☆21Dec 8, 2022Updated 3 years ago
- ☆24Mar 13, 2020Updated 5 years ago
- Data from the 6th edition of A. A. Zaliznyak's "Grammatical Dictionary of the Russian Language" (2010) as text files☆25Sep 17, 2024Updated last year
- My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆33Oct 23, 2025Updated 4 months ago
- A collection of all our phonemizers for dataset construction and inference☆27Feb 21, 2025Updated last year
- Stable timestamps and confidence score for words of OpenAI's Whisper outputs down to word-level.☆24Dec 20, 2022Updated 3 years ago
- A simple, performant re-implementation of AutoVC☆22Jul 6, 2023Updated 2 years ago
- 🎹 pyannote + 🗒 notebook = pyannotebook☆26Jun 12, 2023Updated 2 years ago
- Streaming Vocos☆30Jun 10, 2025Updated 8 months ago
- only rmvpe☆23Aug 8, 2023Updated 2 years ago
- ☆28Nov 15, 2023Updated 2 years ago
- ☆68Dec 30, 2025Updated 2 months ago
- ☆24Sep 20, 2024Updated last year
- Normalize Text in Russian☆28Nov 7, 2023Updated 2 years ago
- An AI Waifu client with long-term memory, expressions and motions, voice conversation/interruption/voiceprint recognition, FunctionCall, and multi-model support.☆26Apr 23, 2025Updated 10 months ago
- ⚡ Blazing fast audio augmentation in Python, powered by GPU for high-efficiency processing in machine learning and audio analysis tasks.☆35Jan 19, 2024Updated 2 years ago