☆21Mar 7, 2025Updated last year
Alternatives and similar repositories for turndetection
Users that are interested in turndetection are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Llama-Mimi is a speech language model that uses a unified tokenizer (Mimi) and a single Transformer decoder (Llama) to jointly model sequ…☆31Sep 20, 2025Updated 8 months ago
- StyleTTS2 + Vocos as a Decoder☆13Mar 24, 2025Updated last year
- Tool to make high quality text to speech (tts) corpus from audio + text books.☆27Jul 31, 2025Updated 10 months ago
- Testing sets for semanticVAD☆20Feb 18, 2025Updated last year
- ☆58Feb 8, 2026Updated 4 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription.☆13Sep 27, 2024Updated last year
- Onnx compatible styletts2 code☆16Apr 4, 2026Updated 2 months ago
- Russian phonetical transcription☆11May 20, 2026Updated 3 weeks ago
- Neural model for prediction of stress position in Russian words☆13Jun 22, 2025Updated 11 months ago
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆23Jun 7, 2025Updated last year
- ☆70Sep 3, 2024Updated last year
- ☆12Jan 14, 2020Updated 6 years ago
- 🎵 muse: Music Separation☆11Feb 14, 2024Updated 2 years ago
- ☆13Dec 7, 2022Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- 复现Wav2Lip作者新的论文☆20Jun 20, 2023Updated 2 years ago
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆22Jan 10, 2025Updated last year
- ☆41Feb 10, 2026Updated 4 months ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- C++ version of pyannote audio overlapped speech detection pipeline☆13Feb 14, 2024Updated 2 years ago
- Official code for paper:"Speaking Clearly: A Simplified Whisper-Based Codec for Low-Bitrate Speech Coding"☆37Jan 28, 2026Updated 4 months ago
- The implementation for "Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions"☆50Apr 7, 2025Updated last year
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆30May 27, 2023Updated 3 years ago
- [ICLR 2025] Enhancing Self-Supervised Models with Audio Mixtures for Polyphonic Soundscapes☆77Oct 8, 2025Updated 8 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆20Jan 8, 2025Updated last year
- Simple diarization model☆53Jun 13, 2025Updated last year
- Pure Java Llama2 inference with optional multi-GPU CUDA implementation☆13Sep 2, 2023Updated 2 years ago
- Convert @NVlabs StyleGAN pkls to @taki0112 StyleGAN-Tensorflow checkpoints (copy over the weights)☆27Sep 17, 2019Updated 6 years ago
- ☆24Mar 13, 2020Updated 6 years ago
- A demo project demonstrating the performance improvement by cpp extension, which wrapped with pybind11.☆10Nov 16, 2021Updated 4 years ago
- SHAS: Approaching optimal Segmentation for End-to-End Speech Translation☆44Feb 9, 2023Updated 3 years ago
- Kanade is a single-layer disentangled speech tokenizer that extracts compact tokens suitable for both generative and discriminative model…☆100May 18, 2026Updated 3 weeks ago
- ☆22Apr 29, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Standalone commandline CLI tool for compiling Triton kernels☆20Sep 13, 2024Updated last year
- 一个拥有长期记忆, 表情动作, 语音对话/打断/声纹识别, FunctionCall, 多模型支持的AI Waifu客户端.☆29Apr 23, 2025Updated last year
- ☆10Feb 17, 2023Updated 3 years ago
- Toolbox for Evaluation of AEC/AES Systems☆39Feb 18, 2026Updated 3 months ago
- only rmvpe☆24Aug 8, 2023Updated 2 years ago
- Streaming source separation for music and speech files, using the Open-Unmix LSTM architecture.☆21Dec 8, 2022Updated 3 years ago
- This repository contains the training code from paper "SpidR Learning Fast and Stable Linguistic Units for Spoken Language Models Without…☆57May 22, 2026Updated 3 weeks ago