A cross-platform inference engine for neural TTS models.
☆73 · Nov 25, 2024 · Updated last year
Alternatives and similar repositories for sonata
Users who are interested in sonata are comparing it to the libraries listed below.
- Speech detection using silero vad in Rust ☆30 · Dec 16, 2024 · Updated last year
- Almost-Pure Rust TTS Engine for my Rustnation talk ☆50 · Jan 6, 2025 · Updated last year
- ☆19 · Jan 8, 2025 · Updated last year
- A whisper <lib|cli|server> written in rust ☆20 · Jan 3, 2026 · Updated 2 months ago
- IPA Phonemizer/Dephonemizer for 140 human languages ☆55 · Feb 11, 2026 · Updated 3 weeks ago
- ☆38 · Apr 3, 2024 · Updated last year
- Use piper TTS models in Rust ☆48 · Dec 17, 2024 · Updated last year
- pyannote audio diarization in rust ☆105 · Sep 7, 2025 · Updated 5 months ago
- Speexdsp bindings and pure-rust implementation ☆25 · Feb 2, 2026 · Updated last month
- ☆57 · Feb 8, 2026 · Updated 3 weeks ago
- Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories ☆19 · Apr 10, 2025 · Updated 10 months ago
- Russian phonetical transcription ☆11 · Nov 19, 2025 · Updated 3 months ago
- A corpus of diacritized Hebrew texts (טקסט מנוקד) ☆11 · May 4, 2022 · Updated 3 years ago
- Work in progress rust bindings to ggml ☆12 · May 1, 2023 · Updated 2 years ago
- 🎵 muse: Music Separation ☆11 · Feb 14, 2024 · Updated 2 years ago
- IPA Phonetic dataset lexicon ☆18 · Feb 22, 2026 · Updated last week
- ☆12 · Feb 3, 2026 · Updated last month
- Native JSON for Rust ☆17 · Dec 10, 2023 · Updated 2 years ago
- Rust bindings to https://github.com/k2-fsa/sherpa-onnx ☆297 · Nov 1, 2025 · Updated 4 months ago
- Set or toggle multiple monitor's input sources via DDC/CI ☆13 · Mar 1, 2019 · Updated 7 years ago
- Openfst mirror with some fixes ☆14 · Aug 23, 2024 · Updated last year
- Llama-Mimi is a speech language model that uses a unified tokenizer (Mimi) and a single Transformer decoder (Llama) to jointly model sequ… ☆28 · Sep 20, 2025 · Updated 5 months ago
- Simple, efficient and cross-platform TFIDF-based text summarizer in Rust ☆13 · Apr 12, 2024 · Updated last year
- Neural model for prediction of stress position in Russian words ☆13 · Jun 22, 2025 · Updated 8 months ago
- Code for the Cartoon Set webpage. ☆15 · May 21, 2020 · Updated 5 years ago
- A Voice Activity Detector rust library using the Silero VAD model. ☆62 · Aug 4, 2025 · Updated 7 months ago
- StyleTTS 2 Optimized Training Fork ☆33 · Feb 2, 2025 · Updated last year
- an opinionated Wayland clipboard manager ☆14 · Oct 24, 2025 · Updated 4 months ago
- Whisper.cpp with diarization ☆19 · Nov 18, 2024 · Updated last year
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription. ☆13 · Sep 27, 2024 · Updated last year
- Rust interface for the WebRTC Voice-Activity-Module ☆31 · Jul 16, 2020 · Updated 5 years ago
- Enhance internal site search with LLM ☆16 · Sep 9, 2024 · Updated last year
- Character-level conversion between Hebrew text and Latin transliteration using deep learning - a demonstration of seq2seq training. ☆14 · Jun 27, 2023 · Updated 2 years ago
- C++ version of pyannote audio overlapped speech detection pipeline ☆13 · Feb 14, 2024 · Updated 2 years ago
- ☆13 · Dec 7, 2022 · Updated 3 years ago
- A modified servo browser which accepts content patches over an IPC channel ☆11 · Apr 26, 2016 · Updated 9 years ago
- ☆35 · Feb 10, 2026 · Updated 3 weeks ago
- top notch GIFs, right from your desktop ☆58 · Sep 1, 2024 · Updated last year
- Add Arabic diacritics (tashkeel/harakat) using Rust/Python/C++/WASM and NLP models ☆48 · Oct 4, 2025 · Updated 5 months ago