T-one is a high-performance streaming ASR pipeline for Russian, specialized for the telephony domain.
☆247Dec 30, 2025Updated 2 months ago
Alternatives and similar repositories for T-one
Users who are interested in T-one are comparing it to the libraries listed below
- ☆13Dec 7, 2022Updated 3 years ago
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆21Jun 7, 2025Updated 8 months ago
- ⚡ Blazing fast audio augmentation in Python, powered by GPU for high-efficiency processing in machine learning and audio analysis tasks.☆35Jan 19, 2024Updated 2 years ago
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆17May 20, 2025Updated 9 months ago
- RuTransform: python framework for adversarial attacks and text data augmentation for Russian☆19Jun 27, 2023Updated 2 years ago
- Russian phonetical transcription☆11Nov 19, 2025Updated 3 months ago
- Learning an Interpretable End-to-End Network for Real-Time Acoustic Beamforming☆15Aug 20, 2024Updated last year
- My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder☆12Mar 11, 2025Updated 11 months ago
- 🎵 muse: Music Separation☆11Feb 14, 2024Updated 2 years ago
- ☆11Nov 7, 2024Updated last year
- ☆11Dec 11, 2024Updated last year
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …☆28Nov 7, 2025Updated 3 months ago
- Arabic Grapheme-to-Phoneme (G2P) Conversion☆13Mar 15, 2025Updated 11 months ago
- Streaming Vocos☆30Jun 10, 2025Updated 8 months ago
- in-browser playground for mtcute!☆20Updated this week
- Foundational Model for Speech Recognition Tasks☆488Feb 12, 2026Updated 3 weeks ago
- Neural model for prediction of stress position in Russian words☆13Jun 22, 2025Updated 8 months ago
- ☆15Nov 11, 2024Updated last year
- Thai Grapheme to Phoneme (G2P) Wiktionary Corpus☆13Jul 25, 2022Updated 3 years ago
- A package of Russian-language dictionaries with support for the letters Е and Ё☆13Oct 4, 2018Updated 7 years ago
- Llama-Mimi is a speech language model that uses a unified tokenizer (Mimi) and a single Transformer decoder (Llama) to jointly model sequ…☆28Sep 20, 2025Updated 5 months ago
- ☆14Jun 16, 2023Updated 2 years ago
- Russian open TTS dataset☆17Nov 5, 2019Updated 6 years ago
- moved to https://git.stupid.fish/teidesu/tei.su☆12Dec 29, 2024Updated last year
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription.☆13Sep 27, 2024Updated last year
- ☆140May 21, 2025Updated 9 months ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆15May 16, 2025Updated 9 months ago
- ☆62Feb 1, 2026Updated last month
- Character-level conversion between Hebrew text and Latin transliteration using deep learning - a demonstration of seq2seq training.☆14Jun 27, 2023Updated 2 years ago
- ☆30Jan 22, 2026Updated last month
- C++ version of pyannote audio overlapped speech detection pipeline☆13Feb 14, 2024Updated 2 years ago
- ASR over WS and POST/GET with FAST_API; can use many Russian ASR models.☆18Jan 27, 2026Updated last month
- ☆23Jan 29, 2026Updated last month
- Telegram bot for different language models. Supports system prompts and images☆63Jun 26, 2025Updated 8 months ago
- Deep Learning for Speech☆107Dec 21, 2025Updated 2 months ago
- Russian accentuator and IPA transcriber☆16Sep 10, 2024Updated last year
- ☆19Jan 8, 2025Updated last year
- ☆34Jun 9, 2025Updated 8 months ago