omogr / omogre
Russian accentuator and IPA transcriber
☆15 Updated last year
Alternatives and similar repositories for omogre
Users who are interested in omogre are comparing it to the libraries listed below.
- Normalize Text in Russian ☆28 Updated last year
- Python client for the speech recognition and synthesis API of the ЦРТ Cloud (Облако ЦРТ) ☆11 Updated 2 years ago
- Simple IPA phonemizer based on ruaccent-encoder ☆24 Updated 5 months ago
- Simple audio AE ☆12 Updated 11 months ago
- 🎵 muse: Music Separation ☆10 Updated last year
- ☆13 Updated 2 years ago
- ☆43 Updated 4 months ago
- StyleTTS2 + Vocos as a Decoder ☆13 Updated 6 months ago
- A repository for trainable multi-speaker TTS ☆14 Updated 3 years ago
- Neural model for prediction of stress position in Russian words ☆11 Updated 3 months ago
- Python package of MP-SENet from Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement. ☆14 Updated 11 months ago
- C++ version of pyannote audio overlapped speech detection pipeline ☆13 Updated last year
- Package of Russian-language dictionaries with support for the letters Е and Ё ☆13 Updated 7 years ago
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices" ☆20 Updated 4 months ago
- A system for multi-user transcribing speech in audio files. ☆35 Updated 8 months ago
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨ ☆17 Updated 4 months ago
- ☆13 Updated 2 months ago
- ArtSpeech: Adaptive Text-to-Speech Synthesis with Articulatory Representations ☆20 Updated 3 weeks ago
- (WIP) A retrain of F5-TTS on permissively-licensed data ☆12 Updated 6 months ago
- Modified version of RusStress (https://github.com/MashaPo/russtress) — python package for placing stress in Russian text using RNN (BiLST… ☆40 Updated last year
- Text To Speech Synthesis with Vosk ☆218 Updated last month
- ☆25 Updated last year
- T5-based (Russian) text normalization ☆22 Updated last year
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using… ☆29 Updated 2 years ago
- Zero-shot voice cloning text-to-speech (TTS) with explicit emotion class conditioning built on F5-TTS ☆17 Updated last month
- The Vokan Architecture (Tsukasa speech based) ☆10 Updated 8 months ago
- DiFlow-TTS delivers low-latency zero-shot TTS via discrete flow matching and factorized speech tokens. A compact, open framework for fast… ☆43 Updated this week
- ☆43 Updated 2 months ago
- ☆17 Updated 4 years ago
- Transfer learning approach to pronunciation scoring ☆11 Updated last year