Простой IPA фонемизатор на базе ruaccent-encoder
☆24Apr 15, 2025Updated last year
Alternatives and similar repositories for ruphon
Users that are interested in ruphon are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Простой нормализатор текстов перед синтезом речи☆48May 13, 2024Updated 2 years ago
- Простой расстановщик ударений с обработкой омографов☆188Oct 23, 2024Updated last year
- An official implementation of Style-Talker for Spoken Dialogue Generation☆23Jan 12, 2025Updated last year
- Incremental Disentanglement for Environment-Aware Zero-Shot Text-to-Speech Synthesis☆27Mar 21, 2025Updated last year
- Target speaker automatic speech recognition (TS-ASR)☆14Oct 14, 2023Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- NVIDIA's FastPitch, extracted from the DeepLearningExamples repository☆14Mar 29, 2024Updated 2 years ago
- Streamable Text-to-Speech model using a language modeling approach, without vector quantization☆109May 20, 2025Updated last year
- StyleTTS 2 Optimized Training Fork☆32Feb 2, 2025Updated last year
- ☆18May 14, 2025Updated last year
- Normalize Text in Russian☆29Nov 7, 2023Updated 2 years ago
- Arabic Grapheme-to-Phoneme (G2P) Conversion☆14Mar 15, 2025Updated last year
- UDAR Does Accented Russian: A finite-state morphological analyzer of Russian that handles stressed wordforms.☆30Updated this week
- Speaker embedding for anime speech domain based on ECAPA_TDNN☆19Jun 22, 2025Updated 11 months ago
- Exaple of usage different features of USB interface on STM32☆19Apr 9, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 4-bit Shampoo for Memory-Efficient Network Training (NeurIPS 2024)☆13Feb 13, 2025Updated last year
- This is the official implementation for the paper "Pianist Transformer: Towards Expressive Piano Performance Rendering via Scalable Self-…☆36Mar 30, 2026Updated last month
- Neural Homomorphic Vocoder optimized for singing voice synthesis☆34May 2, 2026Updated 3 weeks ago
- Open TTS models, built for streaming on the edge☆45Mar 16, 2025Updated last year
- Rate-Adaptive Quantization: A Multi-Rate Codebook Adaptation for Vector Quantization-based Generative Models☆16Sep 10, 2025Updated 8 months ago
- ☆30Aug 25, 2021Updated 4 years ago
- Foundational Model for Speech Recognition Tasks☆580Apr 15, 2026Updated last month
- Drax: Speech Recognition with Discrete Flow Matching☆75Oct 15, 2025Updated 7 months ago
- ☆36Oct 23, 2025Updated 7 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Documentation site for fast-agent☆32May 10, 2026Updated 2 weeks ago
- Voice gender classifier using ECAPA-TDNN☆67Jan 24, 2025Updated last year
- Text To Speech Synthesis with Vosk☆262Mar 14, 2026Updated 2 months ago
- Russian speech technology links☆396Mar 17, 2026Updated 2 months ago
- This project is to train an RWKV LLM for TTS generation which compatible to other TTS engine(like fish/cosy/chattts).☆98Oct 8, 2025Updated 7 months ago
- A small rust-based data loader☆37Feb 20, 2026Updated 3 months ago
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Sep 17, 2025Updated 8 months ago
- Modified version of RusStress (https://github.com/MashaPo/russtress) — python package for placing stress in Russian text using RNN (BiLST…☆45Aug 7, 2024Updated last year
- Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification.☆26Mar 17, 2025Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- This is a soft fork of the Megalodon app, which itself is a fork of the official Mastodon Android app.☆14Jan 27, 2023Updated 3 years ago
- ArtSpeech: Adaptive Text-to-Speech Synthesis with Articulatory Representations☆21Sep 21, 2025Updated 8 months ago
- CDER (Conversational Diarization Error Rate) Scoring Tool☆22Sep 13, 2022Updated 3 years ago
- Access HTML and other pasteboards from JS and command line☆37Apr 24, 2022Updated 4 years ago
- A collection of all our phonemeizers for dataset construction and inference☆30Feb 21, 2025Updated last year
- HiFTNet wav/audio super-resolution 16/24 kHz to 48 kHz☆24Jan 2, 2024Updated 2 years ago
- NeMo: a toolkit for conversational AI☆12Dec 23, 2022Updated 3 years ago