asiff00 / Training-TTSLinks
Train and finutune text-to-speech models for Bengali and many other languages!
☆15Updated 6 months ago
Alternatives and similar repositories for Training-TTS
Users that are interested in Training-TTS are comparing it to the libraries listed below
Sorting:
- Fully Configurable RAG Pipeline for Bengali Language RAG Applications. Supports both Local and Huggingface Models, Built with Langchain.☆46Updated last year
- A simple, consistent and extendable toolkit for IndicTrans2. (Pypi: https://pypi.org/project/indictranstoolkit)☆37Updated 3 months ago
- Liquid Audio - Speech-to-Speech audio models by Liquid AI☆206Updated last month
- Fine tuned llama 3 models for context based question answering in bengali language.☆18Updated last year
- This is an on-CPU real-time conversational system for two-way speech communication with AI models, utilizing a continuous streaming archi…☆199Updated 6 months ago
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).☆45Updated last year
- A streaming whisper server for on-prem transcription☆22Updated last year
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆102Updated 10 months ago
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆45Updated 5 months ago
- Implementation of Sesame's Conversational Speech Model for Hugging Face Transformers☆57Updated 5 months ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆22Updated last year
- Roomey is a multi-purpose Voice Agent designed to run your personal and business life.☆50Updated 4 months ago
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.☆83Updated this week
- LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM☆287Updated 5 months ago
- This python module is an easy-to-use port of the text normalization used in the paper "Not low-resource anymore: Aligner ensembling, batc…☆34Updated last year
- Service for testing out the new Qwen2.5 omni model☆61Updated 6 months ago
- Train LLM on Hugging Face infra☆65Updated last month
- ☆12Updated 6 months ago
- Parkiet is a 1.6B parameter Dutch text-to-speech model (TTS)☆50Updated last month
- List of curated use cases built using Sesame's CSM 1B☆73Updated 5 months ago
- VoiceHub: A Unified Inference Interface for TTS Models☆56Updated 3 weeks ago
- Open TTS models, built for streaming on the edge☆43Updated 7 months ago
- ☆186Updated 2 weeks ago
- Efficient approach to speaker diarization using voice characteristics extraction☆102Updated 4 months ago
- Agentic RAG to help you build a startup🚀☆55Updated 6 months ago
- 🗣️ Real‑time, low‑latency voice, vision, and conversational‑memory AI assistant built on LiveKit and local LLMs ✨☆95Updated 4 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆68Updated last week
- Kyutai with an "eye"☆222Updated 7 months ago
- Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers include T5 and Whisper.☆29Updated 7 months ago
- Text Normalizer module use for Bangla as well as English digit convert to textual format, Normalize Date and Extract Date☆12Updated last month