☆87Sep 24, 2025Updated 6 months ago
Alternatives and similar repositories for IndicF5
Users that are interested in IndicF5 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Make any person bald!! Component of the paper: Learning to regulate 3D head shape by removing occluding hair from in-the-wild images.☆12Jun 6, 2022Updated 3 years ago
- Indic TTS for Indian Languages: This is a project on developing text-to-speech (TTS) synthesis systems for Indian languages, improving qu…☆53Feb 5, 2026Updated last month
- Fork of "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆17Nov 27, 2024Updated last year
- A open-source framework designed to adapt pre-trained Language Models (LLMs), such as Llama, Mistral, and Mixtral, to a wide array of dom…☆23May 27, 2024Updated last year
- The offical code of "Parameter-Efficient Learning for Text-to-Speech Accent Adaptation"☆13Aug 29, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆23Jun 5, 2025Updated 9 months ago
- Includes additional materials for the following keras.io blog post.☆12Jun 23, 2021Updated 4 years ago
- Text-to-Speech for languages of India☆345Nov 8, 2024Updated last year
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- 🫠 check your data, before you wreck your model☆16Aug 11, 2022Updated 3 years ago
- UMETTS: A Unified Framework for Emotional Text-to-Speech Synthesis with Multimodal Prompts☆42Jun 12, 2025Updated 9 months ago
- Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASR☆78Jun 8, 2025Updated 9 months ago
- ☆15May 14, 2025Updated 10 months ago
- FastAPI Implementation of Orpheus TTS streaming Chatbot☆28Jun 19, 2025Updated 9 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Translation models for 22 scheduled languages of India☆414Oct 3, 2025Updated 5 months ago
- ☆12Oct 24, 2017Updated 8 years ago
- Repository having the code and models from the paper: data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student traini…☆13Mar 18, 2024Updated 2 years ago
- Adaptive Multimodal Reasoning via Reinforcement Learning☆23Jan 11, 2026Updated 2 months ago
- ☆11May 12, 2024Updated last year
- Pytorch implementation of CS-Tacotron, a code-switching speech synthesis end-to-end generative TTS model.☆23Mar 14, 2019Updated 7 years ago
- Using AI based approach to detect illegal parking of vehicles (Cars) from an image. The model will receive an image of parked car through…☆11Jun 2, 2020Updated 5 years ago
- The Gaming Zone is a web application that provides you with a collection of classic retro games, including puzzle games, trivia games, bo…☆10Feb 11, 2020Updated 6 years ago
- A miniature version of Modal☆23Jun 11, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Expressive TTS Dataset for Assamese, Bengali, and Tamil.☆15Mar 6, 2025Updated last year
- Add controlnet preprocessor to ComfyUI☆17Aug 24, 2023Updated 2 years ago
- Simple text to phonemes converter for multiple languages☆20Nov 21, 2022Updated 3 years ago
- End-to-end speech-to-speech translation pipeline with voice cloning (RVC) and automatic lip-sync (Wav2Lip).☆26Updated this week
- Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis☆348Jul 21, 2025Updated 8 months ago
- [ACMMM2025] Official released code for ALLM4ADD☆36Oct 30, 2025Updated 4 months ago
- ☆28Nov 7, 2023Updated 2 years ago
- FBI: Finding Blindspots in LLM Evaluations with Interpretable Checklists☆31Aug 14, 2025Updated 7 months ago
- SGLang is a fast serving framework for large language models and vision language models.☆21May 22, 2025Updated 10 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Mispronunciation Detection using a pretrained and finetuned wav2vec2 model for phoneme recognition and diagnosis and feedback using large…☆51May 6, 2024Updated last year
- Official Implementation of LatentSwap:An Efficient Latent Code Mapping Framework for Face Swapping☆29Mar 21, 2025Updated last year
- Dataset release for Emotional TTS in Indian Accent☆40Sep 2, 2022Updated 3 years ago
- A simple voice conversion tool☆20Mar 10, 2022Updated 4 years ago
- We introduce the LLAMA1 Test Set, a comprehensive open-domain world knowledge QA dataset for evaluating question-answering systems. We pr…☆23Mar 14, 2024Updated 2 years ago
- [EMNLP 2025 Findings] Official code for EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion☆36Sep 9, 2025Updated 6 months ago
- Unofficial implementation of ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech☆20Feb 9, 2025Updated last year