taresh18 / Image-Colorization
Colorizing Black & White images using GAN
☆11 Updated 2 years ago
Alternatives and similar repositories for Image-Colorization
Users who are interested in Image-Colorization are comparing it to the repositories listed below.
- This is an on-CPU real-time conversational system for two-way speech communication with AI models, utilizing a continuous streaming archi… ☆215 Updated 3 weeks ago
- ☆370 Updated last month
- SoTA open-source TTS ☆23 Updated 5 months ago
- ☆289 Updated 4 months ago
- Fork of "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" ☆16 Updated last year
- Examples of using the llasa-tts models locally ☆182 Updated 8 months ago
- VibeVoice: Expressive, longform conversational speech synthesis. (Community fork) ☆870 Updated last week
- ☆532 Updated 2 months ago
- A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing emotion, speaking style, and paralinguistics… ☆786 Updated last week
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" ☆52 Updated last year
- SoTA open-source TTS ☆120 Updated 6 months ago
- ☆68 Updated 2 months ago
- ☆41 Updated 3 months ago
- ☆75 Updated 6 months ago
- Indic TTS for Indian Languages: This is a project on developing text-to-speech (TTS) synthesis systems for Indian languages, improving qu… ☆50 Updated last month
- A TTS model capable of generating ultra-realistic dialogue in one pass. ☆127 Updated 4 months ago
- AI Video dubbing / dubber / Video dubbing / AI dubbing AI 視訊配音 ☆66 Updated 4 months ago
- The official code repository for SongBloom: Coherent Song Generation via Interleaved Autoregressive Sketching and Diffusion Refinement ☆690 Updated 2 weeks ago
- A highly optimized engine for neutts-air model to generate minutes of audio in seconds. Over 200x realtime on modern hardware! ☆74 Updated 3 weeks ago
- Frontier Open-Source Text-to-Speech ☆95 Updated 3 months ago
- Unofficial WIP LoRa Finetuning repository for VibeVoice ☆290 Updated 2 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass. ☆219 Updated 7 months ago
- A ComfyUI node for Maya1, a 3B-parameter speech model built for expressive voice generation with rich human emotion and precise voice des… ☆51 Updated last month
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨ ☆130 Updated 4 months ago
- End-to-end speech-to-speech translation pipeline with voice cloning (RVC) and automatic lip-sync (Wav2Lip). ☆14 Updated last month
- A functioning Sesame CSM project with a desktop GUI - Real-time factor: 0.6x with 4070 Ti Super - Requires only 8GB VRAM ☆72 Updated 7 months ago
- finetune llm part for spark-tts model ☆112 Updated 8 months ago
- Automated speech dataset creator ☆212 Updated 6 months ago
- LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM ☆291 Updated 7 months ago
- ☆81 Updated last month