BazedFrog / SongGeneration-StudioLinks
Clean, polished interface for Tencent’s SongGeneration. Create songs from text prompts or reference audio, with batch processing and smart model selection. Minimum Requirement: 10GB of VRAM
☆344Updated 2 weeks ago
Alternatives and similar repositories for SongGeneration-Studio
Users that are interested in SongGeneration-Studio are comparing it to the libraries listed below
Sorting:
- This project is a collection of Docker-based web user interfaces designed to easily run various state-of-the-art generative AI models loc…☆399Updated 3 weeks ago
- Unlimited-length talking video generation that supports image-to-video and video-to-video generation☆130Updated 5 months ago
- A Gradio-based web UI for voice cloning and voice design, powered by Qwen3-TTS & VibeVoice. Can use Whisper or VibeVoice-ASR for automat…☆276Updated last week
- Nanobanana fal AI powered Photoshop-esque Studio☆334Updated 2 months ago
- A high-quality rapid TTS voice cloning model that reaches speeds of 150x realtime.☆632Updated last week
- Enable AI models for video production in the browser☆790Updated 3 months ago
- ☆161Updated 2 weeks ago
- Extract any sound with text prompts. Memory-optimized SAM-Audio with modern UI.☆323Updated last month
- Human-taught Computer-use Agent Designed for Real Windows and MacOS Desktops.☆174Updated 2 weeks ago
- Some music tools in ComfyUI☆118Updated last month
- PromptForge is a visual prompt management system for AI image generation. It provides an intuitive interface to organize, browse, and man…☆199Updated last month
- ☆105Updated last week
- ComfyDeployed☆440Updated 4 months ago
- VLLM Port of the Chatterbox TTS model☆364Updated 3 months ago
- feature-rich web interface designed to interact with a local ComfyUI☆75Updated last month
- 🍌 Create LoRA training datasets for Flux 2, Z-Image, Qwen Image Edit & more! Uses FAL.ai + Nano Banana Pro. 100% browser-based, no serve…☆131Updated last month
- Chain apps and models to build robust AI workflows 🤗☆424Updated this week
- A ComfyUI custom node suite for Qwen3-TTS, supporting 1.7B and 0.6B models, Custom Voice, Voice Design, Voice Cloning and Fine-Tuning.☆153Updated last week
- ☆208Updated last month
- Controllable and fast Text-to-Speech for over 7000 languages!☆323Updated 7 months ago
- A fully autonomous AI Agent/Python pipeline that utilizes Large Language Models (LLMs) like Gemini to generate content, produce videos, a…☆220Updated 3 months ago
- Self-host the ultra-lightweight Kitten TTS model with this enhanced API server with an intuitive Web UI, large text processing for audiob…☆234Updated 6 months ago
- The GPT-4o image generation we have at home. A powerful, self-hosted AI photo stylizer built for performance and privacy.☆486Updated 7 months ago
- ComfyUI node for highly expressive speech and realistic zero-shot voice cloning☆380Updated last month
- ClaudeCode Workflow Studio☆418Updated last month
- Generate AI-powered gaze-tracking face images and create interactive React components that follow the cursor.☆341Updated 3 months ago
- PersonaLive! : Expressive Portrait Image Animation for Live Streaming☆1,612Updated last month
- Free and open node based generative workflows.☆815Updated this week
- A lightweight recreation of OS1/Samantha from the movie Her, running locally in the browser☆115Updated 7 months ago
- Open-source clone of the MidJourney web interface featuring real AI image and video generation powered by Google's Gemini SDK. Use Imagen…☆226Updated 6 months ago