Nick088Official / Stable_Diffusion_Finetuned_Minecraft_Skin_Generator
Generates Minecraft skins with a text prompt using the HuggingFace "monadical-labs/minecraft-skin-generator" model.
☆25Updated last month
Alternatives and similar repositories for Stable_Diffusion_Finetuned_Minecraft_Skin_Generator:
Users that are interested in Stable_Diffusion_Finetuned_Minecraft_Skin_Generator are comparing it to the libraries listed below
- ☆13Updated 7 months ago
- A TriposR implementation for WebUI☆53Updated 10 months ago
- Jupyter notebooks for Inpainting | Outpainting with Flux.1 Fill dev. Able to run on Google Colab Free Tier☆21Updated last month
- ☆11Updated 8 months ago
- Text-to-Music Generation with Rectified Flow Transformer☆56Updated 4 months ago
- Advanced RVC Inference for quicker and effortless model downloads☆36Updated this week
- ☆57Updated 4 months ago
- ☆78Updated last year
- A Lightweight Gradio Web interface for Text-to-Audio Generation utilising SAO1.0☆47Updated 7 months ago
- Using RVC via console or python scripts☆93Updated 3 months ago
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation☆39Updated 9 months ago
- Embedding-inspector extension for AUTOMATIC1111/stable-diffusion-webui☆21Updated last year
- ☆42Updated 2 months ago
- 🎵 LyricWave - Your AI Music Composer 🎶 Compose Unique MP4 Songs Effortlessly! LyricWave uses AI to create personalized music by harmoni…☆27Updated 9 months ago
- SD.Next ModernUI☆24Updated this week
- A system for Prompt generation to improve Text-to-Image performance.☆68Updated last week
- A node collection for sound design, supporting MusicGen and Stable Audio. Welcome to use and experience it.☆107Updated 6 months ago
- Retrieval-based Voice Conversion (RVC) implemented with Hugging Face Transformers.☆65Updated last year
- ☆38Updated 7 months ago
- ☆79Updated 6 months ago
- (Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on …☆79Updated this week
- Tweaked version of Mangio's fork of the Retrieval-based-Voice-Conversion WebUI☆10Updated last year
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models☆32Updated 2 months ago
- Recursively writes descriptions of video scenes using Large Language Models and Image Captioners☆13Updated 10 months ago
- ☆27Updated last year
- Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)☆54Updated 4 months ago
- This is a repository for "character images search" from image and tags.☆26Updated 2 months ago
- ☆19Updated 9 months ago