Repository of lessons exploring image diffusion models, focused on understanding and education.
★64 · Jan 9, 2025 · Updated last year
Alternatives and similar repositories for mindiffusion
Users that are interested in mindiffusion are comparing it to the libraries listed below.
- Unofficial LivePortrait Training Script [🚧 Under Construction] ★39 · Jan 28, 2025 · Updated last year
- ★20 · Sep 11, 2024 · Updated last year
- ★17 · Jan 22, 2025 · Updated last year
- cpp inference for EmotiVoice ★16 · Jan 1, 2024 · Updated 2 years ago
- This is the experimental description of MnTTS2. ★12 · Apr 11, 2024 · Updated 2 years ago
- A curated collection of prompts for Grok Imagine by xAI ★28 · Oct 19, 2025 · Updated 6 months ago
- Create bounding boxes selecting masks by threshold. ★32 · May 22, 2024 · Updated last year
- Implementation of Adversarial Multi-Distillation for Automatic Modulation Recognition Models. ★27 · Nov 2, 2023 · Updated 2 years ago
- ★12 · Sep 25, 2024 · Updated last year
- Finally, some decent sample sentences ★23 · Dec 3, 2023 · Updated 2 years ago
- QLoRA: Efficient Finetuning of Quantized LLMs ★11 · Jul 22, 2023 · Updated 2 years ago
- [SIGGRAPH'24] CharacterGen: Efficient 3D Character Generation from Single Images with Multi-View Pose Canonicalization ★10 · Jul 13, 2024 · Updated last year
- Text to Image Latent Diffusion using a Transformer core ★224 · Aug 29, 2024 · Updated last year
- FlexiFilm: Long Video Generation with Flexible Conditions ★31 · May 1, 2024 · Updated 2 years ago
- ★13 · Oct 14, 2024 · Updated last year
- ★15 · Mar 1, 2022 · Updated 4 years ago
- [ECCV2024] Fast Sprite Decomposition from Animated Graphics ★31 · Sep 26, 2024 · Updated last year
- [WACV 2025] High-Fidelity Document Stain Removal via A Large-Scale Real-World Dataset and A Memory-Augmented Transformer ★22 · Jan 14, 2026 · Updated 3 months ago
- ★15 · May 13, 2024 · Updated last year
- A timegrapher for quartz watch using a standard soundcard and microphone ★18 · Dec 27, 2025 · Updated 4 months ago
- An open source community implementation of the model MELLE from the paper: "Autoregressive Speech Synthesis without Vector Quantization" ★14 · Apr 13, 2026 · Updated 3 weeks ago
- implementation of https://arxiv.org/pdf/2312.09299 ★21 · Jul 3, 2024 · Updated last year
- LCM Full Cycle Trainer for Ostris - Ai Toolkit ★16 · Aug 20, 2024 · Updated last year
- SpeechPlus: Small LLM-Based Text-to-Speech Library ★21 · May 20, 2025 · Updated 11 months ago
- ★15 · Oct 31, 2023 · Updated 2 years ago
- Diffusing States and Matching Scores: A New Framework for Imitation Learning ★22 · Nov 16, 2024 · Updated last year
- Python client for Jikan.moe, MyAnimeList unofficial API with good intentions. ★14 · Dec 20, 2022 · Updated 3 years ago
- A toy text-to-image model trained from scratch. ★19 · Jun 9, 2025 · Updated 11 months ago
- Vision Bridge Transformer at Scale ★142 · Dec 1, 2025 · Updated 5 months ago
- [ICLR 2025] Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching ★54 · Apr 21, 2025 · Updated last year
- Add text overlays to segmented objects in your images using AI. Powered by Meta's SAM2 for segmentation, running entirely in your brow… ★22 · Feb 15, 2025 · Updated last year
- interact with your robot in JS, inspired by LeRobot ★38 · Nov 14, 2025 · Updated 5 months ago
- One File Tensor Libraries ★31 · Oct 7, 2025 · Updated 7 months ago
- TalkVid: A Large-Scale Diversified Dataset for Audio-Driven Talking Head Synthesis ★169 · Jan 11, 2026 · Updated 3 months ago
- GitHub repository for the Bria 3.2 pipeline ★44 · Sep 10, 2025 · Updated 7 months ago
- [BMVC'24] G3FA: Geometry-guided GAN for Face Animation ★20 · Mar 14, 2025 · Updated last year
- Repository for “Anomaly Detection and Generation with Diffusion Models: A Survey”. ★38 · Jun 15, 2025 · Updated 10 months ago
- [INTERSPEECH'24] Official repository for "MultiTalk: Enhancing 3D Talking Head Generation Across Languages with Multilingual Video Datase… ★194 · Nov 5, 2024 · Updated last year
- Distilling Diversity and Control in Diffusion Models ★52 · Apr 28, 2025 · Updated last year