ChenDarYen / NitroFusionLinks
NitroFusion: High-Fidelity Single-Step Diffusion through Dynamic Adversarial Training
☆288Updated 2 months ago
Alternatives and similar repositories for NitroFusion
Users that are interested in NitroFusion are comparing it to the libraries listed below
Sorting:
- ☆396Updated last week
- F Lite is a 10B parameter diffusion model created by Freepik and Fal, trained exclusively on copyright-safe and SFW content.☆403Updated 2 months ago
- ☆163Updated last year
- Official GitHub repository for FLUX.1 Krea [dev].☆322Updated 3 weeks ago
- A Next.js app for fast image generation with Flux on Replicate☆109Updated 10 months ago
- Live-bending a foundation model’s output at neural network level.☆266Updated 4 months ago
- BentoDiffusion: A collection of diffusion models served with BentoML☆373Updated 3 months ago
- Cog inference for flux models☆364Updated 3 weeks ago
- Use the Moondream 2 model to detect faces and their gaze directions in videos.☆44Updated 7 months ago
- Official implementation of "SketchDeco: Decorating B&W Sketches with Colour"☆56Updated 8 months ago
- A real-time silent speech recognition tool.☆533Updated 6 months ago
- Replace OpenAI with Llama.cpp Automagically.☆324Updated last year
- 3D to Photo is an open-source package by Dabble, that combines threeJS and Stable diffusion to build a virtual photo studio for product p…☆445Updated last year
- An AI focused photo manipulation tool based on Gradio☆186Updated last month
- Official implementation of SwiftSketch☆195Updated 3 months ago
- 🌍 WorldGen - Generate Any 3D Scene in Seconds☆694Updated 3 months ago
- Automated speech dataset creator☆186Updated 2 months ago
- Unlock Pose Diversity: Accurate and Efficient Implicit Keypoint-based Spatiotemporal Diffusion for Audio-driven Talking Portrait☆271Updated 3 weeks ago
- [ECCV 2024] Code for VFusion3D: Learning Scalable 3D Generative Models from Video Diffusion Models☆451Updated 11 months ago
- ☆89Updated 2 months ago
- Control 3D models using hand gestures and voice commands in real-time. Threejs / mediapipe computer vision☆197Updated 2 months ago
- Official implementation of "WhisperNER: Unified Open Named Entity and Speech Recognition"☆196Updated 5 months ago
- CVPR2025☆886Updated 3 months ago
- Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit☆783Updated last year
- A random walk voice style cloning application for Kokoro text to speech☆117Updated 2 months ago
- VLLM Port of the Chatterbox TTS model☆273Updated last week
- Qwen-Image-Lightning: Speed up Qwen-Image model with distillation☆391Updated last week
- ☆74Updated 2 weeks ago
- Cog wrapper for ostris/ai-toolkit + post-finetuning cog inference for flux models☆403Updated 2 months ago
- Mistral7B playing DOOM☆135Updated last year