☆226May 6, 2026Updated 2 weeks ago
Alternatives and similar repositories for mammothmoda
Users that are interested in mammothmoda are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [Ultra Powerful Few-Step Diffusion RL] TDM-R1: Reinforcing Few-Step Diffusion Models with Non-Differentiable Reward☆94Mar 11, 2026Updated 2 months ago
- [ACL 2026] From Word to World: Can Large Language Models be Implicit Text-based World Models?☆61Apr 13, 2026Updated last month
- Metaskill: A Meta-Skill for Autonomous AI Agent Team Generation☆37Feb 23, 2026Updated 3 months ago
- Generative Refinement Networks for Visual Synthesis☆101May 13, 2026Updated last week
- [CVPR 2026] UnicEdit-10M and UnicBench project☆41Mar 3, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [ICLR 2026] MMDuet2: Enhancing Proactive Interaction of Video MLLMs with Multi-Turn Reinforcement Learning☆35Jan 14, 2026Updated 4 months ago
- The implementation for FREE-Merging: Fourier Transform for Model Merging with Lightweight Experts (ICCV25)☆15Jun 26, 2025Updated 10 months ago
- Awesome Audio-Visual Intelligence, Survey of Audio-Visual Intelligence☆64May 8, 2026Updated 2 weeks ago
- The first unified, efficient, and extensible evaluation toolkit for evaluating image generation and editing models across multiple benchm…☆43Apr 12, 2026Updated last month
- [ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models☆18Nov 4, 2025Updated 6 months ago
- [CVPR 2026] Official pytorch implementation of "ReDirector: Creating Any-Length Video Retakes with Rotary Camera Encoding"☆29Dec 17, 2025Updated 5 months ago
- WoW (World-Omniscient World Model) is a generative world model trained on 2 million robotic interaction trajectories, designed to imagine…☆156Jan 4, 2026Updated 4 months ago
- JoVA: Unified Multimodal Learning for Joint Video-Audio Generation☆33Dec 22, 2025Updated 5 months ago
- [ICML 2026] Transform Trained Transformer for Accelerating Native 4K Video Generation☆39Dec 16, 2025Updated 5 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A plugin application that utilizes ComfyUI to generate 360-degree panoramic images. It primarily works by converting between flat images …☆18Jun 23, 2025Updated 10 months ago
- helper functions for processing and integrating visual language information with Qwen-VL Series Model☆17Aug 30, 2024Updated last year
- Boost segmentation model mIoU/Dice instantly WITHOUT retraining. A plug-and-play, training-free optimization module. Published in NeurIPS…☆64Apr 30, 2026Updated 3 weeks ago
- Official repo for EscapeCraft (an 3D environment for room escape) and benchmark MM-Escape. This work is accepted by ICCV 2025.☆39Jul 7, 2025Updated 10 months ago
- ☆492May 1, 2026Updated 3 weeks ago
- ☆22May 25, 2023Updated 2 years ago
- Code and website for Self-Flow: Self-Supervised Flow Matching for Scalable Multi-Modal Synthesis☆474Mar 15, 2026Updated 2 months ago
- ☆82May 2, 2026Updated 2 weeks ago
- Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model☆13Dec 29, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Front end ComfyUI nodes for CartoonSegmentation☆17May 22, 2024Updated 2 years ago
- Multi-encoder segmentation for contrail detection in satellite imagery | Google Researc☆12Jan 28, 2026Updated 3 months ago
- This is a repo to track the latest autoregressive visual generation papers.☆431Jun 25, 2025Updated 10 months ago
- [Preprint] Self-Adversarial One Step Generation via Condition Shifting☆52Apr 15, 2026Updated last month
- D2IM-Net: learning detail disentangled implicit fields from single images☆26Nov 12, 2021Updated 4 years ago
- Official Repo of "D-OPSD: On-Policy Self-Distillation for Continuously Tuning Step-Distilled Diffusion Models"☆137May 13, 2026Updated last week
- [NeurIPS'2025] "OWMM-Agent: Open World Mobile Manipulation With Multi-modal Agentic Data Synthesis"☆30Dec 4, 2025Updated 5 months ago
- Extension to `F.grid_sample` that allows using batch index per grid point.☆19Jun 27, 2023Updated 2 years ago
- An EinSum system in JAX☆41Mar 6, 2026Updated 2 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆54Nov 6, 2025Updated 6 months ago
- ☆67Jan 4, 2026Updated 4 months ago
- OpenVE-3M: A Large-Scale High-Quality Dataset for Instruction-Guided Video Editing☆46Apr 15, 2026Updated last month
- Official repository of SoftREPA: Aligning Text to Image in Diffusion Models is Easier Than You Think☆23Jun 5, 2025Updated 11 months ago
- Official repository for “PixelGen: Improving Pixel Diffusion with Perceptual Loss”☆248May 12, 2026Updated last week
- [ICLR 2026] UniVideo: Unified Understanding, Generation, and Editing for Videos☆517Feb 11, 2026Updated 3 months ago
- [CVPR'25] Attention IoU: Examining Biases in CelebA using Attention Maps☆13Mar 26, 2025Updated last year