☆314May 6, 2026Updated last month
Alternatives and similar repositories for mammothmoda
Users that are interested in mammothmoda are comparing it to the libraries listed below.
- [ICML 2026][Ultra Powerful Few-Step Diffusion RL] TDM-R1: Reinforcing Few-Step Diffusion Models with Non-Differentiable Reward☆103May 25, 2026Updated 2 weeks ago
- [ACL 2026 Oral] From Word to World: Can Large Language Models be Implicit Text-based World Models?☆62Apr 13, 2026Updated 2 months ago
- [CVPR 2026] UnicEdit-10M and UnicBench project☆41Mar 3, 2026Updated 3 months ago
- Metaskill: A Meta-Skill for Autonomous AI Agent Team Generation☆50Feb 23, 2026Updated 3 months ago
- [ICLR 2026] MMDuet2: Enhancing Proactive Interaction of Video MLLMs with Multi-Turn Reinforcement Learning☆35Jan 14, 2026Updated 5 months ago
- The implementation for FREE-Merging: Fourier Transform for Model Merging with Lightweight Experts (ICCV25)☆15Jun 26, 2025Updated 11 months ago
- Official repository for VideoAR☆28Jan 14, 2026Updated 4 months ago
- [ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models☆18Nov 4, 2025Updated 7 months ago
- Evaluation Tool for Anomaly Detection Research☆17May 9, 2024Updated 2 years ago
- Generative Refinement Networks for Visual Synthesis (Support C2I & T2I & T2V)☆127Updated this week
- [CVPR 2026] Official pytorch implementation of "ReDirector: Creating Any-Length Video Retakes with Rotary Camera Encoding"☆30Dec 17, 2025Updated 5 months ago
- The first unified, efficient, and extensible evaluation toolkit for evaluating image generation and editing models across multiple benchm…☆46Apr 12, 2026Updated 2 months ago
- WoW (World-Omniscient World Model) is a generative world model trained on 2 million robotic interaction trajectories, designed to imagine…☆160Jan 4, 2026Updated 5 months ago
- [ICML 2026] Transform Trained Transformer for Accelerating Native 4K Video Generation☆41Dec 16, 2025Updated 5 months ago
- Awesome Audio-Visual Intelligence, Survey of Audio-Visual Intelligence☆77May 8, 2026Updated last month
- Official repo for EscapeCraft (a 3D environment for room escape) and benchmark MM-Escape. This work is accepted by ICCV 2025.☆39Jul 7, 2025Updated 11 months ago
- Boost segmentation model mIoU/Dice instantly WITHOUT retraining. A plug-and-play, training-free optimization module. Published in NeurIPS…☆71Jun 4, 2026Updated last week
- Official codebase of the paper -- Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills☆127May 1, 2026Updated last month
- ☆22May 25, 2023Updated 3 years ago
- ☆525May 1, 2026Updated last month
- [ICML'26] Code and website for Self-Flow: Self-Supervised Flow Matching for Scalable Multi-Modal Synthesis☆508May 23, 2026Updated 3 weeks ago
- Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model☆13Dec 29, 2024Updated last year
- A plugin application that utilizes ComfyUI to generate 360-degree panoramic images. It primarily works by converting between flat images …☆20Jun 23, 2025Updated 11 months ago
- RLHF for Video Diffusion Models☆26Jul 30, 2025Updated 10 months ago
- Multi-encoder segmentation for contrail detection in satellite imagery | Google Researc☆12Jan 28, 2026Updated 4 months ago
- This is a repo to track the latest autoregressive visual generation papers.☆430Jun 25, 2025Updated 11 months ago
- [ICRA 2026] StereoAdapter: Adapting Stereo Depth Estimation to Underwater Scenes☆25Feb 17, 2026Updated 3 months ago
- D2IM-Net: learning detail disentangled implicit fields from single images☆26Nov 12, 2021Updated 4 years ago
- [NeurIPS'2025] "OWMM-Agent: Open World Mobile Manipulation With Multi-modal Agentic Data Synthesis"☆30Dec 4, 2025Updated 6 months ago
- (ToCa-v2) A new version of ToCa, with faster speed and better acceleration!☆42Mar 13, 2025Updated last year
- Official repository for "Pre- to Post-Contrast Breast MRI Synthesis for Enhanced Tumour Segmentation"☆13Jan 31, 2024Updated 2 years ago
- An unofficial implementation of Lite-RTSE, a cost-effective lite model for real-time speech enhancement☆14Nov 19, 2023Updated 2 years ago
- Joint geodesic upsampling☆12Jan 16, 2018Updated 8 years ago
- OpenVE-3M: A Large-Scale High-Quality Dataset for Instruction-Guided Video Editing☆48Apr 15, 2026Updated last month
- This is the official code repository for the paper: Towards General Continuous Memory for Vision-Language Models.☆29Jul 3, 2025Updated 11 months ago
- ☆54Nov 6, 2025Updated 7 months ago
- ☆13Jul 11, 2025Updated 11 months ago
- Official repository of SoftREPA: Aligning Text to Image in Diffusion Models is Easier Than You Think☆24Jun 5, 2025Updated last year
- ☆14Oct 11, 2023Updated 2 years ago