☆318May 6, 2026Updated last month
Alternatives and similar repositories for mammothmoda
Users who are interested in mammothmoda are comparing it to the libraries listed below.
- "Roll with the Punches: Expansion and Shrinkage of Soft Label Selection for Semi-supervised Fine-Grained Learning" by Yue Duan (AAAI 2024…☆13Nov 20, 2025Updated 7 months ago
- [ACL 2026 Oral] From Word to World: Can Large Language Models be Implicit Text-based World Models?☆63Apr 13, 2026Updated 2 months ago
- ArcFlow: Unleashing 2-Step Text-to-Image Generation via High-Precision Non-Linear Flow Distillation☆126May 20, 2026Updated last month
- [CVPR 2026] UnicEdit-10M and UnicBench project☆42Mar 3, 2026Updated 4 months ago
- Metaskill: A Meta-Skill for Autonomous AI Agent Team Generation☆57Feb 23, 2026Updated 4 months ago
- [ICLR 2026] MMDuet2: Enhancing Proactive Interaction of Video MLLMs with Multi-Turn Reinforcement Learning☆38Jan 14, 2026Updated 5 months ago
- The implementation for FREE-Merging: Fourier Transform for Model Merging with Lightweight Experts (ICCV25)☆16Jun 26, 2025Updated last year
- [ECCV 2026] Generative Refinement Networks for Visual Synthesis (Support C2I & T2I & T2V)☆138Jun 27, 2026Updated last week
- [CVPR 2026] Official pytorch implementation of "ReDirector: Creating Any-Length Video Retakes with Rotary Camera Encoding"☆32Dec 17, 2025Updated 6 months ago
- We propose Bidirectional Evolutionary Search (BES), a search framework that couples forward candidate evolution with backward goal decomp…☆160May 28, 2026Updated last month
- The first unified, efficient, and extensible evaluation toolkit for evaluating image generation and editing models across multiple benchm…☆46Apr 12, 2026Updated 2 months ago
- [ICML 2026] Transform Trained Transformer for Accelerating Native 4K Video Generation☆41Dec 16, 2025Updated 6 months ago
- WoW (World-Omniscient World Model) is a generative world model trained on 2 million robotic interaction trajectories, designed to imagine…☆162Jan 4, 2026Updated 6 months ago
- Awesome Audio-Visual Intelligence, Survey of Audio-Visual Intelligence☆80May 8, 2026Updated last month
- helper functions for processing and integrating visual language information with Qwen-VL Series Model☆17Aug 30, 2024Updated last year
- Official repo for EscapeCraft (a 3D environment for room escape) and benchmark MM-Escape. This work is accepted by ICCV 2025.☆39Jul 7, 2025Updated 11 months ago
- Official codebase of the paper -- Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills☆172May 1, 2026Updated 2 months ago
- ☆22May 25, 2023Updated 3 years ago
- ☆534May 1, 2026Updated 2 months ago
- [ICML'26] Code and website for Self-Flow: Self-Supervised Flow Matching for Scalable Multi-Modal Synthesis☆519May 23, 2026Updated last month
- ☆86May 2, 2026Updated 2 months ago
- Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model☆13Dec 29, 2024Updated last year
- A plugin application that utilizes ComfyUI to generate 360-degree panoramic images. It primarily works by converting between flat images …☆21Jun 23, 2025Updated last year
- RLHF for Video Diffusion Models☆26Jul 30, 2025Updated 11 months ago
- Front end ComfyUI nodes for CartoonSegmentation☆17May 22, 2024Updated 2 years ago
- Multi-encoder segmentation for contrail detection in satellite imagery | Google Researc☆12Jan 28, 2026Updated 5 months ago
- This is a repo to track the latest autoregressive visual generation papers.☆430Jun 25, 2025Updated last year
- [ICRA 2026] StereoAdapter: Adapting Stereo Depth Estimation to Underwater Scenes☆28Feb 17, 2026Updated 4 months ago
- [Preprint] Self-Adversarial One Step Generation via Condition Shifting☆55Apr 15, 2026Updated 2 months ago
- python interface for mlc chat cli☆14May 7, 2023Updated 3 years ago
- D2IM-Net: learning detail disentangled implicit fields from single images☆26Nov 12, 2021Updated 4 years ago
- [NeurIPS'2025] "OWMM-Agent: Open World Mobile Manipulation With Multi-modal Agentic Data Synthesis"☆30Dec 4, 2025Updated 7 months ago
- ☆45Mar 6, 2026Updated 3 months ago
- An unofficial implementation of Lite-RTSE, a cost-effective lite model for real-time speech enhancement☆14Nov 19, 2023Updated 2 years ago
- The codes of our paper "EasyInv: Toward Fast and Better DDIM Inversion"☆14Jun 1, 2025Updated last year
- Extension to `F.grid_sample` that allows using batch index per grid point.☆19Jun 27, 2023Updated 3 years ago
- OpenVE-3M: A Large-Scale High-Quality Dataset for Instruction-Guided Video Editing☆50Apr 15, 2026Updated 2 months ago
- ☆54Nov 6, 2025Updated 7 months ago
- Semi-Supervised Fine-Grained Recognition Challenge at FGVC7☆31Mar 1, 2023Updated 3 years ago