multimodal-art-projection / AutoMVView external linksLinks
☆82Jan 4, 2026Updated last month
Alternatives and similar repositories for AutoMV
Users that are interested in AutoMV are comparing it to the libraries listed below
Sorting:
- Official Repository of paper: "MotionEdit: Benchmarking and Learning Motion-Centric Image Editing"☆57Jan 20, 2026Updated 3 weeks ago
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆13Jun 28, 2025Updated 7 months ago
- ☆76Dec 8, 2025Updated 2 months ago
- VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice☆61Jan 9, 2026Updated last month
- ☆28Jan 30, 2026Updated 2 weeks ago
- [ICLR 2025] Adaptive prompt tailored pruning of T2I diffusion models.☆15Feb 1, 2025Updated last year
- Agent-RRM: Exploring Reasoning Reward Model for Agents☆38Feb 4, 2026Updated last week
- ☆14Nov 1, 2022Updated 3 years ago
- ☆87Dec 30, 2025Updated last month
- ☆15May 30, 2025Updated 8 months ago
- "Generating Music Medleys via Music Puzzle Games", AAAI 2018☆18Nov 6, 2018Updated 7 years ago
- Inferix: A Block-Diffusion based Next-Generation Inference Engine for World Simulation☆107Feb 1, 2026Updated last week
- A Comprehensive Dataset for Advanced Image Generation and Editing}☆31Oct 2, 2025Updated 4 months ago
- MCP server + embedded terminal that lets Claude Code see and edit your ComfyUI workflows☆38Jan 31, 2026Updated 2 weeks ago
- ☆19May 7, 2025Updated 9 months ago
- Code for 'JUST-DUB-IT: Video Dubbing via Joint Audio-Visual Diffusion'☆38Updated this week
- This project is the official implementation of 'DreamOmni3: Scribble-based Editing and Generation''☆37Dec 30, 2025Updated last month
- SG-Bench: Evaluating LLM Safety Generalization Across Diverse Tasks and Prompt Types☆24Nov 29, 2024Updated last year
- paris - world's first decentralized trained open-weight diffusion model☆54Oct 7, 2025Updated 4 months ago
- ☆26Jun 22, 2024Updated last year
- FastSAG: Towards Fast Non-Autoregressive Singing Accompaniment Generation☆28Dec 19, 2024Updated last year
- [AAAI 2026] SlideTailor: Personalized Presentation Slide Generation for Scientific Papers☆42Jan 1, 2026Updated last month
- HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models☆53Nov 26, 2024Updated last year
- Complex-Edit: CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmark☆28Apr 22, 2025Updated 9 months ago
- ☆53Jan 25, 2026Updated 2 weeks ago
- ☆318Jan 24, 2026Updated 2 weeks ago
- ☆65Dec 16, 2025Updated last month
- This is an extension of SD-WEBUI-DISCORD on the Stable Diffusion WebUI, which supports distributed deployment of SD node's Stable Diffusi…☆25Oct 8, 2023Updated 2 years ago
- Official repo for paper "EMMA: Efficient Multimodal Understanding, Generation, and Editing with a Unified Architecture."☆61Dec 16, 2025Updated last month
- [NeurIPS 2025] Official code for Inference-Time Scaling for Flow Models via Stochastic Generation and Rollover Budget Forcing☆72Oct 12, 2025Updated 4 months ago
- Code for FreeTraj, a tuning-free method for trajectory-controllable video generation☆111Sep 19, 2025Updated 4 months ago
- FLM-Audio is a audio-language subversion of RoboEgo/FLM-Ego -- an omnimodal model with native full duplexity.☆61Dec 9, 2025Updated 2 months ago
- A Text2SQL benchmark for evaluation of Large Language Models☆41Updated this week
- AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents☆37Oct 7, 2025Updated 4 months ago
- OmniZip: Audio-Guided Dynamic Token Compression for Fast Omnimodal Large Language Models☆54Feb 1, 2026Updated last week
- Train LoRA using Microsoft's official implementation with Stable Diffusion models.☆33May 9, 2023Updated 2 years ago
- Official Repository for paper "HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding"☆57Jan 23, 2026Updated 3 weeks ago
- [NeurIPS 2025] A multimodal agent that can interact with its own PC in a multimodal manner.☆36Nov 10, 2025Updated 3 months ago
- This repo contains the python code as well as the webpage html files for the Spice-E project from VAILab at TAU.☆26Dec 9, 2024Updated last year