EvolvingLMMs-Lab/Evolving-Visual-Generation

[Roadmap] Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling

☆125

Alternatives and similar repositories for Evolving-Visual-Generation

Users who are interested in Evolving-Visual-Generation are comparing it to the repositories listed below.

amiya-special / AutoMIA
☆15 · Apr 3, 2026 · Updated 3 months ago
Carol-lyh / GateControl
☆22 · Apr 3, 2026 · Updated 3 months ago
Lexiang-Xiong / CAD
[ECCV 2026] Anatomy of a Lie: A Multi-Stage Diagnostic Framework for Tracing Hallucinations in Vision-Language Models
☆28 · Jun 20, 2026 · Updated last month
haiquanlu / Mix-Quant
☆37 · May 21, 2026 · Updated 2 months ago
YinBo0927 / FATE
The official code of On-Policy Self-Evolution via Failure Trajectories for Agentic Safety Alignment
☆25 · May 13, 2026 · Updated 2 months ago
UniX-AI-Lab / WorldReasonBench
WorldReasonBench: Human-Aligned Stress Testing of Video Generators as Future World-State Predictors
☆22 · May 19, 2026 · Updated 2 months ago
LiQiiiii / BadWAM
[arXiv] BadWAM: When World-Action Models Dream Right but Act Wrong
☆43 · Jul 17, 2026 · Updated last week
SuhZhang / GeoSR
The code for the paper 'Make Geometry Matter for Spatial Reasoning'
☆53 · Updated this week
bigglesworthnotacat / LLM-Steg
[ICLR 2026 Oral] Invisible Safety Threat: Malicious Finetuning for LLM via Steganography
☆20 · Mar 22, 2026 · Updated 4 months ago
LiQiiiii / Awesome-VLA-Safety
[arXiv] Vision-Language-Action Safety: Threats, Challenges, Evaluations, and Mechanisms
☆125 · Jul 13, 2026 · Updated last week
Lexie-YU / ViFeEdit
[Preprint] ViFeEdit: A Video-Free Tuner of Your Video Diffusion Transformer
☆67 · Mar 31, 2026 · Updated 3 months ago
YinBo0927 / RePro
The official code of Refinement Provenance Inference: Detecting LLM-Refined Training Prompts from Model Behavior
☆22 · Jan 6, 2026 · Updated 6 months ago
tsa18 / ConciseHint
[Preprint, arXiv:2506.18810] ConciseHint: Boosting Efficient Reasoning via Continuous Concise Hints during Generation
☆26 · Oct 1, 2025 · Updated 9 months ago
LiQiiiii / Video-Metaphorical-Understanding
[arXiv] ViMU: Benchmarking Video Metaphorical Understanding
☆56 · May 16, 2026 · Updated 2 months ago
czg1225 / DMax
DMax: Aggressive Parallel Decoding for dLLMs
☆127 · Jul 5, 2026 · Updated 2 weeks ago
Yuanshi9815 / ViBT
Vision Bridge Transformer at Scale
☆147 · Dec 1, 2025 · Updated 7 months ago
EvolvingLMMs-Lab / OpenMMReasoner
[CVPR 2026] OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe
☆164 · Mar 30, 2026 · Updated 3 months ago
VainF / In-Video-Instructions
[arXiv 2025] In-Video Instructions: Visual Signals as Generative Control
☆45 · Nov 25, 2025 · Updated 8 months ago
LiQiiiii / Sponge-Tool-Attack
[ICML'26] Sponge Tool Attack: Stealthy Denial-of-Efficiency against Tool-Augmented Agentic Reasoning
☆28 · Jul 7, 2026 · Updated 2 weeks ago
XIAO4579 / PRISM
Beyond SFT-to-RL: Pre-alignment via Black-Box On-Policy Distillation for Multimodal RL
☆97 · May 6, 2026 · Updated 2 months ago
yu-rp / Dimple
Dimple, the first Discrete Diffusion Multimodal Large Language Model
☆117 · Jul 9, 2025 · Updated last year
fscdc / dVoting
[arXiv 2026] dVoting: Fast Voting for dLLMs
☆30 · Feb 13, 2026 · Updated 5 months ago
EvolvingLMMs-Lab / ParaVT
ParaVT: Taming the Tool Prior Paradox for Parallel Tool Use in Agentic Video Reinforcement Learning
☆54 · Jun 2, 2026 · Updated last month
czg1225 / VeriThinker
[NeurIPS 2025] VeriThinker: Learning to Verify Makes Reasoning Model Efficient
☆67 · Sep 27, 2025 · Updated 9 months ago
world-action-models / awesome-world-action-models
☆302 · Jun 23, 2026 · Updated last month
INV-WZQ / SparseD
[ICLR 2026] SparseD: Sparse Attention for Diffusion Language Models
☆67 · Feb 22, 2026 · Updated 5 months ago
czg1225 / dParallel
[ICLR 2026] dParallel: Learnable Parallel Decoding for dLLMs
☆65 · Apr 12, 2026 · Updated 3 months ago
G-U-N / UniRL
[ICML 2026] A unified reinforcement learning toolbox for joint RL on language models and diffusion models
☆91 · May 26, 2026 · Updated last month
Yuanshi9815 / LiteFocus
[Interspeech 2024] LiteFocus is a tool designed to accelerate diffusion-based TTA models, now implemented with AudioLDM2 as the base model.
☆34 · Mar 11, 2025 · Updated last year
zlab-princeton / vero
Vero: An Open RL Recipe for General Visual Reasoning
☆134 · Jun 19, 2026 · Updated last month
NVlabs / AnyFlow
Flow Map OPD for AnyStep Video Diffusion
☆399 · May 23, 2026 · Updated 2 months ago
langmanbusi / CoCoEdit
[ICML 2026] Official PyTorch implementation of paper “CoCoEdit: Content-Consistent Image Editing via Region Regularized Reinforcement Lea…
☆26 · Jun 14, 2026 · Updated last month
Biangbiang0321 / SpotEdit
SpotEdit: Selective Region Editing in Diffusion Transformers
☆196 · Jul 8, 2026 · Updated 2 weeks ago
florinshen / Vista3D
[ECCV 2024] Vista3D: Unravel the 3D Darkside of a Single Image
☆57 · Sep 19, 2024 · Updated last year
Adamdad / vico
Vico: Compositional Video Generation as Flow Equalization
☆59 · Nov 15, 2024 · Updated last year
Vchitect / RealDPO
☆32 · Dec 17, 2025 · Updated 7 months ago
EvolvingLMMs-Lab / sae
A framework that allows you to apply Sparse AutoEncoders to any model
☆53 · Jul 11, 2025 · Updated last year
Huage001 / LinFusion
Official PyTorch and Diffusers Implementation of "LinFusion: 1 GPU, 1 Minute, 16K Image"
☆317 · Dec 23, 2024 · Updated last year
JaydenLyh / Reward-Forcing
[CVPR 2026 Highlight] Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillation
☆352 · Dec 15, 2025 · Updated 7 months ago