lerogo / MMGenBench
Official repository of MMGenBench
☆119 Updated 3 weeks ago
Alternatives and similar repositories for MMGenBench:
Users who are interested in MMGenBench are comparing it to the libraries listed below
- Intervening Anchor Token: Decoding Strategy in Alleviating Hallucinations for MLLMs ☆152 Updated 2 weeks ago
- Efficient controlnet for DiTs ☆86 Updated 2 weeks ago
- [NeurIPS'24] Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy ☆64 Updated 2 months ago
- ☆68 Updated last week
- ☆160 Updated 5 months ago
- [NeurIPS2024] MVGamba: Unify 3D Content Generation as State Space Sequence Modeling ☆57 Updated 3 months ago
- ☆207 Updated last month
- [MM 2024] Official code for VeCAF: Vision-language Collaborative Active Finetuning with Training Objective Awareness ☆46 Updated 8 months ago
- Wan2.1 with Controlnet ☆143 Updated this week
- Gotta Hear Them All: Sound Source Aware Vision to Audio Generation. ☆59 Updated 3 weeks ago
- [Arxiv 2024] Official code for Decomposing the Neurons: Activation Sparsity via Mixture of Experts for Continual Test Time Adaptation ☆63 Updated 8 months ago
- SurveyForge: On the Outline Heuristics, Memory-Driven Generation, and Multi-dimensional Evaluation for Automated Survey Writing ☆127 Updated last week
- Improving Generalist Model with Domain-Specific Experts ☆85 Updated 2 months ago
- Efficient DiT architecture for text2any tasks, ICLR2025 ☆399 Updated last month
- We introduce temporal working memory (TWM), which aims to enhance the temporal modeling capabilities of Multimodal foundation models (MFM… ☆308 Updated 2 months ago
- A curated list of papers, code and resources pertaining to image composition/compositing or object insertion/addition/compositing, which … ☆494 Updated last month
- This repo collects research papers that use AI tools and are in the field of scientific research (including computer science, agronomy, c… ☆87 Updated 2 weeks ago
- ☆153 Updated last year
- [NOSSDAV 2023] Official code for RepCaM: Re-parameterization Content-aware Modulation for Neural Video Delivery ☆50 Updated 8 months ago
- A from-scratch, hand-written reproduction of a diffusion model on the MNIST handwritten digit dataset ☆36 Updated this week
- ☆244 Updated 2 months ago
- LLM-FuzzX is a user-friendly fuzz testing tool for Large Language Models (e.g., GPT, Claude, LLaMA), featuring advanced task-aware mutati… ☆111 Updated 2 months ago
- [ICLR 2025] Ctrl-U: Robust Conditional Image Generation via Uncertainty-aware Reward Modeling ☆73 Updated last month
- 🔥 🔥 🔥 [NeurIPS 2024] Hawk: Learning to Understand Open-World Video Anomalies ☆189 Updated 3 weeks ago
- ☆93 Updated 2 months ago
- Run JavaScript code from Python. ☆101 Updated 3 weeks ago
- Residual Kolmogorov-Arnold Network (RKAN) is designed to enhance the performance of classic CNNs by incorporating RKAN blocks into existi… ☆261 Updated 3 weeks ago
- ☆27 Updated 4 months ago
- Official implementation of paper "Multi-Level Collaboration in Model Merging" ☆40 Updated 2 weeks ago
- Text-to-3D Generation by 2D Editing ☆63 Updated 2 weeks ago