AGI-Arena / MARS
The official implementation of MARS: Unleashing the Power of Variance Reduction for Training Large Models
☆604 Updated 2 months ago
Alternatives and similar repositories for MARS:
Users who are interested in MARS are comparing it to the repositories listed below:
- A toolkit that enhances PyTorch with specialized functions for low-bit quantized neural networks. ☆198 Updated 9 months ago
- [CVPR 2025 Highlight] Official code for "Olympus: A Universal Task Router for Computer Vision Tasks" ☆287 Updated last week
- Codebase for rectified flow ☆126 Updated last month
- [NeurIPS2024] Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging ☆133 Updated last month
- ☆160 Updated 6 months ago
- [NeurIPS'24] Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy ☆64 Updated 2 months ago
- ☆153 Updated last year
- Efficient DiT architecture for text2any tasks, ICLR2025 ☆421 Updated 2 months ago
- Mulberry, an o1-like Reasoning and Reflection MLLM Implemented via Collective MCTS ☆1,170 Updated 3 weeks ago
- Official implementation of paper "Multi-Level Collaboration in Model Merging" ☆40 Updated 3 weeks ago
- We introduce temporal working memory (TWM), which aims to enhance the temporal modeling capabilities of Multimodal foundation models (MFM… ☆307 Updated 2 months ago
- R1-like Computer-use Agent ☆67 Updated 3 weeks ago
- Efficient ControlNet for DiTs ☆198 Updated this week
- A tiny PyTorch structure for learning ☆56 Updated 9 months ago
- ☆207 Updated 2 weeks ago
- ☆135 Updated last week
- Official Repository for Paper: The Curse of CoT: On the Limitations of Chain-of-Thought in In-Context Learning ☆47 Updated this week
- Intervening Anchor Token: Decoding Strategy in Alleviating Hallucinations for MLLMs ☆154 Updated last month
- Residual Kolmogorov-Arnold Network (RKAN) is designed to enhance the performance of classic CNNs by incorporating RKAN blocks into existi… ☆261 Updated last month
- Accelerate your Stable Diffusion inference with the library's universal C/C++ framework design, powered by ONNXRuntime & across platforms… ☆446 Updated 8 months ago
- Official repository of MMGenBench ☆119 Updated last month
- Learning from Teaching Regularization: Generalizable Correlations Should be Easy to Imitate (NeurIPS 2024) ☆31 Updated last year
- [NeurIPS 2024] Matryoshka Query Transformer for Large Vision-Language Models ☆104 Updated 9 months ago
- A curated list of papers, code and resources pertaining to image composition/compositing or object insertion/addition/compositing, which … ☆498 Updated 2 weeks ago
- JittorGeometric is a Jittor-based graph machine learning library. ☆153 Updated this week
- Improving Generalist Model with Domain-Specific Experts ☆85 Updated 3 months ago
- DPO-Shift: Shifting the Distribution of Direct Preference Optimization ☆42 Updated last month
- ☆514 Updated last month
- [NeurIPS2024] MVGamba: Unify 3D Content Generation as State Space Sequence Modeling ☆57 Updated 4 months ago
- ☆421 Updated 7 months ago