AGI-Arena / MARSLinks

The official implementation of MARS: Unleashing the Power of Variance Reduction for Training Large Models

☆708

Alternatives and similar repositories for MARS

Users that are interested in MARS are comparing it to the libraries listed below

Sorting:

yuanze-lin / Olympus
[CVPR 2025 Highlight] Official code for "Olympus: A Universal Task Router for Computer Vision Tasks"
☆429Updated 2 months ago
D-Keqi / mtla
MTLA: Multi-head Temporal Latent Attention
☆664Updated last month
Zefan-Cai / R-KV
R-KV: Redundancy-aware KV Cache Compression for Reasoning Models
☆1,097Updated 3 weeks ago
GreenBitAI / bitorch-engine
A toolkit enhances PyTorch with specialized functions for low-bit quantized neural networks.
☆198Updated last year
LZY-the-boys / Twin-Merging
[NeurIPS2024] Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging
☆136Updated 4 months ago
Everlyn-Labs / Wasserstein-VQ
☆161Updated 9 months ago
Facico / GOAT-PEFT
[ICML2025] Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment
☆121Updated last month
trestad / Noisy-Rewards-in-Learning-to-Reason
☆103Updated 2 months ago
jiaweizzhao / InRank
☆154Updated last year
360CVGroup / RelaCtrl
Efficient controlnet for DiTs
☆381Updated 2 months ago
WisconsinAIVision / YoChameleon
🦎 Yo'Chameleon: Your Personalized Chameleon (CVPR 2025)
☆142Updated 2 months ago
dayuyang1999 / Awesome-Code-Reasoning
☆279Updated last month
360CVGroup / Qihoo-T2X
Efficient DiT architecture for text2any tasks, ICLR2025
☆452Updated 2 months ago
HJYao00 / Mulberry
Mulberry, an o1-like Reasoning and Reflection MLLM Implemented via Collective MCTS
☆1,208Updated 4 months ago
LiQiiiii / Neural-Ligand
[ICCV2025] Official implementation of paper "Towards Performance Consistency in Multi-Level Model Collaboration"
☆41Updated last month
xid32 / SoundMind
We introduce the Audio Logical Reasoning (ALR) dataset, consisting of 6,446 text-audio annotated samples specifically designed for comple…
☆923Updated 3 weeks ago
xid32 / NAACL_2025_TWM
We introduce temporal working memory (TWM), which aims to enhance the temporal modeling capabilities of Multimodal foundation models (MFM…
☆310Updated 6 months ago
Everlyn-Labs / ANTRP
Intervening Anchor Token: Decoding Strategy in Alleviating Hallucinations for MLLMs
☆157Updated 4 months ago
Alpha-Innovator / AdaptiveDiffusion
[NeurIPS'24] Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy
☆70Updated 6 months ago
jincan333 / LoT
Learning from Teaching Regularization: Generalizable Correlations Should be Easy to Imitate (NeurIPS 2024)
☆31Updated last year
HKUST-KnowComp / CoT-ICL-Eval
Official Repository for Paper: The Curse of CoT: On the Limitations of Chain-of-Thought in In-Context Learning
☆51Updated 3 months ago
microsoft / TimeCraft
Official code for TimeCraft: A Time Series Generation Framework for Real-World Applications
☆570Updated 2 weeks ago
AlgRUC / JittorGeometric
JittorGeometric is a Jittor-based graph machine learning library.
☆160Updated last week
fudan-generative-vision / DicFace
[ICCV2025 Highlight] DicFace: Dirichlet-Constrained Variational Codebook Learning for Temporally Coherent Video Face Restoration
☆429Updated last week
Meaquadddd / DPO-Shift
DPO-Shift: Shifting the Distribution of Direct Preference Optimization
☆60Updated 5 months ago
WaveSpeedAI / agent-mcp-lab
☆216Updated 2 months ago
Tencent-Hunyuan / ArtifactsBenchmark
☆221Updated this week
WaveSpeedAI / idea2product
☆279Updated last month
EMI-Group / evomo
EvoMO is a GPU-accelerated library for evolutionary multiobjective optimization (EMO)
☆114Updated last month
magic-YuanTian / Selective-Prompt-Anchoring
Selective Prompt Anchoring
☆68Updated last week