AGI-Arena / MARS
The official implementation of MARS: Unleashing the Power of Variance Reduction for Training Large Models
☆106Updated this week
Related projects ⓘ
Alternatives and complementary repositories for MARS
- SORSA: Singular Values and Orthonormal Regularized Singular Vectors Adaptation of Large Language Models☆46Updated this week
- Mixed precision inference by Tensorrt-LLM☆94Updated last month
- The repository for the paper titled "Leopard: A Vision Language Model For Text-Rich Multi-Image Tasks"☆184Updated 3 weeks ago
- [ECCV 2024] Efficient Inference of Vision Instruction-Following Models with Elastic Cache☆46Updated 3 months ago
- This is the official code repository of MoTCoder: Elevating Large Language Models with Modular of Thought for Challenging Programming Tas…☆63Updated 3 months ago
- LLM Benchmark for Code☆33Updated 3 months ago
- Support mixed-precsion inference with vllm☆97Updated 2 weeks ago
- An Extensible Framework for Retrieval-Augmented LLM Applications: Learning Relevance Beyond Simple Similarity.☆42Updated 3 weeks ago
- ☆67Updated this week
- Reverse Chain-of-Thought Problem Generation for Geometric Reasoning in Large Multimodal Models☆131Updated 2 weeks ago
- ☆14Updated 3 weeks ago
- ☆53Updated last month
- AvaTaR: Optimizing LLM Agents for Tool Usage via Contrastive Reasoning (NeurIPS 2024)☆170Updated this week
- Empower Your Model with Longer and Better Context Comprehention☆50Updated last year
- This tool(enhance_long) aims to enhance the LlaMa2 long context extrapolation capability in the lowest-cost approach, preferably without …☆47Updated 11 months ago
- A Comprehensive Benchmark for Code Information Retrieval.☆63Updated last month
- an Easy-to-use Tool for Comprehensive Response Evaluation of LLMs☆45Updated last month
- We leverage 14 datasets as OOD test data and conduct evaluations on 8 NLU tasks over 21 popularly used models. Our findings confirm that …☆117Updated last year
- Rethinking Video-Text Understanding Retrieval from Counterfactually Augmented Data☆48Updated 4 months ago
- [NeurIPS 2023] On Sparse Modern Hopfield Model☆57Updated 7 months ago
- An open-source library with a powerful Contrastive Language-and-Motion (CLaM) pre-training evaluator☆127Updated 3 months ago
- WorldGPT: Empowering LLM as Multimodal World Model☆123Updated 3 months ago
- An Information Flow Perspective for Exploring Large Vision Language Models on Reasoning Tasks☆61Updated last month
- [ACL'23] Code for "SANTA: Separate Strategies for Inaccurate and Incomplete Annotation Noise in Distantly-Supervised Named Entity Recogni…☆43Updated last year
- SemiEvol: Semi-supervised Fine-tuning for LLM Adaptation☆59Updated last month
- The official code for "BiGR: Harnessing Binary Latent Codes for Image Generation and Improved Visual Representation Capabilities"☆153Updated 3 weeks ago
- ☆115Updated last year
- MPLSandbox is an out-of-the-box multi-programming language sandbox designed to provide unified and comprehensive feedback from compiler a…☆49Updated last week
- ☆34Updated 5 months ago
- PyTorch code for BagFormer: Better Cross-Modal Retrieval via bag-wise interaction☆115Updated last year