yang-ze-kang / AutoMMLab
☆18Updated 10 months ago
Alternatives and similar repositories for AutoMMLab
Users who are interested in AutoMMLab are comparing it to the repositories listed below
- A regression-alike loss to improve numerical reasoning in language models - ICML 2025☆27Updated 4 months ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆50Updated 7 months ago
- ☆145Updated last year
- Geometric-Mean Policy Optimization☆95Updated last month
- Parameter-Efficient Fine-Tuning for Foundation Models☆104Updated 8 months ago
- A minimal implementation of LLaVA-style VLM with interleaved image & text & video processing ability.☆97Updated last year
- (ACL-2025 main conference) Dolphin: Moving Towards Closed-loop Auto-research through Thinking, Practice, and Feedback☆38Updated 6 months ago
- [NeurIPS 2025] Elevating Visual Perception in Multimodal LLMs with Visual Embedding Distillation☆68Updated 2 months ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆38Updated last year
- This repo contains the source code for VB-LoRA: Extreme Parameter Efficient Fine-Tuning with Vector Banks (NeurIPS 2024).☆42Updated last year
- ☆41Updated 6 months ago
- ☆67Updated 8 months ago
- Tree prompting: easy-to-use scikit-learn interface for improved prompting.☆40Updated 2 years ago
- Official PyTorch implementation of Self-emerging Token Labeling☆35Updated last year
- [NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models☆56Updated 7 months ago
- Reproduction of LLaVA-v1.5 based on Llama-3-8b LLM backbone.☆65Updated last year
- ☆51Updated 10 months ago
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*☆109Updated 7 months ago
- Implementation of "the first large-scale multimodal mixture of experts models" from the paper: "Multimodal Contrastive Learning with…☆36Updated 2 months ago
- A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large …☆97Updated last year
- ☆22Updated last year
- [NeurIPS'24 LanGame workshop] On The Planning Abilities of OpenAI's o1 Models: Feasibility, Optimality, and Generalizability☆41Updated 5 months ago
- ☆53Updated 10 months ago
- Multimodal Graph Learning: how to encode multiple multimodal neighbors with their relations into LLMs☆67Updated last year
- Code and Data for "MIRAI: Evaluating LLM Agents for Event Forecasting"☆84Updated last year
- ☆56Updated last year
- [EMNLP 2025 Main] AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time☆88Updated 6 months ago
- PyTorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models☆28Updated last year
- [MTI-LLM@NeurIPS 2025] Official implementation of "PyVision: Agentic Vision with Dynamic Tooling."☆141Updated 5 months ago
- Implementation of the MC-ViT model from the paper: "Memory Consolidation Enables Long-Context Video Understanding"☆27Updated 2 months ago