yang-ze-kang / AutoMMLab
☆18Updated 9 months ago
Alternatives and similar repositories for AutoMMLab
Users who are interested in AutoMMLab are comparing it to the repositories listed below.
- ☆145Updated last year
- A regression-alike loss to improve numerical reasoning in language models - ICML 2025☆27Updated 3 months ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆49Updated 6 months ago
- Parameter-Efficient Fine-Tuning for Foundation Models☆101Updated 8 months ago
- This repo contains the source code for VB-LoRA: Extreme Parameter Efficient Fine-Tuning with Vector Banks (NeurIPS 2024).☆42Updated last year
- Geometric-Mean Policy Optimization☆95Updated 3 weeks ago
- [NeurIPS 2025] Elevating Visual Perception in Multimodal LLMs with Visual Embedding Distillation, arXiv 2024☆66Updated last month
- [MTI-LLM@NeurIPS 2025] Official implementation of "PyVision: Agentic Vision with Dynamic Tooling."☆137Updated 4 months ago
- ☆72Updated 4 months ago
- [ML4H'25] m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning in Large Language Models☆46Updated 7 months ago
- Tree prompting: easy-to-use scikit-learn interface for improved prompting.☆40Updated 2 years ago
- [NeurIPS'24 LanGame workshop] On The Planning Abilities of OpenAI's o1 Models: Feasibility, Optimality, and Generalizability☆41Updated 5 months ago
- A minimal implementation of LLaVA-style VLM with interleaved image & text & video processing ability.☆96Updated 11 months ago
- ☆13Updated 6 months ago
- A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large …☆97Updated 11 months ago
- ☆56Updated last year
- Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent."☆94Updated last year
- Reinforcement Learning of Vision Language Models with Self Visual Perception Reward☆146Updated 2 months ago
- ☆48Updated 9 months ago
- Survey of small language models☆17Updated last year
- [NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models☆55Updated 6 months ago
- Code for Paper: Harnessing Webpage UIs for Text-Rich Visual Understanding☆53Updated 11 months ago
- ☆78Updated 2 weeks ago
- ☆41Updated 6 months ago
- ☆67Updated 8 months ago
- A collection of AWESOME language modeling techniques on tabular data applications.☆32Updated last year
- Evaluation and dataset construction code for the CVPR 2025 paper "Vision-Language Models Do Not Understand Negation"☆41Updated 7 months ago
- Official implementation of MAIA, A Multimodal Automated Interpretability Agent☆99Updated last month
- Implementation and evaluation of multimodal RAG with text and image inputs for industrial applications☆64Updated last year
- (ACL-2025 main conference) Dolphin: Moving Towards Closed-loop Auto-research through Thinking, Practice, and Feedback☆36Updated 5 months ago