yang-ze-kang / AutoMMLabLinks
☆13Updated 4 months ago
Alternatives and similar repositories for AutoMMLab
Users that are interested in AutoMMLab are comparing it to the libraries listed below
Sorting:
- Tree prompting: easy-to-use scikit-learn interface for improved prompting.☆38Updated last year
- OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation, arXiv 2024☆60Updated 4 months ago
- [WACV 2025] Official implementation of "Online-LoRA: Task-free Online Continual Learning via Low Rank Adaptation" by Xiwen Wei, Guihong L…☆46Updated 8 months ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆46Updated 2 months ago
- AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time☆73Updated last month
- ☆142Updated last year
- ☆66Updated 3 months ago
- ☆47Updated 5 months ago
- ☆46Updated 2 months ago
- ☆20Updated 11 months ago
- [NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models☆49Updated last month
- Dedicated to building industrial foundation models for universal data intelligence across industries.☆57Updated 10 months ago
- ☆73Updated 2 months ago
- OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement☆93Updated last week
- Official implementation of "Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models"☆36Updated last year
- Parameter-Efficient Fine-Tuning for Foundation Models☆75Updated 3 months ago
- Code for Paper: Harnessing Webpage Uis For Text Rich Visual Understanding☆52Updated 7 months ago
- survery of small language models☆15Updated 11 months ago
- RewardAnything: Generalizable Principle-Following Reward Models☆25Updated last month
- Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents"☆46Updated 4 months ago
- [ICLR 2024] Trajectory-as-Exemplar Prompting with Memory for Computer Control☆59Updated 6 months ago
- ☆41Updated 8 months ago
- [NeurIPS 2024] CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs☆120Updated 2 months ago
- This repo contains the source code for VB-LoRA: Extreme Parameter Efficient Fine-Tuning with Vector Banks (NeurIPS 2024).☆39Updated 9 months ago
- Official Pytorch Implementation of Self-emerging Token Labeling☆34Updated last year
- [ACL2025] Unsolvable Problem Detection: Robust Understanding Evaluation for Large Multimodal Models☆77Updated last month
- Official implementation of "PyVision: Agentic Vision with Dynamic Tooling."☆24Updated last week
- ☆63Updated last year
- Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent."☆77Updated 8 months ago
- On The Planning Abilities of OpenAI's o1 Models: Feasibility, Optimality, and Generalizability☆39Updated last week