cvenhoff / vlm-mappingLinks
☆13Updated 3 weeks ago
Alternatives and similar repositories for vlm-mapping
Users that are interested in vlm-mapping are comparing it to the libraries listed below
Sorting:
- The official repo for “Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem”☆18Updated last month
- Official Repo for RuleReasoner.☆24Updated last month
- Resa: Transparent Reasoning Models via SAEs☆39Updated last month
- Lottery Ticket Adaptation☆39Updated 7 months ago
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆15Updated 4 months ago
- The official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508)☆35Updated this week
- Official implementation of ECCV24 paper: POA☆24Updated 11 months ago
- Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning☆28Updated 3 weeks ago
- ☆33Updated 6 months ago
- ☆33Updated 2 weeks ago
- A testbed for agents and environments that can automatically improve models through data generation.☆24Updated 4 months ago
- A holistic benchmark for LLM abstention☆38Updated this week
- Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents"☆46Updated 4 months ago
- ☆36Updated last month
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆19Updated 3 weeks ago
- ☆29Updated last year
- ☆48Updated last month
- OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement☆93Updated this week
- ☆13Updated 7 months ago
- A benchmark dataset and simple code examples for measuring the perception and reasoning of multi-sensor Vision Language models.☆18Updated 6 months ago
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework☆63Updated last month
- Partial Masking for Discrete Diffusion Models☆14Updated 3 weeks ago
- Code for "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"☆15Updated 3 months ago
- ☆23Updated 3 weeks ago
- OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation, arXiv 2024☆60Updated 4 months ago
- AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents☆24Updated 3 weeks ago
- Fork of Flame repo for training of some new stuff in development☆14Updated 3 weeks ago
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆19Updated 4 months ago
- ☆24Updated 3 weeks ago
- Official implementation of "Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models"☆36Updated last year