stanfordmlgroup / ManyICL
☆142Updated last year
Alternatives and similar repositories for ManyICL
Users who are interested in ManyICL are comparing it to the libraries listed below.
- OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement☆88Updated 2 weeks ago
- Codes for Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models☆222Updated 7 months ago
- Auto Interpretation Pipeline and many other functionalities for Multimodal SAE Analysis.☆132Updated 4 months ago
- SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models☆112Updated last month
- [ICML 2024 Oral] Official code repository for MLLM-as-a-Judge.☆68Updated 3 months ago
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*☆103Updated last week
- An LLM-free Multi-dimensional Benchmark for Multi-modal Hallucination Evaluation☆120Updated last year
- A Survey on Benchmarks of Multimodal Large Language Models☆105Updated 2 months ago
- ☆173Updated last week
- ☆179Updated last year
- Enhancing Large Vision Language Models with Self-Training on Image Comprehension.☆67Updated last year
- Benchmark and research code for the paper SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks☆207Updated 3 weeks ago
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]☆136Updated 6 months ago
- What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective☆64Updated 3 months ago
- [NeurIPS 2024] A task generation and model evaluation system for multimodal language models.☆71Updated 6 months ago
- [CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(…☆282Updated 6 months ago
- [ACL2025] Unsolvable Problem Detection: Robust Understanding Evaluation for Large Multimodal Models☆77Updated this week
- [NeurIPS 2024] CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs☆115Updated last month
- [NeurIPS 2024] Needle In A Multimodal Haystack (MM-NIAH): A comprehensive benchmark designed to systematically evaluate the capability of…☆117Updated 6 months ago
- Dynamic Cheatsheet: Test-Time Learning with Adaptive Memory☆61Updated last week
- Official code for Paper "Mantis: Multi-Image Instruction Tuning" [TMLR2024]☆215Updated 2 months ago
- Towards Large Multimodal Models as Visual Foundation Agents☆216Updated last month
- The official repository for "2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining"☆158Updated 2 months ago
- "Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents"☆74Updated last month
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales"☆106Updated 8 months ago
- ☆127Updated 3 weeks ago
- [Arxiv] Aligning Modalities in Vision Large Language Models via Preference Fine-tuning☆86Updated last year
- ☆105Updated 2 months ago
- General Reasoner: Advancing LLM Reasoning Across All Domains☆117Updated this week
- ☆59Updated last week