euclid-multimodal / Euclid
☆13Updated 2 months ago
Alternatives and similar repositories for Euclid:
Users that are interested in Euclid are comparing it to the libraries listed below
- ☆39Updated 4 months ago
- ☆35Updated last month
- Official PyTorch Implementation for Task Vectors are Cross-Modal☆22Updated 3 months ago
- DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding☆33Updated last week
- Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆14Updated 2 weeks ago
- Multimodal RewardBench☆31Updated last month
- [ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation☆41Updated 3 months ago
- ☆21Updated 8 months ago
- Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models."☆42Updated 5 months ago
- Enhancing Large Vision Language Models with Self-Training on Image Comprehension.☆65Updated 9 months ago
- Preference Learning for LLaVA☆41Updated 4 months ago
- Implementation and dataset for paper "Can MLLMs Perform Text-to-Image In-Context Learning?"☆32Updated last week
- [ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)☆54Updated 5 months ago
- Official implementation for "MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?"☆44Updated last month
- [ICLR 2025] Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision☆59Updated 8 months ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆55Updated last month
- [NeurIPS 2024] Official Repository of Multi-Object Hallucination in Vision-Language Models☆28Updated 4 months ago
- This is the official implementation of the paper "S²R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning"☆49Updated 2 weeks ago
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features"☆10Updated last month
- Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents"☆41Updated last month
- ☆70Updated 2 months ago
- [NeurIPS 2024] Calibrated Self-Rewarding Vision Language Models☆70Updated 9 months ago
- ☆18Updated 4 months ago
- ☆37Updated 2 months ago
- ☆8Updated 9 months ago
- Code for Heima☆37Updated last month
- [NAACL 2024] Vision language model that reduces hallucinations through self-feedback guided revision. Visualizes attentions on image feat…☆44Updated 7 months ago
- ☆15Updated 8 months ago
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"☆33Updated 2 months ago
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction☆62Updated last month