zongqianwu / ST-COTLinks
(ICML 2025) Rethinking Chain-of-Thought from the Perspective of Self-Training
☆9Updated 5 months ago
Alternatives and similar repositories for ST-COT
Users that are interested in ST-COT are comparing it to the libraries listed below
Sorting:
- Official Repo for FoodieQA paper (EMNLP 2024)☆16Updated 3 weeks ago
- ☆18Updated 6 months ago
- ☆12Updated 5 months ago
- ☆16Updated 2 months ago
- 2D-TPE: Two-Dimensional Positional Encoding Enhances Table Understanding for Large Language Models (WWW 2025)☆10Updated 3 months ago
- ☆18Updated 9 months ago
- Adapt MLLMs to Domains via Post-Training☆9Updated 6 months ago
- ☆12Updated 6 months ago
- KV cache compression via sparse coding☆11Updated 2 months ago
- [CVPR 2025] Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering☆38Updated this week
- [ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation☆89Updated 7 months ago
- ☆19Updated this week
- [ICLR 2025] ELICIT: LLM Augmentation Via External In-context Capability☆11Updated 4 months ago
- More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Models☆30Updated last month
- [ICLR 2025] Causal Graphical Models for Vision-Language Compositional Understanding☆9Updated 3 months ago
- This is the official pytorch implementation for paper: Filter, Correlate, Compress: Training-Free Token Reduction for MLLM Acceleration☆15Updated 4 months ago
- ☆17Updated 9 months ago
- A curated list of Awesome Personalized Large Multimodal Models resources☆31Updated last month
- CLIP-MoE: Mixture of Experts for CLIP☆42Updated 9 months ago
- codes for Efficient Test-Time Scaling via Self-Calibration☆14Updated 4 months ago
- ☆17Updated 7 months ago
- ACL'2025: SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs. and preprint: SoftCoT++: Test-Time Scaling with Soft Chain-of…☆35Updated last month
- Code for Reducing Hallucinations in Vision-Language Models via Latent Space Steering☆65Updated 7 months ago
- [ICLR 2025] Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality☆34Updated 2 weeks ago
- [EMNLP 2024] TraveLER: A Modular Multi-LMM Agent Framework for Video Question-Answering☆15Updated 8 months ago
- Repo for Anonymous purpose, pls don't distribute☆10Updated 9 months ago
- [ICML 2025] Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in…☆141Updated last week
- Multimodal Instruction Tuning with Conditional Mixture of LoRA (ACL 2024)☆28Updated 11 months ago
- Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models."☆42Updated 9 months ago
- Code and data for ACL 2024 paper on 'Cross-Modal Projection in Multimodal LLMs Doesn't Really Project Visual Attributes to Textual Space'☆15Updated 11 months ago