Enhancing Large Vision Language Models with Self-Training on Image Comprehension.
☆69May 31, 2024Updated last year
Alternatives and similar repositories for STIC
Users that are interested in STIC are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2024] Calibrated Self-Rewarding Vision Language Models☆86Oct 26, 2025Updated 4 months ago
- 【NeurIPS 2024】The official code of paper "Automated Multi-level Preference for MLLMs"☆22Sep 26, 2024Updated last year
- [Arxiv] Aligning Modalities in Vision Large Language Models via Preference Fine-tuning☆91Apr 30, 2024Updated last year
- Holistic Coverage and Faithfulness Evaluation of Large Vision-Language Models (ACL-Findings 2024)☆16Apr 23, 2024Updated last year
- [CVPR'24] RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback☆307Sep 11, 2024Updated last year
- An automatic MLLM hallucination detection framework☆19Sep 26, 2023Updated 2 years ago
- ☆101Dec 22, 2023Updated 2 years ago
- [EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models.☆86Nov 10, 2024Updated last year
- [ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning☆71Jul 13, 2025Updated 7 months ago
- [CVPR2025 Highlight] Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models☆233Nov 7, 2025Updated 3 months ago
- ☆66Feb 5, 2024Updated 2 years ago
- ☆28Feb 10, 2025Updated last year
- THOUGHTSCULPT, a general reasoning and search method for complex tasks☆13Dec 13, 2024Updated last year
- [AAAI 2026] ReCode: Reinforced Code Knowledge Editing for API Updates☆22Jul 1, 2025Updated 8 months ago
- ☆22Jun 5, 2025Updated 8 months ago
- Responsible Robotic Manipulation☆16Aug 31, 2025Updated 6 months ago
- ☆12Jul 16, 2025Updated 7 months ago
- 🤖 [ICLR'25] Multimodal Video Understanding Framework (MVU)☆55Jan 31, 2025Updated last year
- [Neurips'24 Spotlight] Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought …☆430Dec 22, 2024Updated last year
- ☆124Jul 29, 2024Updated last year
- ☆156Oct 31, 2024Updated last year
- ☆16Sep 25, 2025Updated 5 months ago
- [EMNLP 2024] SURf: Teaching Large Vision-Language Models to Selectively Utilize Retrieved Information☆12Oct 11, 2024Updated last year
- Aligning LMMs with Factually Augmented RLHF☆392Nov 1, 2023Updated 2 years ago
- code for "Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization"☆60Aug 23, 2024Updated last year
- Official Implementation for "MyVLM: Personalizing VLMs for User-Specific Queries" (ECCV 2024)☆186Jul 5, 2024Updated last year
- Repository for the paper: Teaching VLMs to Localize Specific Objects from In-context Examples☆40Nov 27, 2024Updated last year
- Official implementation of "Self-Correcting Self-Consuming Loops for Generative Model Training" (ICML 2024)☆35Jul 18, 2024Updated last year
- mmyolo for pose☆11Feb 24, 2023Updated 3 years ago
- OpenMMLab Detection Toolbox and Benchmark for V3Det☆15Apr 3, 2024Updated last year
- ☆93Mar 29, 2019Updated 6 years ago
- FaithScore: Fine-grained Evaluations of Hallucinations in Large Vision-Language Models☆32Nov 27, 2025Updated 3 months ago
- ☆67Mar 30, 2025Updated 11 months ago
- [EMNLP'23] The official GitHub page for ''Evaluating Object Hallucination in Large Vision-Language Models''☆107Aug 21, 2025Updated 6 months ago
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Feb 22, 2024Updated 2 years ago
- Incremental Python parser for constrained generation of code by LLMs.☆18Sep 18, 2024Updated last year
- Ongoing research project for code&math LLMs☆27Jul 4, 2025Updated 8 months ago
- ☆23Oct 30, 2025Updated 4 months ago
- Web-grounded natural language instructions☆18Nov 25, 2024Updated last year