chengzu-li / MVoTView external linksLinks
Imagine While Reasoning in Space: Multimodal Visualization-of-Thought (ICML 2025)
☆67Apr 12, 2025Updated 10 months ago
Alternatives and similar repositories for MVoT
Users that are interested in MVoT are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2025] Scaling Language-centric Omnimodal Representation Learning☆32Feb 6, 2026Updated last week
- A Self-Training Framework for Vision-Language Reasoning☆88Jan 23, 2025Updated last year
- An Open-Source RAG Workload Trace to Optimize RAG Serving Systems☆35Nov 18, 2025Updated 2 months ago
- G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning☆98May 20, 2025Updated 8 months ago
- Official code for "pi-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation", ICML 2023.☆33Jul 21, 2023Updated 2 years ago
- ☆32Oct 31, 2024Updated last year
- o1 Chain of Thought Examples☆33Oct 4, 2024Updated last year
- MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models☆32Jan 22, 2025Updated last year
- The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning" [NeurIPS25]☆182Jun 5, 2025Updated 8 months ago
- ☆88Jun 7, 2024Updated last year
- [ACM MM 2023] The released code of paper "Deconfounded Visual Question Generation with Causal Inference"☆11Sep 3, 2024Updated last year
- [ACL2025 Oral🔥]Turning Trash into Treasure: Accelerating Inference of Large Language Models with Token Recycling☆22Nov 11, 2025Updated 3 months ago
- ☆12Jun 19, 2024Updated last year
- This repository collects awesome representative papers and resources for "From Pre-training to Post-training: A Survey on Time Series Fou…☆30Feb 1, 2026Updated last week
- ☆50Jun 7, 2025Updated 8 months ago
- [ICLR 2026] Official Implementation of ProxyThinker: Test-Time Guidance through Small Visual Reasoners.☆19Sep 24, 2025Updated 4 months ago
- The official implementation of Hard Negative Sampling via Large Language Models for Recommendation.☆11Jan 17, 2026Updated 3 weeks ago
- ☆15Feb 11, 2025Updated last year
- Self Evolving Large Multimodal Models with Continuous Rewards☆19Nov 21, 2025Updated 2 months ago
- ☆15Jul 22, 2024Updated last year
- Bridging Retrieval and Inference through Evidence Fusion☆12Oct 20, 2025Updated 3 months ago
- Diffusing States and Matching Scores: A New Framework for Imitation Learning☆21Nov 16, 2024Updated last year
- ☆24Jan 19, 2026Updated 3 weeks ago
- Personalized Fragrance Recommendation for Aromatherapy: A Machine Learning Approach Based on Personality Traits and Electrodermal Activit…☆14May 1, 2025Updated 9 months ago
- End-to-end implementation of the Social Graph Network (SGN), described in the Structural Reasoning for Image-based Social Relation Recogn…☆13Apr 3, 2024Updated last year
- LONGAGENT: Scaling Language Models to 128k Context through Multi-Agent Collaboration☆11Mar 11, 2024Updated last year
- ☆13May 9, 2024Updated last year
- Implementation of GraphReader paper: https://arxiv.org/abs/2406.14550☆13Oct 21, 2024Updated last year
- ☆10Jan 28, 2024Updated 2 years ago
- The PyTorch implementation of DSM (EMNLP 2022).☆10Mar 26, 2024Updated last year
- AutoLibra: Metric Induction for Agents from Open-Ended Human Feedback☆17Oct 15, 2025Updated 3 months ago
- ☆17Dec 23, 2025Updated last month
- [ICML 2025 Oral] This is the official repository of the paper "What Limits Virtual Agent Application? OmniBench: A Scalable Multi-Dimensi…☆20Jun 12, 2025Updated 8 months ago
- Awesome LLM for Cybersecurity☆11Nov 16, 2024Updated last year
- Corpus to accompany: "Selective Vision is the Challenge for Visual Reasoning: A Benchmark for Visual Argument Understanding"☆11Apr 11, 2025Updated 10 months ago
- MLR-Bench: Evaluating AI Agents on Open-Ended Machine Learning Research☆22Sep 23, 2025Updated 4 months ago
- Memory experiments with LLMs☆11Mar 31, 2023Updated 2 years ago
- A fork to add multimodal model training to open-r1☆1,449Feb 8, 2025Updated last year
- This repo contains the source code for: Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs☆44Aug 14, 2024Updated last year