sitloboi2012 / Visualization-of-ThoughtLinks
The implementation of the paper: "Visualization-of-Thought Elicits Spatial Reasoning in Large Language Models"
☆34Updated last year
Alternatives and similar repositories for Visualization-of-Thought
Users that are interested in Visualization-of-Thought are comparing it to the libraries listed below
Sorting:
- [ACL 2025] AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant☆41Updated 10 months ago
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]☆147Updated 11 months ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆37Updated last year
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆133Updated last year
- [ICML 2024 Oral] A framework for society simulation that supports complex simulation, for example: multi-scene.☆82Updated last year
- [ICCV'23] LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models☆207Updated 7 months ago
- ☆86Updated last year
- [NeurIPS 2024] Official Implementation for Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks☆87Updated 4 months ago
- [EMNLP 2025 Main] AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time☆85Updated 5 months ago
- [ICML 2025] Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search☆109Updated 5 months ago
- ☆24Updated 4 months ago
- [NeurIPS 2024] A task generation and model evaluation system for multimodal language models.☆73Updated 11 months ago
- ☆27Updated 8 months ago
- [NeurIPS 2024] GTBench: Uncovering the Strategic Reasoning Limitations of LLMs via Game-Theoretic Evaluations☆66Updated last year
- ☆116Updated 7 months ago
- Official code for the paper "ADaPT: As-Needed Decomposition and Planning with Language Models"☆90Updated last year
- ☆221Updated 8 months ago
- ☆67Updated 7 months ago
- PreAct: Prediction Enhances Agent's Planning Ability (Coling2025)☆30Updated 11 months ago
- A collection of some awesome public projects about LLM-based Web Agents and Tools.☆11Updated last year
- SmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. …☆143Updated last year
- Natural Language Reinforcement Learning☆99Updated 3 months ago
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆109Updated 5 months ago
- Multi-Modal Tree of thoughts for DALLE-3 like auto self improvement☆16Updated last year
- OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement☆118Updated 3 months ago
- WONDERBREAD benchmark + dataset for BPM tasks☆31Updated 3 months ago
- [ICLR 2024] Trajectory-as-Exemplar Prompting with Memory for Computer Control☆61Updated 10 months ago
- DeepPHY: Benchmarking Agentic VLMs on Physical Reasoning☆157Updated 2 months ago
- Extensive Self-Contrast Enables Feedback-Free Language Model Alignment☆20Updated last year
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning☆72Updated this week