sitloboi2012 / Visualization-of-ThoughtLinks
The implementation of the paper: "Visualization-of-Thought Elicits Spatial Reasoning in Large Language Models"
☆35Updated last year
Alternatives and similar repositories for Visualization-of-Thought
Users that are interested in Visualization-of-Thought are comparing it to the libraries listed below
Sorting:
- ☆87Updated 2 years ago
- Official code for the paper "ADaPT: As-Needed Decomposition and Planning with Language Models"☆90Updated 2 years ago
- [ICCV'23] LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models☆213Updated 10 months ago
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]☆148Updated last year
- [ACL 2025] AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant☆44Updated last year
- [ICML 2024 Oral] A framework for society simulation that supports complex simulation, for example: multi-scene.☆84Updated last year
- WONDERBREAD benchmark + dataset for BPM tasks☆34Updated 6 months ago
- [EMNLP 2025 Main] AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time☆89Updated 7 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆141Updated last year
- ☆73Updated 8 months ago
- A benchmark for evaluating learning agents based on just language feedback☆94Updated 7 months ago
- [NeurIPS 2024] A task generation and model evaluation system for multimodal language models.☆73Updated last year
- ☆118Updated 9 months ago
- ☆23Updated this week
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆40Updated last year
- SmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. …☆145Updated last year
- ☆29Updated 10 months ago
- A collection of some awesome public projects about LLM-based Web Agents and Tools.☆12Updated last year
- ☆23Updated last year
- The official implementation of the paper "Read to Play (R2-Play): Decision Transformer with Multimodal Game Instruction".☆33Updated last year
- Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters☆92Updated 2 years ago
- ☆67Updated 10 months ago
- ☆123Updated last year
- augmented LLM with self reflection☆136Updated 2 years ago
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆101Updated 2 years ago
- [ICML 2025] Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search☆108Updated 7 months ago
- [NeurIPS 2024] GTBench: Uncovering the Strategic Reasoning Limitations of LLMs via Game-Theoretic Evaluations☆68Updated last year
- Embodied and organized multi-LLM-agent teams supporting communication for >3 agents. Source codes for the paper "Embodied LLM Agents Lear…☆48Updated 7 months ago
- AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents☆36Updated 3 months ago
- [TMLR'25] "Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents"☆94Updated 3 months ago