THUDM / SceneGenAgentLinks
[ACL 2025 Main] SceneGenAgent: Precise Industrial Scene Generation with Coding Agent
☆19Updated 8 months ago
Alternatives and similar repositories for SceneGenAgent
Users that are interested in SceneGenAgent are comparing it to the libraries listed below
Sorting:
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆120Updated 9 months ago
- Reading List of Memory Augmented Multimodal Research, including multimodal context modeling, memory in vision and robotics, and external …☆42Updated 11 months ago
- (ICLR 2025) The Official Code Repository for GUI-World.☆64Updated 7 months ago
- [NeurIPS 2024] Official Implementation for Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks☆78Updated last month
- Code for the paper "AutoPresent: Designing Structured Visuals From Scratch" (CVPR 2025)☆117Updated 2 months ago
- ZeroGUI: Automating Online GUI Learning at Zero Human Cost☆82Updated 3 weeks ago
- Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"☆80Updated 2 months ago
- ☆77Updated 11 months ago
- ☆37Updated 8 months ago
- ☆302Updated last week
- ☆66Updated 2 months ago
- ☆87Updated last year
- [ICCV 2025 Highlight] The official repository for "2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining"☆167Updated 4 months ago
- MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search too…☆284Updated this week
- Code for NeurIPS 2024 paper "AutoManual: Constructing Instruction Manuals by LLM Agents via Interactive Environmental Learning"☆43Updated 9 months ago
- OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement☆104Updated 2 weeks ago
- Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay☆104Updated 2 months ago
- (VillagerAgent ACL 2024) A Graph based Minecraft multi agents framework☆68Updated last month
- ☆48Updated 3 months ago
- ☆42Updated 9 months ago
- [ACL 2025] Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis☆152Updated last month
- AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time☆76Updated 2 months ago
- Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models☆40Updated 3 weeks ago
- ☆25Updated 3 weeks ago
- [ICLR 2025] A trinity of environments, tools, and benchmarks for general virtual agents☆214Updated last month
- The Code Repo for Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization☆117Updated 11 months ago
- [NeurIPS 2024] A task generation and model evaluation system for multimodal language models.☆72Updated 8 months ago
- Official implementation of "PyVision: Agentic Vision with Dynamic Tooling."☆116Updated 3 weeks ago
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories☆32Updated this week
- ☆66Updated 2 years ago