[ACL 2025 Findings] Text2World: Benchmarking Large Language Models for Symbolic World Model Generation
☆28Feb 25, 2025Updated last year
Alternatives and similar repositories for text2world
Users that are interested in text2world are comparing it to the libraries listed below
Sorting:
- [ICML'25] "Rethinking Addressing in Language Models via Contextualized Equivariant Positional Encoding" by Jiajun Zhu, Peihao Wang, Ruisi…☆14Jun 6, 2025Updated 8 months ago
- [ICLR 2025] Weighted-Reward Preference Optimization for Implicit Model Fusion☆14Mar 17, 2025Updated 11 months ago
- CS194-196 Course Project☆14Feb 20, 2025Updated last year
- [ACL'25 (Findings)] Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents☆26Feb 17, 2026Updated last week
- M^3PC: Test-Time Model Predictive Control for Pretrained Masked Trajectory Model, ICLR 2025☆19Mar 17, 2025Updated 11 months ago
- ☆33Jul 9, 2025Updated 7 months ago
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features"☆17Mar 31, 2025Updated 11 months ago
- [EMNLP 2024] Tree of Problems: Improving structured problem solving with compositionality☆19Mar 4, 2025Updated 11 months ago
- [arXiv 2024] Articulated Object Manipulation using Online Axis Estimation with SAM2-Based Tracking☆18Apr 4, 2025Updated 10 months ago
- [ICLR 2026] Fast-Slow Toolpath Agent with Subroutine Mining for Efficient Multi-turn Image Editing☆29Feb 6, 2026Updated 3 weeks ago
- ☆21May 3, 2025Updated 9 months ago
- [ACL 2025 Findings] Official implementation of the paper "Unveiling the Key Factors for Distilling Chain-of-Thought Reasoning".☆20Feb 26, 2025Updated last year
- Code for 'Mapping State Space using Landmarks for Universal Goal Reaching'.☆16Dec 26, 2023Updated 2 years ago
- [ICML 2025] Official code of "AlphaDPO: Adaptive Reward Margin for Direct Preference Optimization"☆30Jan 10, 2026Updated last month
- [NeurIPS25] Official Implementation (Pytorch) of "DeepVideo-R1"☆31Feb 22, 2026Updated last week
- Official Code for "Rethinking Diffusion Model in High Dimension"☆24May 20, 2025Updated 9 months ago
- TPDiff: Temporal Pyramid Video Diffusion Model☆25Mar 13, 2025Updated 11 months ago
- Official implementation of DEMO3☆65Jul 29, 2025Updated 7 months ago
- [ACM MM 2025] MLLMs for Aesthetics Reasoning☆23Jan 5, 2026Updated last month
- ☆21Aug 30, 2025Updated 6 months ago
- ☆17Aug 1, 2025Updated 7 months ago
- Reasoning Agentic Retrieval-Augmented Generation for Industry Challenges☆28May 14, 2025Updated 9 months ago
- MetaAgent: Toward Self-Evolving Agent via Tool Meta-Learning☆42Sep 3, 2025Updated 5 months ago
- Extensive Self-Contrast Enables Feedback-Free Language Model Alignment☆21Apr 2, 2024Updated last year
- ☆59Mar 3, 2025Updated 11 months ago
- ☆31Nov 23, 2025Updated 3 months ago
- A Text2SQL benchmark for evaluation of Large Language Models☆41Updated this week
- (NeurIPS 2024) What Makes CLIP More Robust to Long-Tailed Pre-Training Data? A Controlled Study for Transferable Insights☆28Oct 28, 2024Updated last year
- The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World".☆27Aug 20, 2025Updated 6 months ago
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆12Jun 28, 2025Updated 8 months ago
- [arxiv: 2512.19673] Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies☆59Feb 6, 2026Updated 3 weeks ago
- A curated list of awesome open-source grasping libraries and resources☆63Jul 16, 2025Updated 7 months ago
- [NeurIPS 2025] A multimodal agent that can interact with its own PC in a multimodal manner.☆34Nov 10, 2025Updated 3 months ago
- The original Shared Recurrent Memory Transformer implementation☆33Jul 11, 2025Updated 7 months ago
- (ICLR 2025 Spotlight) Official code repository for Interleaved Scene Graph.☆31Aug 7, 2025Updated 6 months ago
- RePo: Language Models with Context Re-Positioning☆70Dec 24, 2025Updated 2 months ago
- DataSciBench: An LLM Agent Benchmark for Data Science☆51Jan 21, 2026Updated last month
- The official implementation of "Sparse-vDiT: Unleashing the Power of Sparse Attention to Accelerate Video Diffusion Transformers" (arXiv …☆50Jun 6, 2025Updated 8 months ago
- The official repo for "VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search" [EMNLP25]☆38Feb 1, 2026Updated last month