The official implementation of "EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis".
☆110Feb 12, 2026Updated last month
Alternatives and similar repositories for EnvScaler
Users that are interested in EnvScaler are comparing it to the libraries listed below
Sorting:
- [CVPR 2026] Official repo for "EVATok: Adaptive Length Video Tokenization for Efficient Visual Autoregressive Generation"☆37Mar 13, 2026Updated last week
- HierSearch: A Hierarchical Enterprise Deep Search Framework Integrating Local and Web Searches☆37Oct 9, 2025Updated 5 months ago
- Official implementation of FRAPPE: Infusing World Modeling into Generalist Policies via Multiple Future Representation Alignment☆34Feb 24, 2026Updated 3 weeks ago
- OmniStream: Mastering Perception, Reconstruction and Action in Continuous Streams☆47Updated this week
- [CVPR 2026] Official code of "EmbodiedSplat: Online Feed-Forward Semantic 3DGS for Open-Vocabulary 3D Scene Understanding"☆47Mar 7, 2026Updated last week
- ☆122Jan 21, 2026Updated last month
- daVinci-Agency: Unlocking Long-Horizon Agency Data-Efficiently☆34Feb 4, 2026Updated last month
- GRPO Algorithm for Llava Architecture (Based on Verl)☆49May 9, 2025Updated 10 months ago
- ☆67Aug 14, 2025Updated 7 months ago
- The demo, code and data of FollowRAG☆76Jun 30, 2025Updated 8 months ago
- TBD☆49Updated this week
- EAFT(Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting) official repo☆93Jan 15, 2026Updated 2 months ago
- ☆13Feb 4, 2025Updated last year
- Some example codes for drawing figures in research paper☆35Mar 3, 2022Updated 4 years ago
- ☆20Jun 13, 2025Updated 9 months ago
- Open Ended Medical Reinforcement Learning☆35Updated this week
- [IEEE TIP] FakeReasoning: Towards Generalizable Forgery Detection and Reasoning.☆18Mar 9, 2026Updated last week
- Agentic Learning Powered by AWorld☆92Updated this week
- OmniGAIA: Towards Native Omni-Modal AI Agents☆66Feb 28, 2026Updated 2 weeks ago
- ☆32Jan 30, 2026Updated last month
- RAG methods, benchmarks, and toolkits☆19Nov 28, 2024Updated last year
- Open source code of the paper: "OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain"☆82Dec 20, 2024Updated last year
- ☆35Updated this week
- We release our code and data for SEAS in this repository.☆21Dec 23, 2024Updated last year
- ☆20Jun 16, 2025Updated 9 months ago
- [NeurIPS 2024 D&B Track] DACO: Towards Application-Driven and Comprehensive Data Analysis via Code Generation☆12Mar 5, 2025Updated last year
- SG-Bench: Evaluating LLM Safety Generalization Across Diverse Tasks and Prompt Types☆25Nov 29, 2024Updated last year
- MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head (ICLR 2026)☆131Feb 6, 2026Updated last month
- [ICLR 2026] The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution☆276Mar 12, 2026Updated last week
- Repository for the paper: "TiC-LM: A Web-Scale Benchmark for Time-Continual LLM Pretraining" ACL Oral 2025☆22Mar 6, 2026Updated last week
- [ICLR 2024] heterogeneous MoE: mixture of weak & strong experts on graphs https//openreview.net/pdf?id=wYvuY60SdD☆22Apr 6, 2025Updated 11 months ago
- ☆45Dec 12, 2024Updated last year
- Benchmarking for the attributed graphs☆18Nov 21, 2025Updated 3 months ago
- "Omni-R1: Towards the Unified Generative Paradigm for Multimodal Reasoning"☆55Jan 28, 2026Updated last month
- The evaluation code for A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5☆53Jan 18, 2026Updated 2 months ago
- ☆18Jun 18, 2025Updated 9 months ago
- ☆171Nov 26, 2025Updated 3 months ago
- ☆32Oct 21, 2025Updated 4 months ago
- ☆16Dec 17, 2023Updated 2 years ago