qiancheng0 / ModelingAgentLinks
☆21Updated 5 months ago
Alternatives and similar repositories for ModelingAgent
Users that are interested in ModelingAgent are comparing it to the libraries listed below
Sorting:
- The OlymMATH dataset☆23Updated 8 months ago
- AutoLibra: Metric Induction for Agents from Open-Ended Human Feedback☆17Updated 3 months ago
- [EMNLP 2024] RoTBench: A Multi-Level Benchmark for Evaluating the Robustness of Large Language Models in Tool Learning☆15Updated 8 months ago
- DataSciBench: An LLM Agent Benchmark for Data Science☆50Updated 2 weeks ago
- ☆239Updated last month
- [ICLR 2025] This is the official implementation for the paper: "Large Language Models Meet Symbolic Provers for Logical Reasoning Evaluat…☆42Updated 7 months ago
- [NeurIPS 2025 Spotlight] Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning☆149Updated 4 months ago
- Reinforced Multi-LLM Agents training☆70Updated 3 weeks ago
- Scaling Long-Horizon LLM Agent via Context-Folding☆106Updated 2 weeks ago
- FinanceRAG project by KAIST students. Advanced Retrieval-Augmented Generation (RAG) system designed for the financial domain.☆15Updated 11 months ago
- Tree-of-Debate converts scientific papers into LLM personas that debate their respective novelties. To emphasize structured, critical rea…☆18Updated 6 months ago
- Official Code Release for "Training a Generally Curious Agent"☆44Updated 8 months ago
- ☆193Updated 3 months ago
- This is the repository for paper EscapeBench: Pushing Language Models to Think Outside the Box☆18Updated last year
- ☆12Updated 11 months ago
- Open-source Agentic RL for LLMs — RLAnything & DemyAgent☆223Updated last week
- ☆134Updated 2 weeks ago
- [ACL'25] UTBoost: Rigorous Evaluation of Coding Agents on SWE-Bench☆33Updated 5 months ago
- MemGen: Weaving Generative Latent Memory for Self-Evolving Agents☆298Updated last week
- ☆275Updated 5 months ago
- The official implementation of the paper "Large Scale Knowledge Washing"☆10Updated last year
- ☆72Updated 8 months ago
- Now, Stronger AI Pushes Frontiers, Stronger Our Shared Future.☆284Updated 2 months ago
- This is the repository for paper "CREATOR: Tool Creation for Disentangling Abstract and Concrete Reasoning of Large Language Models"☆29Updated 2 years ago
- A LLM Multi-Agent Framework toward Ultra Large-Scale Code Generation and Optimization☆16Updated last year
- Official Implementation of "Simulating Environments with Reasoning Models for Agent Training"☆56Updated this week
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆143Updated last year
- The official repo of "WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents"☆101Updated 4 months ago
- Code, benchmark and environment for "ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows"☆122Updated last week
- ☆25Updated last year