qiancheng0 / ModelingAgentLinks
☆19Updated 3 months ago
Alternatives and similar repositories for ModelingAgent
Users that are interested in ModelingAgent are comparing it to the libraries listed below
Sorting:
- AutoLibra: Metric Induction for Agents from Open-Ended Human Feedback☆16Updated last month
- [EMNLP 2024] RoTBench: A Multi-Level Benchmark for Evaluating the Robustness of Large Language Models in Tool Learning☆14Updated 6 months ago
- FinanceRAG project by KAIST students. Advanced Retrieval-Augmented Generation (RAG) system designed for the financial domain.☆15Updated 10 months ago
- The official implementation of the paper "Large Scale Knowledge Washing"☆11Updated last year
- This is the repository for paper EscapeBench: Pushing Language Models to Think Outside the Box☆16Updated 11 months ago
- The OlymMATH dataset☆21Updated 6 months ago
- A LLM Multi-Agent Framework toward Ultra Large-Scale Code Generation and Optimization☆16Updated 11 months ago
- Repo for Paper "From Role-Play to Drama-Interaction: An LLM Solution" @ACL 2024☆12Updated last year
- ☆20Updated last month
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆136Updated last year
- OptiBench and ReSocratic Synthesis Method☆28Updated 2 months ago
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples☆112Updated 4 months ago
- DataSciBench: An LLM Agent Benchmark for Data Science☆43Updated 3 months ago
- TreeRL: LLM Reinforcement Learning with On-Policy Tree Search in ACL'25☆82Updated 5 months ago
- ☆72Updated last month
- Reinforced Multi-LLM Agents training☆60Updated 6 months ago
- ☆155Updated last month
- Code for paper "Prompt Engineering a Prompt Engineer" (https://arxiv.org/abs/2311.05661)☆10Updated last year
- Official code for "Decoding-Time Language Model Alignment with Multiple Objectives".☆29Updated last year
- ☆42Updated last week
- The official code for NAACL 2024 paper: $E^5$: Zero-shot Hierarchical Table Analysis using Augmented LLMs via Explain, Extract, Execute, …☆15Updated last year
- This is the repo for constructing a comprehensive and rigorous evaluation framework for LLM calibration.☆13Updated last year
- QRHead: Query-Focused Retrieval Heads Improve Long-Context Reasoning and Re-ranking☆29Updated 2 weeks ago
- MPO: Boosting LLM Agents with Meta Plan Optimization (EMNLP 2025 Findings)☆73Updated 3 months ago
- [ACL'25] UTBoost: Rigorous Evaluation of Coding Agents on SWE-Bench☆33Updated 3 months ago
- [ICML'25 Oral] Multi-agent Architecture Search via Agentic Supernet☆219Updated 3 weeks ago
- Implements LLM-Lasso☆37Updated 4 months ago
- ☆20Updated last year
- ☆12Updated 9 months ago
- Official Implementation of Dynamic LLM-Agent Network: An LLM-agent Collaboration Framework with Agent Team Optimization☆181Updated last year