qiancheng0 / ModelingAgentLinks
☆18Updated 2 months ago
Alternatives and similar repositories for ModelingAgent
Users that are interested in ModelingAgent are comparing it to the libraries listed below
Sorting:
- AutoLibra: Metric Induction for Agents from Open-Ended Human Feedback☆16Updated last month
- [EMNLP 2024] RoTBench: A Multi-Level Benchmark for Evaluating the Robustness of Large Language Models in Tool Learning☆14Updated 6 months ago
- The official implementation of the paper "Large Scale Knowledge Washing"☆10Updated last year
- This is the repo for constructing a comprehensive and rigorous evaluation framework for LLM calibration.☆13Updated last year
- Official code for "Decoding-Time Language Model Alignment with Multiple Objectives".☆27Updated last year
- official implementation of paper "Process Reward Model with Q-value Rankings"☆64Updated 9 months ago
- The OlymMATH dataset☆20Updated 5 months ago
- Official Code Release for "Training a Generally Curious Agent"☆38Updated 6 months ago
- Official implementation of "Data Mixture Inference: What do BPE tokenizers reveal about their training data?"☆17Updated 6 months ago
- FinanceRAG project by KAIST students. Advanced Retrieval-Augmented Generation (RAG) system designed for the financial domain.☆15Updated 9 months ago
- DataSciBench: An LLM Agent Benchmark for Data Science☆39Updated 2 months ago
- This is the repository for paper EscapeBench: Pushing Language Models to Think Outside the Box☆16Updated 11 months ago
- ☆20Updated 2 weeks ago
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples☆112Updated 3 months ago
- OptiBench and ReSocratic Synthesis Method☆27Updated last month
- QRHead: Query-Focused Retrieval Heads Improve Long-Context Reasoning and Re-ranking☆25Updated 2 months ago
- A Multi-Dimensional Constraint Framework for Evaluating and Improving Instruction Following in Large Language Models☆17Updated 5 months ago
- ☆25Updated 7 months ago
- ☆33Updated last year
- ☆20Updated last year
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆132Updated 7 months ago
- Repo for Paper "From Role-Play to Drama-Interaction: An LLM Solution" @ACL 2024☆12Updated last year
- ☆30Updated last year
- [NeurIPS 2025] What Makes a Reward Model a Good Teacher? An Optimization Perspective☆39Updated 2 months ago
- Reasoning Activation in LLMs via Small Model Transfer (NeurIPS 2025)☆20Updated last month
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆133Updated last year
- AdaRFT: Efficient Reinforcement Finetuning via Adaptive Curriculum Learning☆47Updated 5 months ago
- ☆49Updated 9 months ago
- ☆46Updated last year
- ☆16Updated last year