qiancheng0 / ModelingAgentLinks
☆18Updated last month
Alternatives and similar repositories for ModelingAgent
Users that are interested in ModelingAgent are comparing it to the libraries listed below
Sorting:
- [EMNLP 2024] RoTBench: A Multi-Level Benchmark for Evaluating the Robustness of Large Language Models in Tool Learning☆14Updated 4 months ago
- FinanceRAG project by KAIST students. Advanced Retrieval-Augmented Generation (RAG) system designed for the financial domain.☆15Updated 7 months ago
- The OlymMATH dataset☆20Updated 4 months ago
- The official implementation of the paper "Large Scale Knowledge Washing"☆10Updated last year
- OptiBench and ReSocratic Synthesis Method☆26Updated last week
- DataSciBench: An LLM Agent Benchmark for Data Science☆33Updated last month
- QRHead: Query-Focused Retrieval Heads Improve Long-Context Reasoning and Re-ranking☆22Updated 3 weeks ago
- Official Code Release for "Training a Generally Curious Agent"☆34Updated 4 months ago
- Official code for "Decoding-Time Language Model Alignment with Multiple Objectives".☆25Updated 11 months ago
- This is the repository for paper EscapeBench: Pushing Language Models to Think Outside the Box☆15Updated 9 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆129Updated 11 months ago
- ☆11Updated 7 months ago
- ☆64Updated 2 weeks ago
- The official code for NAACL 2024 paper: $E^5$: Zero-shot Hierarchical Table Analysis using Augmented LLMs via Explain, Extract, Execute, …☆15Updated last year
- ☆18Updated last year
- official implementation of paper "Process Reward Model with Q-value Rankings"☆63Updated 8 months ago
- Process Reward Models That Think☆55Updated 3 months ago
- Official repository of paper "Parameters vs. Context: Fine-Grained Control of Knowledge Reliance in Language Models"☆21Updated 4 months ago
- Repo for Paper "From Role-Play to Drama-Interaction: An LLM Solution" @ACL 2024☆12Updated last year
- ☆20Updated last year
- ☆33Updated 11 months ago
- ☆40Updated 6 months ago
- ☆48Updated 7 months ago
- Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)☆28Updated last year
- B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners☆85Updated 4 months ago
- A Multi-Dimensional Constraint Framework for Evaluating and Improving Instruction Following in Large Language Models☆17Updated 4 months ago
- Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" pr…☆110Updated last year
- ☆94Updated 5 months ago
- ☆21Updated 4 months ago
- ☆25Updated last year