camel-ai / agent-trust
🤝 The code for "Can Large Language Model Agents Simulate Human Trust Behaviors?"
☆77 Updated 2 weeks ago
Alternatives and similar repositories for agent-trust:
Users who are interested in agent-trust are comparing it to the repositories listed below.
- Benchmark and research code for the paper "SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks" ☆182 Updated last week
- [ICML 2024 Oral] A framework for society simulation that supports complex simulation, for example: multi-scene. ☆71 Updated 9 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms" ☆93 Updated 6 months ago
- [ICLR 2025] Benchmarking Agentic Workflow Generation ☆79 Updated 2 months ago
- A platform for developers to simulate collaborative research activities ☆146 Updated this week
- ☆107 Updated 3 months ago
- Official Implementation of Dynamic LLM-Agent Network: An LLM-agent Collaboration Framework with Agent Team Optimization ☆136 Updated 11 months ago
- Code and Data for "MIRAI: Evaluating LLM Agents for Event Forecasting" ☆63 Updated 9 months ago
- Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples ☆84 Updated last month
- ☆91 Updated 2 months ago
- Source code for our paper: "Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction A… ☆44 Updated last year
- ☆42 Updated 6 months ago
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024] ☆134 Updated 5 months ago
- Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling ☆101 Updated 3 months ago
- Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems ☆86 Updated last month
- Framework and toolkits for building and evaluating collaborative agents that can work together with humans. ☆72 Updated 2 weeks ago
- A benchmark list for evaluation of large language models. ☆99 Updated last month
- Interactive coding assistant for data scientists and machine learning developers, empowered by large language models. ☆92 Updated 6 months ago
- Official implementation of the paper "Process Reward Model with Q-value Rankings" ☆56 Updated 2 months ago
- ☆196 Updated 2 months ago
- DSBench: How Far are Data Science Agents from Becoming Data Science Experts? ☆50 Updated 2 months ago
- "Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents" ☆68 Updated 2 weeks ago
- [ACL 2024] Exploring Collaboration Mechanisms for LLM Agents: A Social Psychology View ☆115 Updated 11 months ago
- Augmented LLM with self-reflection ☆119 Updated last year
- Systematic evaluation framework that automatically rates overthinking behavior in large language models. ☆86 Updated 2 weeks ago
- ☆15 Updated last month
- ☆46 Updated 2 months ago
- Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers" ☆103 Updated last year
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference) ☆135 Updated 5 months ago
- Dialogue Action Tokens: Steering Language Models in Goal-Directed Dialogue with a Multi-Turn Planner ☆22 Updated 9 months ago