camel-ai / agent-trust
π€ The code for "Can Large Language Model Agents Simulate Human Trust Behaviors?"
β70Updated 4 months ago
Alternatives and similar repositories for agent-trust:
Users that are interested in agent-trust are comparing it to the libraries listed below
- [ICML 2024 Oral] A framework for society simulation that supports complex simulation, for example: multi-scene.β69Updated 8 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"β86Updated 5 months ago
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasksβ140Updated this week
- Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examplesβ80Updated this week
- β84Updated last month
- Framework and toolkits for building and evaluating collaborative agents that can work together with humans.β70Updated last month
- Advancing Language Model Reasoning through Reinforcement Learning and Inference Scalingβ95Updated 2 months ago
- Code and Data for "MIRAI: Evaluating LLM Agents for Event Forecasting"β62Updated 8 months ago
- β41Updated 5 months ago
- Official Implementation of Dynamic LLM-Agent Network: An LLM-agent Collaboration Framework with Agent Team Optimizationβ135Updated 10 months ago
- Sotopia-Ο: Interactive Learning of Socially Intelligent Language Agents (ACL 2024)β61Updated 10 months ago
- [ICLR 2025] Benchmarking Agentic Workflow Generationβ63Updated last month
- [ACL 2024] Exploring Collaboration Mechanisms for LLM Agents: A Social Psychology Viewβ114Updated 10 months ago
- How to create rational LLM-based agents? Using game-theoretic workflows!β61Updated last month
- A platform for developers to simulate collaborative research activitiesβ142Updated this week
- β103Updated 2 months ago
- augmented LLM with self reflectionβ117Updated last year
- π Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Papβ¦β170Updated last week
- β59Updated 3 months ago
- β34Updated 3 months ago
- β19Updated 4 months ago
- β102Updated 3 months ago
- Scalable Meta-Evaluation of LLMs as Evaluatorsβ42Updated last year
- On The Planning Abilities of OpenAI's o1 Models: Feasibility, Optimality, and Generalizabilityβ38Updated 2 months ago
- [NeurIPS 2024 D&B Track] GTA: A Benchmark for General Tool Agentsβ79Updated last month
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Modelsβ¦β33Updated last year
- o1 Chain of Thought Examplesβ33Updated 5 months ago
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]β130Updated 4 months ago
- Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent."β49Updated 4 months ago
- Systematic evaluation framework that automatically rates overthinking behavior in large language models.β82Updated this week