camel-ai / agent-trust
🤝 The code for "Can Large Language Model Agents Simulate Human Trust Behaviors?"
☆64Updated 3 months ago
Alternatives and similar repositories for agent-trust:
Users that are interested in agent-trust are comparing it to the libraries listed below
- [ICML 2024 Oral] A framework for society simulation that supports complex simulation, for example: multi-scene.☆67Updated 7 months ago
- Code and Data for "MIRAI: Evaluating LLM Agents for Event Forecasting"☆60Updated 8 months ago
- Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024)☆60Updated 10 months ago
- Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples☆76Updated last week
- augmented LLM with self reflection☆115Updated last year
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆58Updated 3 weeks ago
- o1 Chain of Thought Examples☆33Updated 5 months ago
- ☆56Updated 3 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆84Updated 4 months ago
- Interpretable Contrastive Monte Carlo Tree Search Reasoning☆45Updated 4 months ago
- Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent."☆46Updated 4 months ago
- Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers"☆101Updated 11 months ago
- [ACL 2024] Exploring Collaboration Mechanisms for LLM Agents: A Social Psychology View☆112Updated 9 months ago
- 🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Pap…☆155Updated 3 months ago
- Framework and toolkits for building and evaluating collaborative agents that can work together with humans.☆68Updated 3 weeks ago
- Interactive coding assistant for data scientists and machine learning developers, empowered by large language models.☆91Updated 5 months ago
- A banchmark list for evaluation of large language models.☆84Updated this week
- [NeurIPS 2024] OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI☆96Updated last week
- ☆101Updated last month
- official implementation of paper "Process Reward Model with Q-value Rankings"☆49Updated last month
- Code for paper "Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System"☆52Updated 3 months ago
- Scalable Meta-Evaluation of LLMs as Evaluators☆42Updated last year
- Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" pr…☆92Updated last year
- ☆41Updated 4 months ago
- ☆16Updated 5 months ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆54Updated last year
- What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective☆63Updated last week
- [ICML 2024] Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast☆97Updated 11 months ago