doslim / Evaluate-the-Opinion-Leadership-of-LLMs
Evaluate the Opinion Leadership of LLMs in the Werewolf Game
☆9 Updated 7 months ago
Alternatives and similar repositories for Evaluate-the-Opinion-Leadership-of-LLMs:
Users interested in Evaluate-the-Opinion-Leadership-of-LLMs are comparing it to the repositories listed below.
- A lightweight script for processing HTML page to markdown format with support for code blocks ☆79 Updated 11 months ago
- ☆44 Updated 3 months ago
- ☆51 Updated 8 months ago
- ☆94 Updated 3 months ago
- Unleashing the Power of Cognitive Dynamics on Large Language Models ☆60 Updated 6 months ago
- Evaluation for AI apps and agent ☆36 Updated last year
- Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization" ☆59 Updated last month
- Open source code of the paper: "OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain" ☆54 Updated 3 months ago
- ☆101 Updated 3 months ago
- ☆36 Updated 6 months ago
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning ☆34 Updated last year
- ☆24 Updated 6 months ago
- Reformatted Alignment ☆115 Updated 6 months ago
- ☆34 Updated 3 months ago
- From Hours to Minutes: Lossless Acceleration of Ultra Long Sequence Generation ☆83 Updated 2 weeks ago
- Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets" ☆52 Updated 10 months ago
- ☆29 Updated 4 months ago
- Self-Controlled Memory System for LLMs ☆46 Updated 11 months ago
- The Code Repo for Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization ☆107 Updated 7 months ago
- [NeurIPS 2024] OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI ☆97 Updated 3 weeks ago
- ☆37 Updated 10 months ago
- ☆36 Updated 2 years ago
- ☆32 Updated 3 months ago
- kimi-chat test data ☆7 Updated last year
- PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion ☆53 Updated last year
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models… ☆33 Updated last year
- Source code of "Reasons to Reject? Aligning Language Models with Judgments" ☆58 Updated last year
- ☆82 Updated 4 months ago
- ☆49 Updated last year
- The demo, code and data of FollowRAG ☆70 Updated 3 months ago