doslim / Evaluate-the-Opinion-Leadership-of-LLMs
Evaluate the Opinion Leadership of LLMs in the Werewolf Game
☆9Updated 7 months ago
Alternatives and similar repositories for Evaluate-the-Opinion-Leadership-of-LLMs:
Users that are interested in Evaluate-the-Opinion-Leadership-of-LLMs are comparing it to the libraries listed below
- A lightweight script for processing HTML page to markdown format with support for code blocks☆79Updated last year
- ☆32Updated last year
- ☆51Updated 9 months ago
- Evaluation for AI apps and agent☆40Updated last year
- ☆13Updated 8 months ago
- ☆47Updated 4 months ago
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning☆34Updated last year
- ☆91Updated last year
- Unleashing the Power of Cognitive Dynamics on Large Language Models☆61Updated 7 months ago
- ☆94Updated 4 months ago
- ☆37Updated 2 years ago
- Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"☆64Updated 2 months ago
- Reformatted Alignment☆115Updated 7 months ago
- Computer Agent Arena: Test & compare AI agents in real desktop apps & web environments. Code/data coming soon!☆40Updated 2 weeks ago
- ☆101Updated 4 months ago
- [NeurIPS 2024] OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI☆100Updated last month
- ☆56Updated 5 months ago
- 最简易的R1结果在小模型上的复现,阐述类O1与DeepSeek R1最重要的本质。Think is all your need。利用实验佐证,对于强推理能力,think思考过程性内容是AGI/ASI的核心。☆44Updated 2 months ago
- ☆36Updated 4 months ago
- ☆31Updated last year
- Reasoning by Communicating with Agents☆26Updated 6 months ago
- ☆33Updated 4 months ago
- This is the code repo for our paper "Benchmarking Retrieval-Augmented Generation in Multi-Modal Contexts".☆31Updated last month
- ☆24Updated 7 months ago
- ☆49Updated last year
- Repo for for paper "AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction".☆64Updated 9 months ago
- ☆36Updated 7 months ago
- This repo aims to record resource of role-playing abilities in LLMs, including dataset, paper, application, etc.☆115Updated 7 months ago
- Code and data for CoachLM, an automatic instruction revision approach LLM instruction tuning.☆61Updated last year
- My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models"☆98Updated last year