doslim / Evaluate-the-Opinion-Leadership-of-LLMsLinks
Evaluate the Opinion Leadership of LLMs in the Werewolf Game
☆9Updated 10 months ago
Alternatives and similar repositories for Evaluate-the-Opinion-Leadership-of-LLMs
Users that are interested in Evaluate-the-Opinion-Leadership-of-LLMs are comparing it to the libraries listed below
Sorting:
- A lightweight script for processing HTML page to markdown format with support for code blocks☆79Updated last year
- ☆91Updated last year
- ☆130Updated last year
- Evaluation for AI apps and agent☆43Updated last year
- ☆94Updated 7 months ago
- ☆64Updated last year
- Computer Agent Arena: Test & compare AI agents in real desktop apps & web environments. Code/data coming soon!☆46Updated 3 months ago
- SkillWeaver is a framework to enable web agent self-improvement through environment exploration and skill synthesis.☆90Updated 3 months ago
- [ACL 2025] AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant☆38Updated 7 months ago
- connecting humans and agents☆86Updated 7 months ago
- [ICLR 2025] A trinity of environments, tools, and benchmarks for general virtual agents☆212Updated last month
- Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"☆79Updated 2 months ago
- LLM based agents with proactive interactions, long-term memory, external tool integration, and local deployment capabilities.☆104Updated last week
- Training Turn-by-Turn Verifiers for Dialogue Tutoring Agents: The Curious Case of LLMs as Your Coding Tutors☆83Updated last month
- Unleashing the Power of Cognitive Dynamics on Large Language Models☆62Updated 10 months ago
- ☆47Updated last month
- Deep Reasoning Translation (DRT) Project☆227Updated 2 months ago
- ☆36Updated 7 months ago
- 最简易的R1结果在小模型上的复现,阐述类O1与DeepSeek R1最重要的本质。Think is all your need。利用实验佐证,对于强推理能力,think思考过程性内容是AGI/ASI的核心。☆45Updated 5 months ago
- 如需体验textin文档解析,请点击https://cc.co/16YSIy☆22Updated last year
- kimi-chat 测试数据☆7Updated last year
- ☆83Updated last year
- ☆35Updated 2 years ago
- ☆14Updated 11 months ago
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆69Updated 2 years ago
- My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models"☆97Updated last year
- The next generation of Multi-Modal Multi-Agent platform.☆100Updated 2 months ago
- ☆286Updated last month
- Designing Multi-Agent Systems with Zero Supervision☆86Updated 2 weeks ago
- open-o1: Using GPT-4o with CoT to Create o1-like Reasoning Chains☆116Updated 6 months ago