doslim / Evaluate-the-Opinion-Leadership-of-LLMs
Evaluate the Opinion Leadership of LLMs in the Werewolf Game
☆9Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for Evaluate-the-Opinion-Leadership-of-LLMs
- A lightweight script for processing HTML page to markdown format with support for code blocks☆71Updated 6 months ago
- This is a collection of resources for computer-use agents, including videos, blogs, papers, and projects.☆83Updated this week
- Reformatted Alignment☆112Updated last month
- Evaluation for AI apps and agent☆35Updated 9 months ago
- ☆77Updated last month
- ☆82Updated 7 months ago
- ☆27Updated 8 months ago
- Repo for for paper "AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction".☆50Updated 3 months ago
- ☆35Updated last year
- ☆78Updated 6 months ago
- Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets"☆47Updated 5 months ago
- SkyScript-100M: 1,000,000,000 Pairs of Scripts and Shooting Scripts for Short Drama: https://arxiv.org/abs/2408.09333v2☆97Updated 2 months ago
- ☆51Updated 3 months ago
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning☆33Updated last year
- Unleashing the Power of Cognitive Dynamics on Large Language Models☆59Updated last month
- A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.☆74Updated 9 months ago
- Code and data for CoachLM, an automatic instruction revision approach LLM instruction tuning.☆58Updated 7 months ago
- XVERSE-MoE-A36B: A multilingual large language model developed by XVERSE Technology Inc.☆36Updated last month
- kimi-chat 测试数据☆7Updated last year
- Hammer: Robust Function-Calling for On-Device Language Models via Function Masking☆30Updated last month
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆30Updated 9 months ago
- Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"☆190Updated 3 weeks ago
- ☆44Updated last month
- This is the official project of paper: Compress to Impress: Unleashing the Potential of Compressive Memory in Real-World Long-Term Conver…☆15Updated 4 months ago
- ☆34Updated 2 months ago
- ☆48Updated 8 months ago
- [EMNLP 2024] LongRAG: A Dual-perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering☆73Updated this week
- This is the official repository of the paper "OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI"☆85Updated last month
- PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion☆45Updated 8 months ago