Using conversational games to evaluate powerful LLMs
☆18Sep 3, 2023Updated 2 years ago
Alternatives and similar repositories for GameEval
Users that are interested in GameEval are comparing it to the libraries listed below
Sorting:
- [ACL2023, Findings] Source codes for the paper "Werewolf Among Us: Multimodal Resources for Modeling Persuasion Behaviors in Social Deduc…☆16Feb 22, 2025Updated last year
- Use the tokenizer in parallel to achieve superior acceleration☆20Mar 21, 2024Updated last year
- This is the code repo for the paper <UTC-IE: A Unified Token-pair Classification Architecture for Information Extraction>☆15Aug 10, 2023Updated 2 years ago
- Introducing Filtered Direct Preference Optimization (fDPO) that enhances language model alignment with human preferences by discarding lo…☆16Nov 27, 2024Updated last year
- Source code of paper “A Novel Three-Stage Learning Framework for Low-Resource Knowledge-Grounded Dialogue Generation”☆16Nov 25, 2021Updated 4 years ago
- MultiTool-CoT: GPT-3 Can Use Multiple External Tools with Chain of Thought Prompting☆21Jul 11, 2023Updated 2 years ago
- Shadow Alignment: The Ease of Subverting Safely-Aligned Language Models☆34Oct 19, 2023Updated 2 years ago
- ☆27Mar 6, 2023Updated 3 years ago
- Text-based game of lies and deceit, made for language models.☆32Aug 25, 2023Updated 2 years ago
- Big Data Analysis of Tinder done at Universitat Rovira i Virgili and Universitat Politècnica de Catalunya · BarcelonaTech☆13Jan 3, 2023Updated 3 years ago
- A tool to paste Excel ranges to Reddit☆11Sep 20, 2025Updated 5 months ago
- The first OpenSource Mafia Bot!☆10Oct 5, 2023Updated 2 years ago
- Comparative Study and Implementation of Five Factor Model and Myers-Briggs Type Indicator Model☆11Sep 28, 2023Updated 2 years ago
- Multiprocessing in python☆10Aug 20, 2021Updated 4 years ago
- The code used to power DeepRole☆37Nov 21, 2022Updated 3 years ago
- [ACL 23] CodeIE: Large Code Generation Models are Better Few-Shot Information Extractors☆40Dec 14, 2025Updated 2 months ago
- DNH Werewolf Discord bot☆13Dec 19, 2024Updated last year
- Inspirational post ids collected from Reddit using pushift.io and RoBERTa☆10Jan 18, 2024Updated 2 years ago
- 李鲁鲁老师的 Copilot-Python 学习。和ChatGPT等大语言模型协同进化。☆10Jun 3, 2025Updated 9 months ago
- An implementation of MSSRM method☆11Mar 23, 2023Updated 2 years ago
- Dataset and codes for SEntFiN☆10May 31, 2023Updated 2 years ago
- 小鸡词典🐤的Alfred🎩插件 咯咯咯☆11Apr 19, 2023Updated 2 years ago
- This is code for the EMNLP 2022 Paper "UniRPG: Unified Discrete Reasoning over Table and Text as Program Generation".☆10Apr 30, 2023Updated 2 years ago
- 翻牆的自由博客☆17Feb 27, 2026Updated last week
- MTalk-Bench: Evaluating Speech-to-Speech Models in Multi-Turn Dialogues via Arena-style and Rubrics Protocols☆17Nov 19, 2025Updated 3 months ago
- 记录有用的Git repos☆12Jul 28, 2024Updated last year
- Official Implementation of "Learning to Refuse: Towards Mitigating Privacy Risks in LLMs"☆10Dec 13, 2024Updated last year
- TOD-Flow: Modeling the Structure of Task-Oriented Dialogues☆13Feb 7, 2024Updated 2 years ago
- Code for Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model☆13Feb 15, 2024Updated 2 years ago
- YouTube Assistant☆12May 15, 2023Updated 2 years ago
- Get the best daily repositories☆10Updated this week
- [AAAI 2025] Assessing the Creativity of LLMs in Proposing Novel Solutions to Mathematical Problems☆13May 5, 2025Updated 10 months ago
- I have fineTuned FinBert Model on 4.9k Financial News Headlines, Got 81-82% ACC and it perfrom well in Financial Stock News Sentiment Ana…☆12Mar 4, 2024Updated 2 years ago
- ☆11Oct 11, 2023Updated 2 years ago
- Scripts for KGIRNet model for ESWC☆10Jul 6, 2023Updated 2 years ago
- PyTorch implementation for PaLM: A Hybrid Parser and Language Model.☆10Jan 7, 2020Updated 6 years ago
- distilled Self-Critique refines the outputs of a LLM with only synthetic data☆11Apr 11, 2024Updated last year
- A python tool help to interact with chatgpt.☆10Dec 11, 2022Updated 3 years ago
- Service for Bert model to Vector. 高效的文本转向量(Text-To-Vector)服务,支持GPU多卡、多worker、多客户端调用,开箱即用。☆12May 24, 2022Updated 3 years ago