Using conversational games to evaluate powerful LLMs
☆18Sep 3, 2023Updated 2 years ago
Alternatives and similar repositories for GameEval
Users that are interested in GameEval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ACL2023, Findings] Source codes for the paper "Werewolf Among Us: Multimodal Resources for Modeling Persuasion Behaviors in Social Deduc…☆16Feb 22, 2025Updated last year
- This is the codebase for pre-training, compressing, extending, and distilling LLMs with Megatron-LM.☆12Mar 11, 2024Updated 2 years ago
- [ICASSP 2025 Oral] The official implementation of paper "TextureDiffusion: Target Prompt Disentangled Editing for Various Texture Transfe…☆16Mar 13, 2025Updated last year
- This is the code repo for the paper <UTC-IE: A Unified Token-pair Classification Architecture for Information Extraction>☆15Aug 10, 2023Updated 2 years ago
- Text-based game of lies and deceit, made for language models.☆32Aug 25, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆27Mar 6, 2023Updated 3 years ago
- [AAAI 2025] Assessing the Creativity of LLMs in Proposing Novel Solutions to Mathematical Problems☆13May 5, 2025Updated last year
- [ACL 2024 Findings] Code implementation of Paper "Rethinking Negative Instances for Generative Named Entity Recognition"☆60Mar 20, 2024Updated 2 years ago
- m&ms: A Benchmark to Evaluate Tool-Use for multi-step multi-modal tasks☆47Sep 26, 2024Updated last year
- Game-based AI Platforms☆26Jun 27, 2024Updated last year
- LongAttn :Selecting Long-context Training Data via Token-level Attention☆15Jul 16, 2025Updated 11 months ago
- The code of ACL2022 paper "Conditional Bilingual Mutual Information based Adaptive Training for Neural Machine Translation"..☆14Aug 6, 2022Updated 3 years ago
- Display contacts from the AddressBook database☆11May 4, 2022Updated 4 years ago
- SGLang Kernel Wheel Index☆23Updated this week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [EMNLP 2024] Introducing Filtered Direct Preference Optimization (fDPO) that enhances language model alignment with human preferences by …☆16Nov 27, 2024Updated last year
- Repo for our AKBC-2021 paper: Abg-CoQA: Clarifying Ambiguity in Conversational Question Answering☆11Oct 10, 2021Updated 4 years ago
- ☆40Jun 19, 2024Updated last year
- MultiTool-CoT: GPT-3 Can Use Multiple External Tools with Chain of Thought Prompting☆20Jul 11, 2023Updated 2 years ago
- ☆12Sep 29, 2016Updated 9 years ago
- Micrograd in Rust☆10Nov 14, 2024Updated last year
- Natural Language to Overpass Query Language☆31Mar 14, 2024Updated 2 years ago
- ☆21May 30, 2022Updated 4 years ago
- A PyTorch annotated replication of the paper: https://arxiv.org/abs/2006.10637☆20Nov 20, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- High Performance Sorting Based Distributed memory K-mer counter☆15Dec 8, 2025Updated 6 months ago
- A record of reading list on some MLsys popular topic☆25Mar 20, 2025Updated last year
- ☆21Jun 9, 2025Updated last year
- Some C++/C/CUDA Extension☆16Feb 2, 2022Updated 4 years ago
- These are tools for rgbd SLAM evaluation forked from TUM☆14Feb 1, 2016Updated 10 years ago
- Code for Semantic-Aligned Adversarial Evolution Triangle for High-Transferability Vision-Language Attack(TPAMI 2025)☆42Aug 28, 2025Updated 9 months ago
- A collection of instruction data and scripts for machine translation.☆20Sep 23, 2023Updated 2 years ago
- Implementations of a selection of clustering algorithms for VANETs, written in C++ for OMNeT++☆12Dec 29, 2014Updated 11 years ago
- ☆112Jul 15, 2025Updated 11 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [ECCV2024] Boosting Transferability in Vision-Language Attacks via Diversification along the Intersection Region of Adversarial Trajector…☆31Nov 15, 2025Updated 7 months ago
- Code and Configs for Asynchronous RLHF: Faster and More Efficient RL for Language Models☆68Mar 5, 2026Updated 3 months ago
- ☆62Sep 7, 2021Updated 4 years ago
- This framework is for resource allocation in C-V2X Mode 4☆17Nov 14, 2025Updated 7 months ago
- Simple example wire app☆14Oct 14, 2021Updated 4 years ago
- OS repo for Knowledge Retrieval starter kit☆73Jan 13, 2026Updated 5 months ago
- [NeurIPS'25] Official Implementation of RISE (Reinforcing Reasoning with Self-Verification)☆32Aug 8, 2025Updated 10 months ago