A benchmark for conversational bargaining by language models. In each 20‑round match one LLM plays buyer, one plays seller, and both hold a hidden private value. Every round they swap a short public message, then post a bid or ask; a deal clears whenever the bid meets the ask.
☆44Jun 23, 2026Updated last week
Alternatives and similar repositories for pact
Users that are interested in pact are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Multi-Agent Step Race Benchmark: Assessing LLM Collaboration and Deception Under Pressure. A multi-player “step-race” that challenges LLM…☆88Dec 9, 2025Updated 6 months ago
- Estimate the number of legal chess positions☆14Jan 14, 2021Updated 5 years ago
- Benchmark that evaluates LLMs using 759 NYT Connections puzzles extended with extra trick words☆228Updated this week
- Benchmark evaluating LLMs on their ability to create and resist disinformation. Includes comprehensive testing across major models (Claud…☆33Mar 20, 2025Updated last year
- [ICLR26] Unlearning Isn't Invisible: Detecting Unlearning Traces in LLMs from Model Outputs☆24Apr 8, 2026Updated 2 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Python SDK for healthcare AI: connect models to live EHR systems, skip the integration tax 💫 🏥☆210Jun 23, 2026Updated last week
- ☆15Sep 21, 2025Updated 9 months ago
- FormulaOne: A dataset of algorithmic problems based on MSO formulas.☆25Mar 1, 2026Updated 4 months ago
- Code for Dissecting Generation Modes for Abstractive Summarization Models via Ablation and Attribution (ACL2021)☆13Jun 2, 2021Updated 5 years ago
- The official baseline implementations for Chronocept☆10Mar 31, 2026Updated 3 months ago
- 🏋️A simple and self-hostable workout tracking web app build with Flask, SQLite, and Docker.☆32May 26, 2025Updated last year
- Learn new things using RSS.☆28Sep 11, 2025Updated 9 months ago
- ☆11Mar 19, 2023Updated 3 years ago
- marimo + pixi starter template☆18Jan 31, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A Github Action that automates creation of llms.txt☆16Aug 7, 2025Updated 10 months ago
- An easier way to use CloudFormation on AWS☆11Sep 14, 2020Updated 5 years ago
- ☆13Sep 11, 2024Updated last year
- ☆32Apr 6, 2026Updated 2 months ago
- An Obsidian plugin that displays changelogs of the entire vault and individual files in the sidebar by utilizing Git commit history☆25May 15, 2026Updated last month
- ☆28Jun 25, 2026Updated last week
- A manga, comic and book reader for desktop and mobile.☆33Apr 3, 2026Updated 3 months ago
- We feature a project or marimo notebook from the community every Thursday!☆59Jul 25, 2025Updated 11 months ago
- ☆222Jun 25, 2026Updated last week
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Kernel-Enforced Install-Time Policies (KEIP): An eBPF/LSM based security tool that detects and blocks malicious network activity during p…☆53Jun 1, 2026Updated last month
- Typed python equivalent for R pipes.☆14Oct 16, 2022Updated 3 years ago
- These examples demonstrate how to use the Cloudflare API within interactive Python notebooks.☆25Jun 3, 2026Updated last month
- The Dataset and Official Implementation for <The ELCo Dataset: Bridging Emoji and Lexical Composition> @ LREC-COLING 2024☆16May 11, 2024Updated 2 years ago
- [Preprint arXiv: 2506.18810 ] ConciseHint: Boosting Efficient Reasoning via Continuous Concise Hints during Generation☆26Oct 1, 2025Updated 9 months ago
- A framework bridging cognitive science and LLM reasoning research to diagnose and improve how large language models reason, based on anal…☆43Nov 26, 2025Updated 7 months ago
- Waffer-thin FlaskGPT on Vercel.☆12Jun 1, 2023Updated 3 years ago
- Rust bindings for CTranslate2☆14Jun 21, 2023Updated 3 years ago
- ☆12Nov 11, 2019Updated 6 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- 🔵 D2: Declarative Diagramming in Python via AnyWidget☆22Jun 23, 2026Updated last week
- Script to handle amazon reinvent 2021 personal schedule☆11Oct 11, 2022Updated 3 years ago
- ☆26Jan 15, 2026Updated 5 months ago
- end-to-end dialog system dataset☆13Sep 15, 2019Updated 6 years ago
- A GitHub Action to trigger an Antithesis test suite.☆13Jun 22, 2026Updated last week
- The missing link between your ears and your eyes. Seamlessly sync your reading progress between Audiobookshelf (Audiobooks) and KOReader…☆64Jan 6, 2026Updated 5 months ago
- CD4AutoML: Continuous Delivery for AutoML with Amazon SageMaker Autopilot and Amazon Step Functions☆13Dec 12, 2020Updated 5 years ago