Official Inspect Implementation for "ImpossibleBench: Measuring LLMs' Propensity of Exploiting Test Cases"
☆40Dec 1, 2025Updated 6 months ago
Alternatives and similar repositories for impossiblebench
Users that are interested in impossiblebench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆47Jul 4, 2025Updated 11 months ago
- ☆13Jul 20, 2023Updated 2 years ago
- This repository contains the source code for "Membership Inference Attacks as Privacy Tools: Reliability, Disparity and Ensemble", In Pro…☆11Jan 2, 2026Updated 5 months ago
- TACL 2025: Investigating Adversarial Trigger Transfer in Large Language Models☆19Aug 17, 2025Updated 9 months ago
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"☆11Oct 11, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- FurNet: A Deep-Learning-Based Framework for Removing Furniture Objects in Room Image☆13Nov 22, 2022Updated 3 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆13Nov 27, 2023Updated 2 years ago
- [NeurIPS'23] Binary Classification with Confidence Difference☆10May 13, 2024Updated 2 years ago
- ☆23Apr 5, 2023Updated 3 years ago
- ☆57Apr 7, 2026Updated 2 months ago
- Python library for building and sharing dataframe-agnostic, sklearn-style transformers and ml models for data science competitions.☆28Mar 10, 2026Updated 3 months ago
- Code for the paper: Proving Theorems Recursively☆12May 23, 2024Updated 2 years ago
- Open source replication of Anthropic's Crosscoders for Model Diffing☆67Oct 27, 2024Updated last year
- Official implementation of Data Contamination Can Cross Language Barriers☆12Sep 11, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- CowXNet: An Automated Cow Estrus Detection System☆10Apr 20, 2023Updated 3 years ago
- exercise for transformers-benchmarks, add 3090 benchmark☆13Feb 3, 2023Updated 3 years ago
- ☆34Nov 7, 2024Updated last year
- Implementation of [MNTDP](https://arxiv.org/abs/2012.12631)☆18Mar 9, 2022Updated 4 years ago
- An implementation of Scalable Evaluation and Improvement of Document Set Expansion via Neural Positive-Unlabeled Learning without AllenNL…☆19Feb 20, 2024Updated 2 years ago
- Project exploring 3D volumetric rendering of NEXRAD radar data.☆13Oct 23, 2023Updated 2 years ago
- A Template Repository for a Swift Package-based Stanford Byers Center for Biodesign Digital Health Project☆18Apr 1, 2026Updated 2 months ago
- naming convention library for CamelCase, snake_case and friends☆11Mar 25, 2023Updated 3 years ago
- Unofficially Implements https://arxiv.org/abs/2112.05682 to get Linear Memory Cost on Attention for PyTorch☆12Jan 16, 2022Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆25Nov 14, 2022Updated 3 years ago
- YASEM - Yet Another Splade|Sparse Embedder - A simple and efficient library for SPLADE embeddings☆13May 22, 2025Updated last year
- A GPU-Accelerated Mahjong Simulator for RL in JAX☆45May 27, 2026Updated 2 weeks ago
- ☆11Apr 17, 2023Updated 3 years ago
- ☆20Dec 19, 2025Updated 5 months ago
- All Auto Latex Equations Versions.☆48Jun 3, 2026Updated last week
- Auditing agents for fine-tuning safety☆21Oct 21, 2025Updated 7 months ago
- The AI that helps you achieve your goals☆11Feb 4, 2024Updated 2 years ago
- ☆40Dec 19, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆109Aug 8, 2024Updated last year
- Blindspots in LLMs I've noticed while AI coding. Sonnet family emphasis.☆13Mar 20, 2025Updated last year
- Fast wavelet transforms on the sphere☆13Dec 20, 2016Updated 9 years ago
- ☆15Jan 21, 2025Updated last year
- Simple (fast) transformer inference in PyTorch with torch.compile + lit-llama code☆10Aug 29, 2023Updated 2 years ago
- Analyzer: CommentMap utilities for static analysis in Go☆12Nov 15, 2024Updated last year
- Accompanying codebase for neuroscope.io, a website for displaying max activating dataset examples for language model neurons☆14Feb 13, 2023Updated 3 years ago