Statistical analysis methods for comparing prompt and model performance in LLM evaluations.
☆104May 13, 2026Updated 3 weeks ago
Alternatives and similar repositories for evalstats
Users that are interested in evalstats are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- VulnGym: A Real-World, Project-Level Vulnerability Benchmark for White-Box Vulnerability-Hunting Agents☆156Updated this week
- Outcome-first plus directional language. A two-layer skill for writing prompts, agent directives, and skill descriptions. Works in Claude…☆108May 21, 2026Updated 2 weeks ago
- ☆11Mar 11, 2026Updated 2 months ago
- Curating Cognitive Behavioral Therapy☆13Dec 21, 2023Updated 2 years ago
- Docker-based robotics development environments with GPU and X11 support☆26Jun 1, 2026Updated last week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆10May 11, 2024Updated 2 years ago
- ☆38Dec 15, 2025Updated 5 months ago
- ☆12May 30, 2025Updated last year
- Optimal TSP in Polynomial Time☆14May 30, 2025Updated last year
- Vibe-codable Bittensor Subnet Template☆26Mar 16, 2026Updated 2 months ago
- Gradient descent algorithms for LQG control☆14Feb 20, 2022Updated 4 years ago
- Cache the return values of your Python functions with a simple decorator.☆11Jan 17, 2017Updated 9 years ago
- Information about the CodedotAI reading group sessions.☆12Aug 16, 2021Updated 4 years ago
- Python scripts for using mindmup JSON as a medium for developing attack trees☆15Aug 25, 2021Updated 4 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆84Mar 3, 2026Updated 3 months ago
- ☆67Apr 9, 2026Updated last month
- set of utilities helping me build and navigate my personal flat-file markdown wiki☆19Dec 11, 2022Updated 3 years ago
- Command-line tool to manage your Google Calendar☆20Aug 16, 2024Updated last year
- ☆27Oct 29, 2021Updated 4 years ago
- A library for managing groups of lambdas.☆10May 21, 2026Updated 2 weeks ago
- Interactive brokers integration for live trading using Rob Carver's pysystem trade backtester.☆10May 15, 2018Updated 8 years ago
- ☆46Mar 31, 2026Updated 2 months ago
- ☆47May 9, 2026Updated 3 weeks ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Carnatic Music Notation rendering engine☆14Nov 24, 2013Updated 12 years ago
- ☆10Sep 7, 2022Updated 3 years ago
- Polkadex Off-chain Orderbook☆11Feb 10, 2022Updated 4 years ago
- Anil's OCaml Claude plugin collection☆32May 24, 2026Updated 2 weeks ago
- Dynamic Telegram Trading Bot☆20Feb 21, 2025Updated last year
- Opinionated Ink Physics☆12Jan 31, 2025Updated last year
- ☆22Aug 14, 2013Updated 12 years ago
- Offload your test computation to ephemeral compute☆153May 27, 2026Updated last week
- An exploration of LLM steering☆26Jun 15, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Lab manual for Psyc 3400 @ Brooklyn College☆17Dec 10, 2020Updated 5 years ago
- Code for reproducing the results from "CrAM: A Compression-Aware Minimizer" accepted at ICLR 2023☆10Mar 1, 2023Updated 3 years ago
- Go library to parse markdown to grab various things☆26Jan 22, 2024Updated 2 years ago
- Latex Ph.D. thesis template for the University of Michigan☆17May 28, 2022Updated 4 years ago
- ☆21May 27, 2026Updated last week
- [ICML 2025] Official Implementation of "Hessian Geometry of Latent Space in Generative Models"☆18Aug 16, 2025Updated 9 months ago
- 🎞 Animate from one string to another.☆12Apr 30, 2022Updated 4 years ago