Statistical analysis methods for comparing prompt and model performance in LLM evaluations.
☆106Jun 20, 2026Updated last week
Alternatives and similar repositories for evalstats
Users that are interested in evalstats are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- VulnGym: A Real-World, Project-Level Vulnerability Benchmark for White-Box Vulnerability-Hunting Agents☆177Jun 18, 2026Updated last week
- Outcome-first plus directional language. A two-layer skill for writing prompts, agent directives, and skill descriptions. Works in Claude…☆111May 21, 2026Updated last month
- ☆11Mar 11, 2026Updated 3 months ago
- Curating Cognitive Behavioral Therapy☆13Dec 21, 2023Updated 2 years ago
- Docker-based robotics development environments with GPU and X11 support☆28Jun 1, 2026Updated 3 weeks ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆10May 11, 2024Updated 2 years ago
- ☆41Dec 15, 2025Updated 6 months ago
- ☆12May 30, 2025Updated last year
- Optimal TSP in Polynomial Time☆15May 30, 2025Updated last year
- Vibe-codable Bittensor Subnet Template☆26Mar 16, 2026Updated 3 months ago
- Gradient descent algorithms for LQG control☆14Feb 20, 2022Updated 4 years ago
- Cache the return values of your Python functions with a simple decorator.☆11Jan 17, 2017Updated 9 years ago
- Information about the CodedotAI reading group sessions.☆12Aug 16, 2021Updated 4 years ago
- Python scripts for using mindmup JSON as a medium for developing attack trees☆15Aug 25, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆88Mar 3, 2026Updated 3 months ago
- ☆71Apr 9, 2026Updated 2 months ago
- set of utilities helping me build and navigate my personal flat-file markdown wiki☆19Dec 11, 2022Updated 3 years ago
- Command-line tool to manage your Google Calendar