Using various instructor clients evaluating the quality and capabilities of extractions and reasoning.
☆51Sep 29, 2024Updated last year
Alternatives and similar repositories for evals
Users that are interested in evals are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Annoucing Instructor Cloud☆38Aug 14, 2024Updated last year
- ☆197May 5, 2024Updated last year
- a version of baby agi using dspy and typed predictors☆16Mar 9, 2024Updated 2 years ago
- Website for Applied-LLMs work☆29Jan 13, 2026Updated 3 months ago
- Structured outputs for LLMs☆54Jul 15, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆230Jan 18, 2026Updated 3 months ago
- A webhook that integrates the W&B model registry with Modal Labs☆15Dec 24, 2023Updated 2 years ago
- ☆19Sep 12, 2024Updated last year
- Run evals using LLM☆27Jan 8, 2026Updated 3 months ago
- ☆22Oct 14, 2024Updated last year
- Capture your conversations with transcripts and intelligence☆22Jan 13, 2025Updated last year
- Use sync mode Playwright interactively, inside a Jupyter notebook☆19Updated this week
- converts url content into JSON with a simple prefix☆73May 8, 2024Updated last year
- Using modal.com to process FineWeb-edu data☆20Apr 11, 2026Updated 3 weeks ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆19Dec 4, 2025Updated 4 months ago
- ☆32Mar 1, 2023Updated 3 years ago
- Let Me Speak Freely? A Study on the Impact of Format Restrictions on Performance of Large Language Models☆26May 31, 2025Updated 11 months ago
- A collection of LLM services you can self host via docker or modal labs to support your applications development☆201Apr 29, 2024Updated 2 years ago
- A curated list of amazingly awesome Modal applications, demos, and shiny things. Inspired by awesome-php.☆187Dec 29, 2025Updated 4 months ago
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed☆37Jul 6, 2023Updated 2 years ago
- Deprecated. An example app for an LLM chat in Highlight runtime.☆28Jan 10, 2025Updated last year
- A cog implementation of Nvidia's Triton server☆18Oct 23, 2024Updated last year
- SQL storage for CertMagic/Caddy TLS data.☆20Nov 11, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Creating Generative AI Apps which work☆17Apr 14, 2025Updated last year
- ☆26Dec 6, 2024Updated last year
- ☆10Jul 30, 2024Updated last year
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Aug 12, 2023Updated 2 years ago
- Open-source Human Feedback Library☆11Oct 25, 2023Updated 2 years ago
- Pydantic extension for annotating autocorrecting fields.☆219Jun 20, 2024Updated last year
- LobotoMl is a set of scripts and tools to assess production deployments of ML services☆10May 16, 2022Updated 3 years ago
- Conformance Tests for MCP☆63Apr 24, 2026Updated last week
- a collection of resources around LLMs, aggregated for the workshop "Mastering LLMs: End-to-End Fine-Tuning and Deployment" by Dan Becker …☆110May 31, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- This repo consists of prompting style of different widely used LLMs in the LLM space.☆37Jul 31, 2024Updated last year
- Reproduction of the $80M Rari Finance Hack on April 30 2022 using on-chain fuzzing with Echidna☆14Jun 16, 2024Updated last year
- Python tools☆14Oct 22, 2023Updated 2 years ago
- ☆80Jun 5, 2024Updated last year
- Improved GPT-3.5 Agent (with tools) for GPT-3.5☆20May 1, 2023Updated 3 years ago
- DynamoDB Foreign Data Wrapper for PostgreSQL☆37Feb 10, 2025Updated last year
- ☆15Mar 30, 2025Updated last year