The-LLM-Data-Company / rubric
A Python library for LLM-based evaluation using weighted rubrics.
☆45 Updated this week
Alternatives and similar repositories for rubric
Users who are interested in rubric are comparing it to the libraries listed below.
- Using LLMs to transpile from Coq to Lean (public version, may be out of date) ☆19 Updated last month
- ☆24 Updated last week
- Cloudstate is a JavaScript database runtime. ☆207 Updated 7 months ago
- Prompt engineering, automated. ☆352 Updated 9 months ago
- 🐍 Sublingual helps you log and analyze all of your LLM calls, including the prompt template, call parameters, responses, tool calls, and… ☆52 Updated 10 months ago
- A fully customizable and self-hosted sandboxing solution for AI agent code execution and computer use. It features out-of-the-box support… ☆751 Updated 7 months ago
- An operator for streaming Kubernetes resource metadata, logs, events, and network traffic telemetry over mTLS to Kestrel Cloud. ☆30 Updated last month
- A new chunking strategy developed by ZeroEntropy for general semantic chunking using Llama-70B. ☆249 Updated 11 months ago
- The fastest, lightest, and easiest-to-integrate AI gateway on the market. Fully open-sourced. ☆504 Updated 2 months ago
- Production-Ready MCP Server Framework • Build, deploy & scale secure AI agent infrastructure • Includes Auth, Observability, Debugger, Te… ☆807 Updated this week
- Data-Driven Evaluation for LLM-Powered Applications ☆515 Updated last year
- Postman for MCP servers ☆124 Updated 5 months ago
- ☆53 Updated this week
- An MCP server that autonomously evaluates web applications. ☆1,235 Updated last week
- A cache for AI agents to learn and replay complex behaviors. ☆756 Updated 7 months ago
- vscode extension to convert computationally intensive pytorch kernels to triton ☆21 Updated last year
- Ship billing in minutes, not weeks ☆27 Updated 4 months ago
- Ultrafast serverless GPU inference, sandboxes, and background jobs ☆1,541 Updated last week
- The best way to create, deploy, and share MCP Servers ☆790 Updated this week
- Optimize prompts, code, and more with AI-powered Reflective Text Evolution ☆2,167 Updated this week
- Laminar - open-source observability platform purpose-built for AI agents. YC S24. ☆2,539 Updated last week
- 🚀 Easy, open-source LLM finetuning with one-line commands, seamless cloud integration, and popular optimization frameworks. ✨ ☆95 Updated last year
- Open source solutions for SOC2, GDPR, and ISO27001 ☆934 Updated this week
- Fine-tuning and serving LLMs on any cloud ☆90 Updated 2 years ago
- Browsers-as-a-service for automations and web agents ☆616 Updated this week
- Python SDK for running evaluations on LLM generated responses ☆295 Updated 7 months ago
- AutoEvals is a tool for quickly and easily evaluating AI model outputs using best practices. ☆783 Updated last week
- A coding agent and general agent harness for building and orchestrating agentic applications. ☆576 Updated this week
- Multi-language code navigation API in a container ☆99 Updated 5 months ago
- Tzafon-WayPoint is a robust, scalable solution for managing large fleets of browser instances. WayPoint stands out with unmatched cold‑st… ☆82 Updated 9 months ago