TrainLoop / evalsLinks
Open Source Data Collection and Evaluation Framework
☆61Updated 6 months ago
Alternatives and similar repositories for evals
Users that are interested in evals are comparing it to the libraries listed below
Sorting:
- ☆58Updated 10 months ago
- Recipes for AI agents that use Asteroid to be safe and reliable. Want yours featured? Submit a PR!☆49Updated 9 months ago
- AI Agents for Enterprise Software Automation☆44Updated last year
- A1Base NextJS template☆66Updated last month
- Taskiq plugin for postgres broker and results backend☆52Updated last month
- A new chunking strategy developed by ZeroEntropy for general semantic chunking using Llama-70B.☆249Updated last year
- Cloudstate is a JavaScript database runtime.☆207Updated 7 months ago
- 🐍 Sublingual helps you log and analyze all of your LLM calls, including the prompt template, call parameters, responses, tool calls, and…☆52Updated 10 months ago
- ☆53Updated this week
- Browsers-as-a-service for automations and web agents☆616Updated this week
- Postman for MCP servers☆124Updated 5 months ago
- Prompt engineering, automated.☆352Updated 9 months ago
- Robot communication and coordination network.☆70Updated 3 weeks ago
- vscode extension to convert computationally intensive pytorch kernels to triton☆21Updated last year
- superglue (YC W25) builds integrations and tools from natural language. Get production-grade tools for long tail and enterprise systems.☆1,974Updated this week
- ☆139Updated 11 months ago
- Serverless Posttraining☆68Updated this week
- A Python library for LLM-based evaluation using weighted rubrics.☆45Updated this week
- Model Manager is a Python package that simplifies the process of deploying an open source AI model to your own cloud.☆335Updated last year
- ☆78Updated 2 years ago
- Sister project to OpenLLMetry, but in Typescript. Open-source observability for your LLM application, based on OpenTelemetry☆381Updated last week
- Ship billing in minutes, not weeks☆27Updated 4 months ago
- Open Source Auth Built on Freestyle: own your auth + data https://docs.freestyle.dev/guides/authentication/☆23Updated last year
- Airtop SDK for Node.js☆16Updated 2 months ago
- Data-Driven Evaluation for LLM-Powered Applications☆515Updated last year
- CLI Tool for converting pydantic models into typescript definitions☆36Updated last year
- 🚀 Easy, open-source LLM finetuning with one-line commands, seamless cloud integration, and popular optimization frameworks. ✨☆95Updated last year
- ☆1,290Updated 5 months ago
- Fine-tuning and serving LLMs on any cloud☆90Updated 2 years ago
- The fastest, lightest, and easiest-to-integrate AI gateway on the market. Fully open-sourced.☆504Updated 2 months ago