TrainLoop / evals
Open Source Data Collection and Evaluation Framework
☆61 · Updated 5 months ago
Alternatives and similar repositories for evals
Users who are interested in evals are comparing it to the libraries listed below.
- Recipes for AI agents that use Asteroid to be safe and reliable. Want yours featured? Submit a PR! ☆49 · Updated 8 months ago
- ☆58 · Updated 9 months ago
- AI Agents for Enterprise Software Automation ☆44 · Updated 11 months ago
- A1Base NextJS template ☆66 · Updated 2 weeks ago
- A new chunking strategy developed by ZeroEntropy for general semantic chunking using Llama-70B. ☆248 · Updated 11 months ago
- 🐍 Sublingual helps you log and analyze all of your LLM calls, including the prompt template, call parameters, responses, tool calls, and… ☆52 · Updated 10 months ago
- Cloudstate is a JavaScript database runtime. ☆205 · Updated 6 months ago
- Postman for MCP servers ☆123 · Updated 5 months ago
- Prompt engineering, automated. ☆350 · Updated 8 months ago
- ☆139 · Updated 10 months ago
- Lilac is an open-source tool that ensures your data scientists always have enough gpus for their work. We seamlessly connect compute from… ☆116 · Updated 4 months ago
- ☆52 · Updated last week
- The open source AI app collection ☆184 · Updated last year
- Browsers-as-a-service for automations and web agents ☆585 · Updated this week
- Open source implementation of Poke ☆421 · Updated 3 months ago
- CLI Tool for converting pydantic models into typescript definitions ☆36 · Updated last year
- Deploy Astro.js to freestyle.sh with cloudstate javascript object persistence. ☆48 · Updated 10 months ago
- An operator for streaming Kubernetes resource metadata, logs, events, and network traffic telemetry over mTLS to Kestrel Cloud. ☆29 · Updated 3 weeks ago
- Model Manager is a Python package that simplifies the process of deploying an open source AI model to your own cloud. ☆335 · Updated last year
- Ship billing in minutes, not weeks ☆24 · Updated 3 months ago
- Robot communication and coordination network. ☆70 · Updated 7 months ago
- Fine-tuning and serving LLMs on any cloud ☆90 · Updated 2 years ago
- Spongecake is the easiest way to launch computer use agents. ☆162 · Updated 8 months ago
- ☆78 · Updated 2 years ago
- vscode extension to convert computationally intensive pytorch kernels to triton ☆21 · Updated last year
- Airtop SDK for Node.js ☆16 · Updated 2 months ago
- Serverless Posttraining ☆67 · Updated this week
- A Python library for LLM-based evaluation using weighted rubrics. ☆43 · Updated last week
- A fully customizable and self-hosted sandboxing solution for AI agent code execution and computer use. It features out-of-the-box support… ☆739 · Updated 7 months ago
- Orchestrate zero-shot computer vision models ☆392 · Updated last year