TrainLoop / evalsLinks
Open Source Data Collection and Evaluation Framework
☆54Updated this week
Alternatives and similar repositories for evals
Users that are interested in evals are comparing it to the libraries listed below
Sorting:
- AI Agents for Enterprise Software Automation☆45Updated 5 months ago
- ☆58Updated 3 months ago
- A1Base NextJS template☆62Updated 2 weeks ago
- Recipes for AI agents that use Asteroid to be safe and reliable. Want yours featured? Submit a PR!☆47Updated 2 months ago
- Taskiq plugin for postgres broker and results backend☆45Updated this week
- Codebase and CLI for PLAPT: A state-of-the-art protein-ligand binding affinity model for drug discovery☆99Updated 3 months ago
- A new chunking strategy developed by ZeroEntropy for general semantic chunking using Llama-70B.☆185Updated 5 months ago
- Robot communication and coordination network.☆64Updated last month
- 🐍 Sublingual helps you log and analyze all of your LLM calls, including the prompt template, call parameters, responses, tool calls, and…☆51Updated 3 months ago
- OpenInt is the fastest way to add native product integrations to your app.☆184Updated 2 weeks ago
- LLM Testing SDK that helps you write and run tests to monitor your LLM app in production☆130Updated last year
- Cloudstate is a JavaScript database runtime.☆182Updated 2 months ago
- Smart glasses OS, with dozens of built-in apps. Users get AI assistant, notifications, translation, screen mirror, captions, and more. De…☆561Updated this week
- ☆76Updated last year
- Open source payments + billing infrastructure☆174Updated last week
- ☆134Updated 4 months ago
- 💸 The Map3 Supercharge SDK connects crypto apps to Wallets, Exchanges & Bridges, enabling cross-chain deposits and increasing volumes.☆99Updated 2 years ago
- A customizable, general purpose AI Agent that supports MCP. Talk to Saiki in natural language to control computers, applications and more…☆158Updated this week
- CLI Tool for converting pydantic models into typescript definitions☆35Updated 8 months ago
- The official Python library for the Atla API☆14Updated last week
- Prompt engineering, automated.☆329Updated 2 months ago
- Realtime Postgres data in React.☆55Updated 10 months ago
- Model Manager is a Python package that simplifies the process of deploying an open source AI model to your own cloud.☆324Updated last year
- ☆21Updated last week
- ☆68Updated 7 months ago
- doteval☆20Updated 2 months ago
- Sister project to OpenLLMetry, but in Typescript. Open-source observability for your LLM application, based on OpenTelemetry☆325Updated 2 weeks ago
- A control plane to oversee agents operating in the wild☆4Updated 6 months ago
- Rigorously test, monitor, and optimize agent systems—grounded in research from Stanford and Berkeley AI Labs.☆87Updated this week
- Text analytics for LLM apps. Cluster messages to detect use cases, outliers, power users. Detect intents and run evals with LLM (OpenAI, …☆437Updated last month