empirical-run / empiricalLinks
Test and evaluate LLMs and model configurations, across all the scenarios that matter for your application
☆160Updated last year
Alternatives and similar repositories for empirical
Users that are interested in empirical are comparing it to the libraries listed below
Sorting:
- Prompt engineering, automated.☆338Updated 4 months ago
- Comprehensive Vector Data Tooling. The universal interface for all vector database, datasets and RAG platforms. Easily export, import, ba…☆255Updated this week
- A collection of LLM services you can self host via docker or modal labs to support your applications development☆194Updated last year
- Python SDK for running evaluations on LLM generated responses☆292Updated 2 months ago
- Foyle is a copilot to help developers deploy and operate their applications.☆132Updated 5 months ago
- ☆80Updated 9 months ago
- Superpipe - optimized LLM pipelines for structured data☆108Updated last year
- rerank library for easy reranking of results☆48Updated 11 months ago
- ActBot is a prototype for an injectable chatbot to give any website agentic capabilities☆58Updated last year
- ☆196Updated last year
- Use the OpenAI Batch tool to make async batch requests to the OpenAI API.☆99Updated last year
- ☆63Updated last year
- Your automated SWE fleet to get your tickets from the Backlog to Prod!☆98Updated last year
- Using various instructor clients evaluating the quality and capabilities of extractions and reasoning.☆52Updated 10 months ago
- Detect and redact PII locally with SOTA performance☆70Updated 5 months ago
- Fluid Database☆113Updated 11 months ago
- Enforce structured output from LLMs 100% of the time☆250Updated last year
- ⛓️ build cognitive systems, pythonic☆339Updated 9 months ago
- ☆155Updated last week
- A curated list of open source repositories for AI Engineers☆118Updated 5 months ago
- A simple DAG for executing LLM calls and using tools.☆41Updated last year
- Run Open Source/Open Weight LLMs locally with OpenAI compatible APIs☆117Updated this week
- Open source AI Agent evaluation framework for web tasks 🐒🍌☆305Updated 7 months ago
- GPT-based Conversation Summarizer☆148Updated 2 years ago
- ☆107Updated 2 years ago
- Fine-tuning and serving LLMs on any cloud☆90Updated last year
- LLM-ready data connectors☆90Updated last year
- LLM Evals for Text Summarization and RAG use-cases.☆35Updated last year
- Personal memory for AI☆58Updated last year
- This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busine…☆151Updated 10 months ago