empirical-run / empiricalLinks
Test and evaluate LLMs and model configurations, across all the scenarios that matter for your application
☆163Updated last year
Alternatives and similar repositories for empirical
Users that are interested in empirical are comparing it to the libraries listed below
Sorting:
- Run Open Source/Open Weight LLMs locally with OpenAI compatible APIs☆121Updated 2 weeks ago
- Comprehensive Vector Data Tooling. The universal interface for all vector database, datasets and RAG platforms. Easily export, import, ba…☆263Updated last week
- Using various instructor clients evaluating the quality and capabilities of extractions and reasoning.☆52Updated last year
- Foyle is a copilot to help developers deploy and operate their applications.☆133Updated 7 months ago
- This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busine…☆151Updated last year
- ActBot is a prototype for an injectable chatbot to give any website agentic capabilities☆58Updated last year
- Prompt engineering, automated.☆346Updated 6 months ago
- Superpipe - optimized LLM pipelines for structured data☆108Updated last year
- Work with web-enabled agents quickly — whether running a quick task or bootstrapping a full-stack product.☆93Updated 11 months ago
- Python SDK for running evaluations on LLM generated responses☆292Updated 4 months ago
- A collection of LLM services you can self host via docker or modal labs to support your applications development☆195Updated last year
- ☆196Updated last year
- A curated list of open source repositories for AI Engineers☆117Updated 7 months ago
- Detect and redact PII locally with SOTA performance☆79Updated 7 months ago
- rerank library for easy reranking of results☆51Updated last year
- ☆83Updated 11 months ago
- Curated collection of AI dev tools from YC companies, aiming to serve as a reliable starting point for LLM/ML developers☆186Updated 2 years ago
- LLM-ready data connectors☆95Updated last year
- Logging and caching superpowers for the openai sdk☆104Updated last year
- Multi-language code navigation API in a container☆93Updated 2 months ago
- Your automated SWE fleet to get your tickets from the Backlog to Prod!☆97Updated last year
- ☆11Updated 8 months ago
- LLM Evals for Text Summarization and RAG use-cases.☆35Updated last year
- Fluid Database☆113Updated last year
- GPT-based Conversation Summarizer☆149Updated 2 years ago
- ☆172Updated this week
- ☆107Updated 2 years ago
- The Identity layer for the agentic world☆236Updated last week
- A curated list of amazingly awesome Modal applications, demos, and shiny things. Inspired by awesome-php.☆162Updated 2 months ago
- Automatically reformat any JSON into any schema with AI☆335Updated 7 months ago