empirical-run / empiricalLinks

Test and evaluate LLMs and model configurations, across all the scenarios that matter for your application

☆159

Alternatives and similar repositories for empirical

Users that are interested in empirical are comparing it to the libraries listed below

Sorting:

AI-Northstar-Tech / vector-io
Comprehensive Vector Data Tooling. The universal interface for all vector database, datasets and RAG platforms. Easily export, import, ba…
☆252Updated last week
BodhiSearch / BodhiApp
Run Open Source/Open Weight LLMs locally with OpenAI compatible APIs
☆116Updated this week
jlewi / foyle
Foyle is a copilot to help developers deploy and operate their applications.
☆131Updated 4 months ago
athina-ai / athina-evals
Python SDK for running evaluations on LLM generated responses
☆291Updated 2 months ago
omkaark / actbot
ActBot is a prototype for an injectable chatbot to give any website agentic capabilities
☆58Updated last year
sydverma123 / awesome-ai-repositories
A curated list of open source repositories for AI Engineers
☆117Updated 4 months ago
villagecomputing / superpipe
Superpipe - optimized LLM pipelines for structured data
☆108Updated last year
eugeneyan / align-app
☆77Updated 8 months ago
tensorlakeai / rerank-ts
rerank library for easy reranking of results
☆47Updated 10 months ago
OpenArchitectAI / open-architect
Your automated SWE fleet to get your tickets from the Backlog to Prod!
☆98Updated last year
zenbase-ai / core
Prompt engineering, automated.
☆335Updated 3 months ago
vitalops / datatune
Perform transformations on your data with natural language using LLMs
☆86Updated last week
jxnl / blog
☆151Updated this week
TheMind-AI / fluid-db
Fluid Database
☆114Updated 10 months ago
OpenPipe / pii-redaction
Detect and redact PII locally with SOTA performance
☆67Updated 4 months ago
instructor-ai / evals
Using various instructor clients evaluating the quality and capabilities of extractions and reasoning.
☆52Updated 10 months ago
Meesho / BharatMLStack
BharatMLStack is an open-source, end-to-end machine learning infrastructure stack built at Meesho to support real-time and batch ML workl…
☆566Updated this week
567-labs / fastllm
A collection of LLM services you can self host via docker or modal labs to support your applications development
☆192Updated last year
BhabhaAI / dataformer
Solving data for LLMs - Create quality synthetic datasets!
☆150Updated 6 months ago
athina-ai / ariadne
LLM Evals for Text Summarization and RAG use-cases.
☆35Updated last year
hrishioa / mandark
Simple AI coder that can do most of my work for me, including working on himself.
☆248Updated 4 months ago
hdresearch / nolita
Work with web-enabled agents quickly — whether running a quick task or bootstrapping a full-stack product.
☆93Updated 9 months ago
advaitpaliwal / reminisc
Personal memory for AI
☆58Updated last year
villagecomputing / superopenai
Logging and caching superpowers for the openai sdk
☆105Updated last year
sidhq / YC-alum-ai-tools
Curated collection of AI dev tools from YC companies, aiming to serve as a reliable starting point for LLM/ML developers
☆186Updated last year
voxos-ai / bolna
End-to-end platform for building voice first multimodal agents
☆421Updated 9 months ago
cohere-ai / quick-start-connectors
This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busine…
☆151Updated 9 months ago
modal-labs / awesome-modal
A curated list of amazingly awesome Modal applications, demos, and shiny things. Inspired by awesome-php.
☆150Updated last month
reworkd / bananalyzer
Open source AI Agent evaluation framework for web tasks 🐒🍌
☆304Updated 7 months ago
jxnl / n-levels-of-rag
☆195Updated last year