empirical-run / empirical
Test and evaluate LLMs and model configurations, across all the scenarios that matter for your application
☆167 Updated last year
Alternatives and similar repositories for empirical
Users who are interested in empirical are comparing it to the libraries listed below.
- Run Open Source/Open Weight LLMs locally with OpenAI compatible APIs ☆123 Updated last week
- Comprehensive Vector Data Tooling. The universal interface for all vector database, datasets and RAG platforms. Easily export, import, ba… ☆264 Updated last week
- Foyle is a copilot to help developers deploy and operate their applications. ☆132 Updated 8 months ago
- ActBot is a prototype for an injectable chatbot to give any website agentic capabilities ☆57 Updated last year
- ☆173 Updated last week
- Prompt engineering, automated. ☆349 Updated 7 months ago
- Python SDK for running evaluations on LLM generated responses ☆293 Updated 6 months ago
- Work with web-enabled agents quickly, whether running a quick task or bootstrapping a full-stack product. ☆93 Updated last year
- Your automated SWE fleet to get your tickets from the Backlog to Prod! ☆98 Updated last year
- Query language for blending SQL and LLMs across structured + unstructured data, with type constraints. ☆121 Updated this week
- rerank library for easy reranking of results ☆53 Updated last year
- A collection of LLM services you can self host via docker or modal labs to support your applications development ☆197 Updated last year
- Fluid Database ☆113 Updated last year
- ☆198 Updated last year
- GPT-based Conversation Summarizer ☆151 Updated 2 years ago
- Use the OpenAI Batch tool to make async batch requests to the OpenAI API. ☆101 Updated last year
- This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busine… ☆154 Updated last year
- Using various instructor clients evaluating the quality and capabilities of extractions and reasoning. ☆52 Updated last year
- ☆84 Updated last year
- Memory library for building stateful agents ☆252 Updated last week
- Multi-language code navigation API in a container ☆95 Updated 4 months ago
- Superpipe - optimized LLM pipelines for structured data ☆108 Updated last year
- Fully typed & consistent chat APIs for OpenAI, Anthropic, Groq, and Azure's chat models for browser, edge, and node environments. ☆169 Updated last year
- ☆169 Updated last year
- A curated list of open source repositories for AI Engineers ☆123 Updated 8 months ago
- vscode extension to convert computationally intensive pytorch kernels to triton ☆22 Updated last year
- Replace expensive LLM calls with finetunes automatically ☆66 Updated last year
- A curated list of amazingly awesome Modal applications, demos, and shiny things. Inspired by awesome-php. ☆166 Updated last month
- A lightweight evaluation suite tailored specifically for assessing Indic LLMs across a diverse range of tasks ☆38 Updated last year
- Curated collection of AI dev tools from YC companies, aiming to serve as a reliable starting point for LLM/ML developers ☆190 Updated 2 years ago