empirical-run / empirical
Test and evaluate LLMs and model configurations, across all the scenarios that matter for your application
☆154Updated 5 months ago
Alternatives and similar repositories for empirical:
Users that are interested in empirical are comparing it to the libraries listed below
- Run Open Source/Open Weight LLMs locally with OpenAI compatible APIs☆88Updated last week
- ActBot is a prototype for an injectable chatbot to give any website agentic capabilities☆59Updated 8 months ago
- A curated list of open source repositories for AI Engineers☆100Updated this week
- Python SDK for running evaluations on LLM generated responses☆266Updated this week
- Comprehensive Vector Data Tooling. The universal interface for all vector database, datasets and RAG platforms. Easily export, import, ba…☆225Updated this week
- Your automated SWE fleet to get your tickets from the Backlog to Prod!☆95Updated 9 months ago
- A sample Personal Finance Management app built on the Account Aggregator framework.☆64Updated 2 years ago
- visually integration test your backend☆145Updated 6 months ago
- Personal memory for AI☆55Updated 8 months ago
- Fluid Database☆114Updated 4 months ago
- Using various instructor clients evaluating the quality and capabilities of extractions and reasoning.☆48Updated 4 months ago
- Open-source Company Database: For Data-driven Deal Sourcing☆25Updated 8 months ago
- Superpipe - optimized LLM pipelines for structured data☆108Updated 7 months ago
- LLM-ready data connectors☆70Updated 8 months ago
- A framework for LLM's that works as a GPS to reduce hallucinations in production [WIP] - The Linux Kernel for Agents☆44Updated last week
- Foyle is a copilot to help developers deploy and operate their applications.☆121Updated last week
- Prompt engineering, automated.☆281Updated 2 months ago
- LLM Evals for Text Summarization and RAG use-cases.☆35Updated last year
- ☆194Updated 9 months ago
- Push platform for realtime and bidirectional communication between clients and servers☆307Updated last month
- Open source NLA. Hosted service in private beta, signup now! Alternative to Zapier NLA☆30Updated last year
- Logging and caching superpowers for the openai sdk☆102Updated 11 months ago
- ☆33Updated 10 months ago
- mahilo: Multi-Agent Human-in-the-Loop Framework is a flexible framework for creating multi-agent systems that can each interact with huma…☆127Updated last week
- Text to Python Objects via a LLM Function Call☆56Updated 10 months ago
- Repository for fine-tuning gemma models using unsloth for indic languages☆85Updated 10 months ago
- ☆29Updated 3 months ago
- Simple AI coder that can do most of my work for me, including working on himself.☆232Updated this week
- Work with web-enabled agents quickly — whether running a quick task or bootstrapping a full-stack product.☆92Updated 3 months ago
- The open source AI app collection☆175Updated 11 months ago