empirical-run / empirical
Test and evaluate LLMs and model configurations, across all the scenarios that matter for your application
☆156Updated 7 months ago
Alternatives and similar repositories for empirical:
Users that are interested in empirical are comparing it to the libraries listed below
- ActBot is a prototype for an injectable chatbot to give any website agentic capabilities☆58Updated 10 months ago
- Run Open Source/Open Weight LLMs locally with OpenAI compatible APIs☆97Updated 3 weeks ago
- Comprehensive Vector Data Tooling. The universal interface for all vector database, datasets and RAG platforms. Easily export, import, ba…☆234Updated last week
- Using various instructor clients evaluating the quality and capabilities of extractions and reasoning.☆50Updated 6 months ago
- rerank library for easy reranking of results☆42Updated 7 months ago
- ☆194Updated 11 months ago
- The open source AI app collection☆177Updated last year
- Repository for fine-tuning gemma models using unsloth for indic languages☆89Updated last year
- LLM-ready data connectors☆75Updated 10 months ago
- A sample Personal Finance Management app built on the Account Aggregator framework.☆64Updated 3 years ago
- AutoGPT for Web App Development☆141Updated last year
- Open source subtitling platform 💻 for transcribing and translating videos/audios in Indic languages.☆88Updated this week
- Hitchcock a multi-agent movie maker, powered by mahilo☆65Updated last month
- ☆99Updated 2 months ago
- Multi-language code navigation API in a container☆74Updated 3 weeks ago
- npm like package ecosystem for Prompts 🤖☆49Updated 2 months ago
- vscode extension to convert computationally intensive pytorch kernels to triton☆22Updated 6 months ago
- GPT-powered bot that can automate complex online tasks using both the web browser and API calls.☆169Updated 2 years ago
- A curated list of open source repositories for AI Engineers☆110Updated 3 weeks ago
- Deep Research for your internal data☆305Updated this week
- Your automated SWE fleet to get your tickets from the Backlog to Prod!☆96Updated 11 months ago
- LLM fine-tuning and eval☆346Updated last year
- Cedana: Access and run on compute anywhere in the world, on any provider. Migrate seamlessly between providers, arbitraging price/perform…☆58Updated 2 weeks ago
- Logging and caching superpowers for the openai sdk☆104Updated last year
- Sister project to OpenLLMetry, but in Typescript. Open-source observability for your LLM application, based on OpenTelemetry☆311Updated 3 weeks ago
- This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busine…☆149Updated 6 months ago
- Repository of the code base for KT Generation process that we worked at Google Cloud and Searce GenAI Hackathon.☆74Updated last year
- Work with web-enabled agents quickly — whether running a quick task or bootstrapping a full-stack product.☆93Updated 5 months ago
- Annoucing Instructor Cloud☆34Updated 8 months ago
- The Identity layer for the agentic world☆182Updated last week