data-prep-kit / data-prep-kit
Open source project for data preparation for LLM application builders
☆624 Updated this week
Alternatives and similar repositories for data-prep-kit:
Users who are interested in data-prep-kit are comparing it to the libraries listed below.
- Granite Snack Cookbook -- easily consumable recipes (python notebooks) that showcase the capabilities of the Granite models ☆176 Updated last week
- ☆257 Updated 4 months ago
- Generate large synthetic data using an LLM ☆410 Updated this week
- Discover, run, and compose AI agents from any framework ☆448 Updated this week
- Prompt Declaration Language (PDL) is a declarative prompt programming language. ☆149 Updated this week
- LLM Comparator is an interactive data visualization tool for evaluating and analyzing LLM responses side-by-side, developed by the PAIR t… ☆415 Updated 2 months ago
- 🦄 Unitxt: a python library for getting data fired up and set for training and evaluation ☆187 Updated this week
- Run the entire bee application stack using docker-compose ☆153 Updated last month
- Running Docling as an API service ☆292 Updated this week
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs ☆277 Updated this week
- awesome synthetic (text) datasets ☆272 Updated 5 months ago
- OpenTelemetry Instrumentation for AI Observability ☆380 Updated this week
- A library for prompt engineering and optimization (SAMMO = Structure-aware Multi-Objective Metaprompt Optimization) ☆667 Updated 4 months ago
- Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard a… ☆1,143 Updated 3 months ago
- LOTUS: A semantic query engine for fast and easy LLM-powered data processing ☆1,155 Updated this week
- Automatic evals for LLMs ☆373 Updated this week
- A Lightweight Library for AI Observability ☆241 Updated 2 months ago
- Build datasets using natural language ☆459 Updated last month
- ☆105 Updated last week
- High quality resources & applications for LLMs, multi-modal models and VectorDBs ☆752 Updated this week
- TAG-Bench: A benchmark for table-augmented generation (TAG) ☆722 Updated 3 weeks ago
- A system for agentic LLM-powered data processing and ETL ☆1,767 Updated this week
- This project showcases an LLMOps pipeline that fine-tunes a small-size LLM model to prepare for the outage of the service LLM. ☆303 Updated 3 weeks ago
- An Awesome list of curated DSPy resources. ☆307 Updated 2 months ago
- Automatically evaluate your LLMs in Google Colab ☆615 Updated 11 months ago
- 🤗 Benchmark Large Language Models Reliably On Your Data ☆240 Updated last week
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task… ☆163 Updated 7 months ago
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models. ☆1,391 Updated 3 weeks ago
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends ☆1,438 Updated last week
- Langtrace 🔍 is an open-source, Open Telemetry based end-to-end observability tool for LLM applications, providing real-time tracing, ev… ☆900 Updated last week