data-prep-kit / data-prep-kitLinks
Open source project for data preparation for GenAI applications
☆832Updated last week
Alternatives and similar repositories for data-prep-kit
Users that are interested in data-prep-kit are comparing it to the libraries listed below
Sorting:
- Granite Snack Cookbook -- easily consumable recipes (python notebooks) that showcase the capabilities of the Granite models☆284Updated this week
- Complete pipeline for Training Model Behavior in Agentic Systems☆635Updated this week
- Discover, run, and compose AI agents from any framework.☆803Updated last week
- 👩🏻🍳 A collection of example notebooks using Haystack☆506Updated 3 weeks ago
- Tool for generating high quality Synthetic datasets☆1,306Updated 3 weeks ago
- OpenTelemetry Instrumentation for AI Observability☆675Updated this week
- An open-source tool for LLM prompt optimization.☆666Updated 3 weeks ago
- This package, developed as part of our research detailed in the Chroma Technical Report, provides tools for text chunking and evaluation.…☆433Updated 7 months ago
- UQLM: Uncertainty Quantification for Language Models, is a Python package for UQ-based LLM hallucination detection☆1,056Updated this week
- This repository shares end-to-end notebooks on how to use various Weaviate features and integrations!☆905Updated last week
- Build datasets using natural language☆534Updated last month
- Use LOTUS to process all of your datasets with LLMs and embeddings. Enjoy up to 1000x speedups with fast, accurate query processing, that…☆1,323Updated 2 weeks ago
- TAG-Bench: A benchmark for table-augmented generation (TAG)☆758Updated 6 months ago
- Open protocol for communication between AI agents, applications, and humans.☆878Updated 2 months ago
- LLM Comparator is an interactive data visualization tool for evaluating and analyzing LLM responses side-by-side, developed by the PAIR t…☆490Updated 8 months ago
- ☆264Updated 4 months ago
- Superlinked is a Python framework for AI Engineers building high-performance search & recommendation applications that combine structured…☆1,403Updated last month
- ☆158Updated this week
- Run the entire bee application stack using docker-compose☆153Updated 7 months ago
- A library for prompt engineering and optimization (SAMMO = Structure-aware Multi-Objective Metaprompt Optimization)☆732Updated 4 months ago
- Automated Evaluation of RAG Systems☆664Updated 7 months ago
- An Awesome list of curated DSPy resources.☆461Updated 2 weeks ago
- Ranking LLMs on agentic tasks☆194Updated last month
- A Lightweight Library for AI Observability☆251Updated 8 months ago
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs☆655Updated last week
- A curated list of awesome synthetic data tools (open source and commercial).☆215Updated last year
- CUGA is an open-source generalist agent for the enterprise, supporting complex task execution on web and APIs, OpenAPI/MCP integrations, …☆73Updated last week
- This NVIDIA RAG blueprint serves as a reference solution for a foundational Retrieval Augmented Generation (RAG) pipeline.☆334Updated last week
- 🤗 Benchmark Large Language Models Reliably On Your Data☆406Updated 3 weeks ago
- Prompt Declaration Language (PDL) is a declarative prompt programming language.☆241Updated this week