data-prep-kit / data-prep-kit

Open source project for data preparation for GenAI applications

☆729

Alternatives and similar repositories for data-prep-kit

Users who are interested in data-prep-kit compare it to the libraries listed below.

ibm-granite-community / granite-snack-cookbook
Granite Snack Cookbook -- easily consumable recipes (python notebooks) that showcase the capabilities of the Granite models
☆213 Updated last week
i-am-bee / beeai-platform
Discover, run, and compose AI agents from any framework.
☆688 Updated this week
Arize-ai / openinference
OpenTelemetry Instrumentation for AI Observability
☆497 Updated this week
meta-llama / synthetic-data-kit
Tool for generating high quality Synthetic datasets
☆1,010 Updated last week
meta-llama / llama-prompt-ops
An open-source tool for general prompt optimization.
☆557 Updated last week
PAIR-code / llm-comparator
LLM Comparator is an interactive data visualization tool for evaluating and analyzing LLM responses side-by-side, developed by the PAIR t…
☆454 Updated 5 months ago
NVIDIA-AI-Blueprints / rag
This NVIDIA RAG blueprint serves as a reference solution for a foundational Retrieval Augmented Generation (RAG) pipeline.
☆164 Updated last week
brandonstarxel / chunking_evaluation
This package, developed as part of our research detailed in the Chroma Technical Report, provides tools for text chunking and evaluation.…
☆346 Updated 4 months ago
StacklokLabs / promptwright
Generate large synthetic data using an LLM
☆433 Updated last week
argilla-io / synthetic-data-generator
Build datasets using natural language
☆500 Updated 2 months ago
i-am-bee / acp
Open protocol for communication between AI agents, applications, and humans.
☆527 Updated 2 weeks ago
cfahlgren1 / observers
A Lightweight Library for AI Observability
☆246 Updated 4 months ago
vllm-project / guidellm
Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs
☆405 Updated this week
ibm-granite / granite-3.0-language-models
☆259 Updated 3 weeks ago
deepset-ai / haystack-cookbook
👩🏻‍🍳 A collection of example notebooks using Haystack
☆485 Updated last week
NVIDIA-AI-Blueprints / multimodal-pdf-data-extraction
NVIDIA AI Blueprint for multimodal PDF data extraction for enterprise RAG
☆335 Updated 3 months ago
weaviate / recipes
This repository shares end-to-end notebooks on how to use various Weaviate features and integrations!
☆803 Updated last week
microsoft / sammo
A library for prompt engineering and optimization (SAMMO = Structure-aware Multi-Objective Metaprompt Optimization)
☆695 Updated 3 weeks ago
huggingface / yourbench
🤗 Benchmark Large Language Models Reliably On Your Data
☆359 Updated last week
stanford-futuredata / ARES
Automated Evaluation of RAG Systems
☆624 Updated 3 months ago
ibm-granite-community / granite-retrieval-agent
Build Research and RAG agents with Granite on your laptop
☆138 Updated last month
IBM / mcp-context-forge
A Model Context Protocol (MCP) Gateway & Registry. Serves as a central management point for tools, resources, and prompts that can be acc…
☆830 Updated this week
i-am-bee / bee-stack
Run the entire bee application stack using docker-compose
☆155 Updated 4 months ago
cvs-health / uqlm
UQLM: Uncertainty Quantification for Language Models, is a Python package for UQ-based LLM hallucination detection
☆779 Updated last week
NVIDIA / NeMo-Agent-Toolkit
The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents.
☆1,110 Updated this week
huggingface / evaluation-guidebook
Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard a…
☆1,465 Updated 6 months ago
IntelLabs / RAG-FiT
Framework for enhancing LLMs for RAG tasks using fine-tuning.
☆742 Updated last month
alopatenko / LLMEvaluation
A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use…
☆123 Updated last week
prometheus-eval / prometheus-eval
Evaluate your LLM's response with Prometheus and GPT4 💯
☆963 Updated 2 months ago
google / lmeval
☆213 Updated 2 weeks ago