data-prep-kit / data-prep-kit
Open source project for data preparation of LLM application builders
β575Updated this week
Alternatives and similar repositories for data-prep-kit:
Users that are interested in data-prep-kit are comparing it to the libraries listed below
- Granite Snack Cookbook -- easily consumable recipes (python notebooks) that showcase the capabilities of the Granite modelsβ160Updated last week
- π¦ Unitxt: a python library for getting data fired up and set for training and evaluationβ181Updated this week
- LLM Comparator is an interactive data visualization tool for evaluating and analyzing LLM responses side-by-side, developed by the PAIR tβ¦β401Updated last month
- β255Updated 3 months ago
- Discover, run, and compose AI agents from any frameworkβ351Updated this week
- Build Research and Rag agents with Granite on your laptopβ116Updated 3 weeks ago
- The NVIDIA AgentIQ toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents.β409Updated this week
- InstructLab Core package. Use this to chat with a model and execute the InstructLab workflow to train a model using custom taxonomy dataβ¦β1,207Updated this week
- NVIDIA AI Blueprint for multimodal PDF data extraction for enterprise RAGβ316Updated this week
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needsβ236Updated this week
- LOTUS: A semantic query engine for fast and easy LLM-powered data processingβ1,134Updated last week
- An Awesome list of curated DSPy resources.β300Updated last month
- An NVIDIA AI Workbench example project for Retrieval Augmented Generation (RAG)β308Updated 3 months ago
- A python library to define and validate data types in Docling.β92Updated this week
- β90Updated 2 weeks ago
- Prompt Declaration Language (PDL) is a declarative prompt programming language.β137Updated this week
- π Automatically annotate papers using LLMsβ309Updated 3 months ago
- Automated Evaluation of RAG Systemsβ563Updated 4 months ago
- Evaluate your LLM's response with Prometheus and GPT4 π―β893Updated last week
- Framework for enhancing LLMs for RAG tasks using fine-tuning.β736Updated last month
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.β1,359Updated last week
- Generate large synthetic data using an LLMβ400Updated this week
- OpenTelemetry Instrumentation for AI Observabilityβ349Updated this week
- The Granite Guardian models are designed to detect risks in prompts and responses.β72Updated last week
- β1,577Updated 2 weeks ago
- Ranking LLMs on agentic tasksβ104Updated this week
- Build datasets using natural languageβ436Updated 3 weeks ago
- Additional packages (components, document stores and the likes) to extend the capabilities of Haystackβ142Updated this week
- Tutorial for building LLM routerβ186Updated 8 months ago
- TapeAgents is a framework that facilitates all stages of the LLM Agent development lifecycleβ244Updated this week