data-prep-kit / data-prep-kitLinks
Open source project for data preparation for GenAI applications
☆859Updated this week
Alternatives and similar repositories for data-prep-kit
Users that are interested in data-prep-kit are comparing it to the libraries listed below
Sorting:
- Granite Snack Cookbook -- easily consumable recipes (python notebooks) that showcase the capabilities of the Granite models☆320Updated this week
- Tool for generating high quality Synthetic datasets☆1,411Updated last month
- OpenTelemetry Instrumentation for AI Observability☆751Updated this week
- This repository shares end-to-end notebooks on how to use various Weaviate features and integrations!☆922Updated this week
- 👩🏻🍳 A collection of example notebooks using Haystack☆513Updated last week
- Train LLM Model Behavior☆667Updated this week
- An open-source tool for LLM prompt optimization.☆717Updated last week
- LLM Comparator is an interactive data visualization tool for evaluating and analyzing LLM responses side-by-side, developed by the PAIR t…☆502Updated 9 months ago
- ☆266Updated 5 months ago
- InstructLab Core package. Use this to chat with a model and execute the InstructLab workflow to train a model using custom taxonomy data…☆1,389Updated last week
- ☆173Updated last week
- This package, developed as part of our research detailed in the Chroma Technical Report, provides tools for text chunking and evaluation.…☆454Updated 8 months ago
- UQLM: Uncertainty Quantification for Language Models, is a Python package for UQ-based LLM hallucination detection☆1,079Updated this week
- Automated Evaluation of RAG Systems☆676Updated 8 months ago
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs☆730Updated this week
- 🤗 Benchmark Large Language Models Reliably On Your Data☆414Updated last week
- Build datasets using natural language☆548Updated 2 months ago
- A Lightweight Library for AI Observability☆252Updated 9 months ago
- Use late-interaction multi-modal models such as ColPali in just a few lines of code.☆832Updated 10 months ago
- A python library to define and validate data types in Docling.☆211Updated this week
- Deploy and share agents with open infrastructure, free from vendor lock-in.☆847Updated this week
- A library for prompt engineering and optimization (SAMMO = Structure-aware Multi-Objective Metaprompt Optimization)☆741Updated 5 months ago
- Fast Semantic Text Deduplication & Filtering☆852Updated last month
- This NVIDIA RAG blueprint serves as a reference solution for a foundational Retrieval Augmented Generation (RAG) pipeline.☆385Updated this week
- Evaluate your LLM's response with Prometheus and GPT4 💯☆1,017Updated 7 months ago
- Prompt Declaration Language (PDL) is a declarative prompt programming language.☆267Updated this week
- Framework for enhancing LLMs for RAG tasks using fine-tuning.☆758Updated 6 months ago
- ☆241Updated 5 months ago
- Simple package to extract text with coordinates from programmatic PDFs☆218Updated last month
- High quality resources & applications for LLMs, multi-modal models and VectorDBs☆883Updated 3 weeks ago