data-prep-kit / data-prep-kitLinks
Open source project for data preparation for GenAI applications
☆687Updated last week
Alternatives and similar repositories for data-prep-kit
Users that are interested in data-prep-kit are comparing it to the libraries listed below
Sorting:
- Granite Snack Cookbook -- easily consumable recipes (python notebooks) that showcase the capabilities of the Granite models☆195Updated last week
- Discover, run, and compose AI agents from any framework.☆587Updated this week
- Tool for generating high quality Synthetic datasets☆896Updated last week
- ☆257Updated 6 months ago
- ☆122Updated this week
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,438Updated last week
- 👩🏻🍳 A collection of example notebooks using Haystack☆477Updated last week
- Simple package to extract text with coordinates from programmatic PDFs☆128Updated this week
- NVIDIA AI Blueprint for multimodal PDF data extraction for enterprise RAG☆328Updated 2 months ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆2,724Updated this week
- This package, developed as part of our research detailed in the Chroma Technical Report, provides tools for text chunking and evaluation.…☆323Updated 2 months ago
- Framework-agnostic agent communication. Unified by design.☆251Updated this week
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs☆317Updated this week
- A python library to define and validate data types in Docling.☆137Updated last week
- 🦄 Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data …☆196Updated this week
- Build Research and Rag agents with Granite on your laptop☆133Updated 2 weeks ago
- This repository shares end-to-end notebooks on how to use various Weaviate features and integrations!☆770Updated last week
- Build datasets using natural language☆483Updated 3 weeks ago
- Ranking LLMs on agentic tasks☆138Updated this week
- Automated Evaluation of RAG Systems☆599Updated 2 months ago
- An open-source tool for seamless migration from other LLMs to Llama, and for general prompt optimization.☆360Updated last week
- Framework for enhancing LLMs for RAG tasks using fine-tuning.☆741Updated 2 weeks ago
- Automatically evaluate your LLMs in Google Colab☆631Updated last year
- Code for explaining and evaluating late chunking (chunked pooling)☆396Updated 5 months ago
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends☆1,574Updated last week
- Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard a…☆1,391Updated 5 months ago
- Run the entire bee application stack using docker-compose☆153Updated 2 months ago
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.☆423Updated last year
- A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your data☆1,426Updated 2 weeks ago
- Interact with the Deep Search platform for new knowledge explorations and discoveries☆204Updated 4 months ago