data-prep-kit / data-prep-kitLinks
Open source project for data preparation for GenAI applications
☆715Updated this week
Alternatives and similar repositories for data-prep-kit
Users that are interested in data-prep-kit are comparing it to the libraries listed below
Sorting:
- Granite Snack Cookbook -- easily consumable recipes (python notebooks) that showcase the capabilities of the Granite models☆202Updated this week
- Tool for generating high quality Synthetic datasets☆958Updated 2 weeks ago
- InstructLab Core package. Use this to chat with a model and execute the InstructLab workflow to train a model using custom taxonomy data…☆1,311Updated this week
- ☆259Updated this week
- Generate large synthetic data using an LLM☆428Updated this week
- 🦄 Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data …☆200Updated this week
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs☆358Updated this week
- ☆1,843Updated this week
- Open protocol for communication between AI agents, applications, and humans.☆310Updated this week
- Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard a…☆1,444Updated 5 months ago
- Discover, run, and compose AI agents from any framework.☆609Updated this week
- Run the entire bee application stack using docker-compose☆155Updated 3 months ago
- An open-source tool for seamless migration from other LLMs to Llama, and for general prompt optimization.☆493Updated this week
- Use late-interaction multi-modal models such as ColPali in just a few lines of code.☆797Updated 4 months ago
- Framework for enhancing LLMs for RAG tasks using fine-tuning.☆741Updated last month
- An Awesome list of curated DSPy resources.☆348Updated 4 months ago
- The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents.☆1,054Updated this week
- A python library to define and validate data types in Docling.☆148Updated this week
- 👩🏻🍳 A collection of example notebooks using Haystack☆482Updated 2 weeks ago
- Scalable data pre processing and curation toolkit for LLMs☆955Updated last week
- LLM Comparator is an interactive data visualization tool for evaluating and analyzing LLM responses side-by-side, developed by the PAIR t…☆444Updated 4 months ago
- A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your data☆1,435Updated last month
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends☆1,641Updated this week
- Evaluate your LLM's response with Prometheus and GPT4 💯☆952Updated 2 months ago
- Build Research and Rag agents with Granite on your laptop☆135Updated last month
- A Model Context Protocol (MCP) Gateway & Registry. Serves as a central management point for tools, resources, and prompts that can be acc…☆409Updated this week
- LOTUS: A semantic query engine for fast and easy LLM-powered data processing☆1,208Updated 3 weeks ago
- OpenTelemetry Instrumentation for AI Observability☆480Updated this week
- Build datasets using natural language☆493Updated last month
- Prompt Declaration Language (PDL) is a declarative prompt programming language.☆169Updated this week