data-prep-kit / data-prep-kitLinks
Open source project for data preparation for GenAI applications
☆876Updated last week
Alternatives and similar repositories for data-prep-kit
Users that are interested in data-prep-kit are comparing it to the libraries listed below
Sorting:
- Granite Snack Cookbook -- easily consumable recipes (python notebooks) that showcase the capabilities of the Granite models☆335Updated 2 weeks ago
- LLM Comparator is an interactive data visualization tool for evaluating and analyzing LLM responses side-by-side, developed by the PAIR t…☆501Updated 10 months ago
- VectorHub is a free, open-source learning website for people (software developers to senior ML architects) interested in adding vector re…☆506Updated this week
- Tool for generating high quality Synthetic datasets☆1,441Updated 2 months ago
- An open-source tool for LLM prompt optimization.☆734Updated last week
- InstructLab Core package. Use this to chat with a model and execute the InstructLab workflow to train a model using custom taxonomy data…☆1,392Updated this week
- 👩🏻🍳 A collection of example notebooks using Haystack☆516Updated last week
- OpenTelemetry Instrumentation for AI Observability☆777Updated this week
- Curate High Quality Datasets, Train, Evaluate and Ship! 🚀☆753Updated this week
- This repository shares end-to-end notebooks on how to use various Weaviate features and integrations!☆932Updated 2 weeks ago
- A python library to define and validate data types in Docling.☆219Updated last week
- ☆174Updated 3 weeks ago
- CUGA is an open-source generalist agent for the enterprise, supporting complex task execution on web and APIs, OpenAPI/MCP integrations, …☆556Updated last week
- Deploy and share agents with open infrastructure, free from vendor lock-in.☆864Updated this week
- A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your data☆1,522Updated 7 months ago
- An Awesome list of curated DSPy resources.☆494Updated 2 weeks ago
- Simple package to extract text with coordinates from programmatic PDFs☆226Updated 3 weeks ago
- Taxonomy tree that will allow you to create models tuned with your data☆287Updated 3 months ago
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs☆765Updated this week
- Build Research and Rag agents with Granite on your laptop☆149Updated 2 months ago
- Open protocol for communication between AI agents, applications, and humans.☆911Updated 4 months ago
- This package, developed as part of our research detailed in the Chroma Technical Report, provides tools for text chunking and evaluation.…☆461Updated 2 weeks ago
- A Lightweight Library for AI Observability☆252Updated 10 months ago
- The Agent Lifecycle Toolkit (ALTK) is a library of components to help agent builders improve their agent with minimal integration effort …☆94Updated this week
- ☆267Updated 6 months ago
- Additional packages (components, document stores and the likes) to extend the capabilities of Haystack☆176Updated last week
- Build datasets using natural language☆556Updated 3 months ago
- Run the entire bee application stack using docker-compose☆153Updated 9 months ago
- Ranking LLMs on agentic tasks☆204Updated last month
- Framework for enhancing LLMs for RAG tasks using fine-tuning.☆762Updated last week