gioelecrispo / chunkipyLinks
chunkipy is an extremely useful tool for segmenting long texts into smaller chunks, based on either a character or token count. With customizable chunk sizes and splitting strategies, chunkipy provides flexibility and control for various text processing tasks.
☆36Updated last week
Alternatives and similar repositories for chunkipy
Users that are interested in chunkipy are comparing it to the libraries listed below
Sorting:
- RAGArch is a Streamlit-based application that empowers users to experiment with various components and parameters of Retrieval-Augmented …☆85Updated last year
- LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and chat messages from generic blueprints. I…☆105Updated 3 weeks ago
- Data extraction with LLM on CPU☆114Updated last year
- A reimplementation of langgraph's customer support example in Rasa's CALM paradigm and a quantiative evaluation of the 2 approaches☆80Updated 3 months ago
- Build a Streamlit Chatbot using Langchain, ColBERT, Ragatouille, and ChromaDB☆120Updated last year
- Building a chatbot powered with a RAG pipeline to read,summarize and quote the most relevant papers related to the user query.☆167Updated last year
- Mistral + Haystack: build RAG pipelines that rock 🤘☆105Updated last year
- LangChain chat model abstractions for dynamic failover, load balancing, chaos engineering, and more!☆81Updated last year
- Framework for building, orchestrating and deploying multi-agent systems. Managed by OpenAI Solutions team. Experimental framework.☆92Updated 9 months ago
- ☆101Updated last year
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task…☆173Updated 9 months ago
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆47Updated 10 months ago
- Writing Blog Posts with Generative Feedback Loops!☆49Updated last year
- ☆45Updated last year
- ☆122Updated 4 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆66Updated 8 months ago
- Search your favorite websites and chat with them, on your desktop🌐☆30Updated 5 months ago
- Example demonstrating how to use gpt-4o-mini for fine-tuning☆27Updated 10 months ago
- Auto Data is a library designed for quick and effortless creation of datasets tailored for fine-tuning Large Language Models (LLMs).☆102Updated 8 months ago
- ☆93Updated last year
- auto fine tune of models with synthetic data☆76Updated last year
- This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busine…☆151Updated 9 months ago
- Claude API Test Project☆87Updated last year
- Examples of Chat Bots using Panels chat features: Traditional, LLMs, AI Agents, LangChain, OpenAI etc☆121Updated 6 months ago
- This repo is the central repo for all the RAG Evaluation reference material and partner workshop☆68Updated 2 months ago
- RAG example using DSPy, Gradio, FastAPI☆83Updated last year
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆73Updated 8 months ago
- 🚀 Template Haystack Search Application with Streamlit☆27Updated 6 months ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆101Updated last year
- Unattended Lightweight Text Classifiers with LLM Embeddings☆185Updated 10 months ago