gioelecrispo / chunkipy
chunkipy is an extremely useful tool for segmenting long texts into smaller chunks, based on either a character or token count. With customizable chunk sizes and splitting strategies, chunkipy provides flexibility and control for various text processing tasks.
☆37 Updated last week
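As a rough illustration of the idea described above (splitting by character count or by token count), here is a minimal sketch. It is not chunkipy's API: the function name `chunk_text`, its parameters, and the whitespace tokenizer are hypothetical and chosen only for this example.

```python
from typing import Callable, List, Optional


def chunk_text(
    text: str,
    chunk_size: int,
    tokenize: Optional[Callable[[str], List[str]]] = None,
) -> List[str]:
    """Split text into consecutive chunks of at most chunk_size units.

    Hypothetical helper, not part of chunkipy. Without a tokenizer the
    unit is one character; with a tokenizer the unit is one token.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")

    if tokenize is None:
        # Character-based splitting: slice the string directly.
        return [text[i:i + chunk_size] for i in range(0, len(text), chunk_size)]

    # Token-based splitting: group tokens, then join each group with spaces.
    tokens = tokenize(text)
    return [
        " ".join(tokens[i:i + chunk_size])
        for i in range(0, len(tokens), chunk_size)
    ]


by_chars = chunk_text("abcdefgh", 3)
by_tokens = chunk_text("one two three four five", 2, tokenize=str.split)
```

Splitting on fixed boundaries like this can cut sentences in half; a real splitter would typically prefer sentence or paragraph boundaries when choosing where to break.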
Alternatives and similar repositories for chunkipy
Users who are interested in chunkipy are comparing it to the libraries listed below.
- LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and chat messages from generic blueprints. I… ☆119 Updated last week
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection ☆103 Updated last year
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte… ☆78 Updated last year
- Data extraction with LLM on CPU ☆112 Updated last year
- Framework for building, orchestrating and deploying multi-agent systems. Managed by OpenAI Solutions team. Experimental framework. ☆93 Updated last year
- A reimplementation of langgraph's customer support example in Rasa's CALM paradigm and a quantitative evaluation of the 2 approaches ☆81 Updated 9 months ago
- Dataset Viber is your chill repo for data collection, annotation and vibe checks. ☆45 Updated last year
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task… ☆181 Updated last year
- Mistral + Haystack: build RAG pipelines that rock 🤘 ☆106 Updated last year
- ☆103 Updated last year
- ☆125 Updated 10 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ☆68 Updated last month
- This project enhances the construction of RAG applications by addressing challenges, improving accessibility, scalability, and managing d… ☆147 Updated last year
- Writing Blog Posts with Generative Feedback Loops! ☆50 Updated last year
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API ☆46 Updated last year
- Building a chatbot powered with a RAG pipeline to read, summarize and quote the most relevant papers related to the user query. ☆166 Updated last year
- ☆95 Updated 2 years ago
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform ☆91 Updated 3 months ago
- Build a Streamlit Chatbot using Langchain, ColBERT, Ragatouille, and ChromaDB ☆123 Updated last year
- ☆21 Updated last year
- RAGArch is a Streamlit-based application that empowers users to experiment with various components and parameters of Retrieval-Augmented … ☆87 Updated last year
- Explore the use of DSPy for extracting features from PDFs 🔎 ☆49 Updated last year
- ☆66 Updated last year
- Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23) ☆81 Updated 10 months ago
- ☆242 Updated 6 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization ☆71 Updated last year
- This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busine… ☆156 Updated last year
- This repo is the central repo for all the RAG Evaluation reference material and partner workshop ☆77 Updated 8 months ago
- ☆45 Updated last year
- Get a markdown version of any webpage with a keyboard shortcut. ☆67 Updated 10 months ago