gioelecrispo / chunkipyLinks
chunkipy is an extremely useful tool for segmenting long texts into smaller chunks, based on either a character or token count. With customizable chunk sizes and splitting strategies, chunkipy provides flexibility and control for various text processing tasks.
☆37Updated this week
Alternatives and similar repositories for chunkipy
Users that are interested in chunkipy are comparing it to the libraries listed below
Sorting:
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task…☆176Updated 11 months ago
- Mistral + Haystack: build RAG pipelines that rock 🤘☆105Updated last year
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆77Updated 10 months ago
- Build a Streamlit Chatbot using Langchain, ColBERT, Ragatouille, and ChromaDB☆122Updated last year
- Generalist and Lightweight Model for Text Classification☆156Updated 2 months ago
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API☆45Updated 11 months ago
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks.☆37Updated last year
- ☆122Updated 6 months ago
- LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and chat messages from generic blueprints. I…☆114Updated last month
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆67Updated 10 months ago
- ☆65Updated last year
- ☆93Updated last year
- ☆234Updated 2 months ago
- Explore the use of DSPy for extracting features from PDFs 🔎☆46Updated last year
- ☆101Updated last year
- Code to extract Knowledge Graph from normal, unstructured text and visualize the resulting graph☆57Updated last year
- Building a chatbot powered with a RAG pipeline to read,summarize and quote the most relevant papers related to the user query.☆168Updated last year
- A Lightweight Library for AI Observability☆250Updated 6 months ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆103Updated last year
- LangChain chat model abstractions for dynamic failover, load balancing, chaos engineering, and more!☆84Updated last year
- Visualize Different Text Splitting Methods☆286Updated 8 months ago
- Data extraction with LLM on CPU☆112Updated last year
- Repo to experiment with Graph RAG strategies using Kùzu☆57Updated 9 months ago
- Visualization for a Retrieval-Augmented Generation (RAG) Assistant 🤖❤️📚☆192Updated 8 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆65Updated last year
- Unattended Lightweight Text Classifiers with LLM Embeddings☆184Updated 11 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated last year
- A project that enables identification and classification of an intent of a message with dynamic labels☆43Updated 8 months ago
- RAGArch is a Streamlit-based application that empowers users to experiment with various components and parameters of Retrieval-Augmented …☆85Updated last year
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆46Updated 11 months ago