gioelecrispo / chunkipy
chunkipy is an extremely useful tool for segmenting long texts into smaller chunks, based on either a character or token count. With customizable chunk sizes and splitting strategies, chunkipy provides flexibility and control for various text processing tasks.
β36Updated last year
Alternatives and similar repositories for chunkipy
Users that are interested in chunkipy are comparing it to the libraries listed below
Sorting:
- Explore the use of DSPy for extracting features from PDFs πβ39Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing β‘β66Updated 6 months ago
- π Reference-Free automatic summarization evaluation with potential hallucination detectionβ100Updated last year
- Writing Blog Posts with Generative Feedback Loops!β47Updated last year
- β77Updated 11 months ago
- π A list of Haystack Integrations, maintained by the community or deepset.β85Updated last week
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.β47Updated 8 months ago
- AI real estate agentβ34Updated last year
- β19Updated 7 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β49Updated 10 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimizationβ61Updated last year
- β14Updated last year
- Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created byβ¦β31Updated 8 months ago
- β68Updated 6 months ago
- Mistral + Haystack: build RAG pipelines that rock π€β103Updated last year
- A tutorial on DSPy and whether automated prompt engineering lives up to the hypeβ22Updated last year
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Daβ103Updated last month
- A RAG that can scale π§π»βπ»β11Updated 11 months ago
- Overview and tutorials of the LlamaIndex Libraryβ18Updated last year
- β45Updated last year
- β122Updated 2 months ago
- Search your favorite websites and chat with them, on your desktopπβ30Updated 3 months ago
- Pre-train Static Word Embeddingsβ60Updated last month
- A framework for evaluating function calls made by LLMsβ37Updated 9 months ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafteβ¦β69Updated 6 months ago
- Python library to use Pleias-RAG modelsβ49Updated 2 weeks ago
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on taskβ¦β166Updated 7 months ago
- LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and chat messages from generic blueprints. Iβ¦β99Updated 3 weeks ago
- Simple customizable evaluation for text retrieval performance of Sentence Transformers embedders on PDFsβ26Updated 3 months ago
- β17Updated last year