A new chunking strategy developed by ZeroEntropy for general semantic chunking using Llama-70B.
☆254Jan 28, 2025Updated last year
Alternatives and similar repositories for zchunk
Users that are interested in zchunk are comparing it to the libraries listed below
Sorting:
- AI Agents for Enterprise Software Automation☆45Jan 10, 2025Updated last year
- Codebase and CLI for PLAPT: A state-of-the-art protein-ligand binding affinity model for drug discovery☆114Mar 27, 2025Updated 11 months ago
- 🐍 Sublingual helps you log and analyze all of your LLM calls, including the prompt template, call parameters, responses, tool calls, and…☆52Mar 5, 2025Updated last year
- superglue (YC W25) builds integrations and tools from natural language. Get production-grade tools for long tail and enterprise systems.☆1,994Feb 27, 2026Updated 3 weeks ago
- An exploration into rewriting git with typescript☆34Jan 6, 2026Updated 2 months ago
- AI management tool☆121Nov 9, 2024Updated last year
- The LLM Evaluation Framework☆14,115Mar 13, 2026Updated last week
- Yet another frontend for LLM, written using .NET and WinUI 3☆10Sep 14, 2025Updated 6 months ago
- ☆46May 9, 2025Updated 10 months ago
- Simple AI chat bubble for your website: Wordpress, React, HTML, Shopify. Answer questions about a website's content using RAG, streaming,…☆21Mar 24, 2025Updated 11 months ago
- A Python-based voice assistant integrating speech-to-text (STT), text-to-speech (TTS), and powerful AI capabilities using either a local …☆17Dec 8, 2025Updated 3 months ago
- This project is a reverse-engineered version of Figma's tone changer. It uses Groq's Llama-3-8b for high-speed inference and to adjust th…☆90Jul 26, 2024Updated last year
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆267Mar 6, 2025Updated last year
- Evaluate state-of-the-art sparse embedding models on the LIMIT dataset (`limit-small` and `limit`) from google's paper `On the Theoretica…☆16Sep 4, 2025Updated 6 months ago
- ☆29Apr 22, 2024Updated last year
- A frontend for creative writing with LLMs☆156Jul 15, 2024Updated last year
- Serving LLMs in the HF-Transformers format via a PyFlask API☆72Sep 10, 2024Updated last year
- ClosingStats is an open source "Glassdoor" for sharing anonymized structured financial data.☆20Nov 24, 2024Updated last year
- Sparse Autoencoders (SAE) vs CLIP fine-tuning fun.☆18Dec 19, 2024Updated last year
- rerank library for easy reranking of results☆54Sep 17, 2024Updated last year
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆33Feb 10, 2025Updated last year
- ☆17Dec 16, 2024Updated last year
- A data visualisation of a 100 responses when asking local LLMs to imagine a random person.☆24Nov 4, 2024Updated last year
- Benchmark evaluating LLMs on their ability to create and resist disinformation. Includes comprehensive testing across major models (Claud…☆31Mar 20, 2025Updated last year
- ☆24Nov 20, 2025Updated 4 months ago
- A python package for developing AI applications with local LLMs.☆150Jan 4, 2025Updated last year
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API☆48Sep 26, 2024Updated last year
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs☆86Sep 22, 2024Updated last year
- Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine☆497Jul 23, 2025Updated 7 months ago
- A Python micro framework for creating LLM-driven agents☆23May 20, 2025Updated 10 months ago
- High-performance retrieval engine for unstructured data☆1,566Nov 10, 2025Updated 4 months ago
- WilmerAI is one of the oldest LLM semantic routers. It uses multi-layer prompt routing and complex workflows to allow you to not only cre…☆806Feb 9, 2026Updated last month
- 🦛 CHONK docs with Chonkie ✨ — The lightweight ingestion library for fast, efficient and robust RAG pipelines☆3,857Updated this week
- A performance insights and knowledge assistant agent built on top of Chrome DevTools internals, Mastra, AI SDK and NextJS☆21Dec 13, 2025Updated 3 months ago
- Efficient BM25 with DuckDB 🦆☆65Dec 20, 2024Updated last year
- The repository for the paper "Is Killed More Significant than Fled? A Contextual Model for Salient Event Detection"☆10Jul 5, 2022Updated 3 years ago
- ☆30Jun 7, 2024Updated last year
- A coding agent and general agent harness for building and orchestrating agentic applications.☆597Updated this week
- ☆24Jan 22, 2025Updated last year