A new chunking strategy developed by ZeroEntropy for general semantic chunking using Llama-70B.
☆252Jan 28, 2025Updated last year
Alternatives and similar repositories for zchunk
Users that are interested in zchunk are comparing it to the libraries listed below
Sorting:
- This is the repo for the LegalBench-RAG Paper: https://arxiv.org/abs/2408.10343.☆161May 30, 2025Updated 9 months ago
- AI management tool☆121Nov 9, 2024Updated last year
- KoboldCpp Smart Launcher with GPU Layer and Tensor Override Tuning☆30May 18, 2025Updated 9 months ago
- Evaluate state-of-the-art sparse embedding models on the LIMIT dataset (`limit-small` and `limit`) from google's paper `On the Theoretica…☆15Sep 4, 2025Updated 5 months ago
- This project is a reverse-engineered version of Figma's tone changer. It uses Groq's Llama-3-8b for high-speed inference and to adjust th…☆90Jul 26, 2024Updated last year
- Model implementation for the contextual embeddings project☆40Jun 2, 2025Updated 9 months ago
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆33Feb 10, 2025Updated last year
- ☆29Apr 22, 2024Updated last year
- Tcurtsni: Reverse Instruction Chat, ever wonder what your LLM wants to ask you?☆23Jun 25, 2024Updated last year
- something for paper agent☆11Dec 18, 2024Updated last year
- A frontend for creative writing with LLMs☆151Jul 15, 2024Updated last year
- A web-app to explore topics using LLM (less typing and more clicks)☆68Feb 6, 2026Updated 3 weeks ago
- A universal adapter including zero-copy Python bindings for Philip Turner's metal flash attention library.☆23Dec 15, 2025Updated 2 months ago
- rerank library for easy reranking of results☆53Sep 17, 2024Updated last year
- Yet another frontend for LLM, written using .NET and WinUI 3☆10Sep 14, 2025Updated 5 months ago
- Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine☆495Jul 23, 2025Updated 7 months ago
- Serving LLMs in the HF-Transformers format via a PyFlask API☆72Sep 10, 2024Updated last year
- Efficient BM25 with DuckDB 🦆☆64Dec 20, 2024Updated last year
- ☆46May 9, 2025Updated 9 months ago
- A python package for developing AI applications with local LLMs.☆150Jan 4, 2025Updated last year
- Benchmark evaluating LLMs on their ability to create and resist disinformation. Includes comprehensive testing across major models (Claud…☆31Mar 20, 2025Updated 11 months ago
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆267Mar 6, 2025Updated 11 months ago
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API☆48Sep 26, 2024Updated last year
- CHATGPT-In-Jupyter☆11Jun 2, 2023Updated 2 years ago
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs☆86Sep 22, 2024Updated last year
- ☆24Jan 30, 2025Updated last year
- The most feature-complete local AI workstation. Multi-GPU inference, integrated Stable Diffusion + ADetailer, voice cloning, research-gra…☆56Feb 24, 2026Updated last week
- High-performance retrieval engine for unstructured data☆1,561Nov 10, 2025Updated 3 months ago
- The LLM Evaluation Framework☆13,787Feb 23, 2026Updated last week
- golang browser-use port☆22Jul 7, 2025Updated 7 months ago
- ☆24Nov 20, 2025Updated 3 months ago
- An End-to-End Model with Adaptive Filtering for Retrieval-Augmented Generation☆16Oct 27, 2024Updated last year
- LATTICE turns retrieval into an LLM-driven navigation problem over a semantic scaffold☆32Nov 30, 2025Updated 3 months ago
- ☆32Aug 26, 2025Updated 6 months ago
- gRPC server for hnswlib☆16Mar 6, 2023Updated 2 years ago
- The hearth of The Pulsar App, fast, secure and shared inference with modern UI☆59Dec 1, 2024Updated last year
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆103Dec 23, 2024Updated last year
- ☆43Apr 22, 2025Updated 10 months ago
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆38Jan 29, 2024Updated 2 years ago