A new chunking strategy developed by ZeroEntropy for general semantic chunking using Llama-70B.
☆261Jan 28, 2025Updated last year
Alternatives and similar repositories for zchunk
Users that are interested in zchunk are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Recipes for AI agents that use Asteroid to be safe and reliable. Want yours featured? Submit a PR!☆51Apr 17, 2025Updated last year
- ☆59Mar 11, 2025Updated last year
- A1Base NextJS template☆66Apr 13, 2026Updated 2 months ago
- Codebase and CLI for PLAPT: A state-of-the-art protein-ligand binding affinity model for drug discovery☆118Mar 27, 2025Updated last year
- Open Source Data Collection and Evaluation Framework☆61Jul 23, 2025Updated 11 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- MentraOS is the leading smart glasses OS. See live captions, stream your view, talk to AI, and capture photos hands-free on compatible gl …☆2,197Updated this week
- 🐍 Sublingual helps you log and analyze all of your LLM calls, including the prompt template, call parameters, responses, tool calls, and…☆52Mar 5, 2025Updated last year
- This is the repo for the LegalBench-RAG Paper: https://arxiv.org/abs/2408.10343.☆193May 30, 2025Updated last year
- The open-source React.js Autonomous LLM Agent☆1,654Apr 12, 2024Updated 2 years ago
- superglue (YC W25) builds integrations and tools from natural language. Get production-grade tools for long tail and enterprise systems.☆2,028Updated this week
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- Airtop SDK for Node.js☆16Nov 6, 2025Updated 7 months ago
- KoboldCpp Smart Launcher with GPU Layer and Tensor Override Tuning☆30May 18, 2025Updated last year
- AI management tool☆119Nov 9, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Typescript/React Library for AI Chat💬🚀☆10,813Updated this week
- ☆39Updated this week
- The LLM Evaluation Framework☆16,516Updated this week
- something for paper agent☆11Dec 18, 2024Updated last year
- Yet another frontend for LLM, written using .NET and WinUI 3☆11Sep 14, 2025Updated 9 months ago
- ☆44May 9, 2025Updated last year
- Simple AI chat bubble for your website: Wordpress, React, HTML, Shopify. Answer questions about a website's content using RAG, streaming,…☆23Mar 24, 2025Updated last year
- A Python-based voice assistant integrating speech-to-text (STT), text-to-speech (TTS), and powerful AI capabilities using either a local …☆18Updated this week
- This project is a reverse-engineered version of Figma's tone changer. It uses Groq's Llama-3-8b for high-speed inference and to adjust th…☆91Jul 26, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆266Mar 6, 2025Updated last year
- Evaluate state-of-the-art sparse embedding models on the LIMIT dataset (`limit-small` and `limit`) from google's paper `On the Theoretica…☆16Sep 4, 2025Updated 9 months ago
- ☆31Apr 22, 2024Updated 2 years ago
- Serving LLMs in the HF-Transformers format via a PyFlask API☆72Sep 10, 2024Updated last year
- A web-app to explore topics using LLM (less typing and more clicks)☆67Mar 15, 2026Updated 3 months ago
- ClosingStats is an open source "Glassdoor" for sharing anonymized structured financial data.☆20Nov 24, 2024Updated last year
- A frontend for creative writing with LLMs☆167Jul 15, 2024Updated last year
- Sparse Autoencoders (SAE) vs CLIP fine-tuning fun.☆18Dec 19, 2024Updated last year
- ☆141Feb 26, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆33Feb 10, 2025Updated last year
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere…☆22May 9, 2025Updated last year
- rerank library for easy reranking of results☆56Sep 17, 2024Updated last year
- ☆16Dec 16, 2024Updated last year
- A data visualisation of a 100 responses when asking local LLMs to imagine a random person.☆24Nov 4, 2024Updated last year
- Benchmark evaluating LLMs on their ability to create and resist disinformation. Includes comprehensive testing across major models (Claud…☆33Mar 20, 2025Updated last year
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs☆89Sep 22, 2024Updated last year