An Overview of the Latest Document Chunking Research
☆90 · Nov 25, 2024 · Updated last year
Alternatives and similar repositories for chunking-strategies
Users that are interested in chunking-strategies are comparing it to the libraries listed below.
- Exploring and demonstrating OpenAI's Swarm framework ☆20 · Oct 20, 2024 · Updated last year
- An overview of popular reranking models and architectures for 2 stage RAG pipelines ☆22 · Jun 10, 2025 · Updated last year
- Optimize Document Retrieval with Fine-Tuned KnowledgeBases ☆186 · Nov 5, 2025 · Updated 7 months ago
- Implementing cognitive architecture and psychological memory concepts into Agentic LLM Systems ☆540 · Dec 12, 2024 · Updated last year
- ☆22 · Mar 2, 2024 · Updated 2 years ago
- Applying domain specific evaluations to RAG chunking and embedding functions ☆18 · Dec 25, 2024 · Updated last year
- An intuitive approach towards understanding how Retrieval Augmented Generation (RAG) systems work, for the curious yet daunted reader ☆30 · Jul 12, 2025 · Updated 10 months ago
- Simple Social Media Mockup Using Sqlite3 FastAPI and HTMX ☆19 · Aug 11, 2024 · Updated last year
- An automatic, multi-threaded mass sample (malware) execution based on that used by the PC Security Channel (YouTube) ☆31 · May 9, 2026 · Updated last month
- ☆29 · Aug 5, 2024 · Updated last year
- ☆10 · Nov 14, 2024 · Updated last year
- A simple webapp to visualise TOML ☆11 · Nov 29, 2023 · Updated 2 years ago
- dugite-extra - High-level Git commands for dugite ☆14 · Nov 27, 2023 · Updated 2 years ago
- Cradlepoint ECM Command Line Interface ☆11 · Mar 7, 2023 · Updated 3 years ago
- ☆30 · Jun 7, 2024 · Updated 2 years ago
- Very minimal (and stateless) agent framework ☆44 · Jan 12, 2025 · Updated last year
- Fully local RAG setup: GPT4ALL, HuggingFace Embeddings model, FAISS, LangChain ☆10 · May 10, 2023 · Updated 3 years ago
- Async dependency injector for JavaScript & Typescript with full type-safety ☆10 · Feb 17, 2023 · Updated 3 years ago
- 7-phase deep research system for Claude Code. Multi-source verification, Graph of Thoughts methodology, domain overlays for healthcare/fi… ☆20 · Dec 19, 2025 · Updated 5 months ago
- ☆16 · Jan 5, 2023 · Updated 3 years ago
- A codebase for data crawling and preprocessing for TTS and ASR systems training. ☆23 · Feb 26, 2026 · Updated 3 months ago
- Fork of OpenAI's Realtime Console, adapted for Vocal RAG ☆36 · Oct 18, 2024 · Updated last year
- Directives, utils, and events for working with angular zoneless ☆13 · Jan 11, 2026 · Updated 4 months ago
- This repository hosts the DataAssistant, a robust Python class designed to integrate seamlessly with OpenAI's API. It facilitates the cre… ☆13 · Jul 2, 2024 · Updated last year
- Open Source AI Database for Voice Agent Transcripts | Call Analysis & Insights | Extraction | Labelling & Classification ☆30 · Nov 3, 2025 · Updated 7 months ago
- PathRAG System - A Path-based Retrieval-Augmented Generation implementation with knowledge graph visualization and Ollama integration for… ☆13 · Mar 3, 2025 · Updated last year
- ☆12 · Mar 14, 2023 · Updated 3 years ago
- Upload files directly to AWS S3, Google Cloud Storage and others in meteor ☆13 · Jul 19, 2025 · Updated 10 months ago
- ☆12 · Jul 17, 2024 · Updated last year
- LockBit-Black-Builder_ ;this is Lockbit Black Builder ☆10 · Sep 28, 2022 · Updated 3 years ago
- Use built-in macOS optical character recognition (OCR) via the command line ☆18 · Nov 17, 2025 · Updated 6 months ago
- Watch local files for changes and share them with the world 🌎 ☆13 · Jan 30, 2018 · Updated 8 years ago
- Interface and control for the machinon board ☆14 · Oct 30, 2020 · Updated 5 years ago
- FormulaOne: A dataset of algorithmic problems based on MSO formulas. ☆25 · Mar 1, 2026 · Updated 3 months ago
- Project Interoperability: A Start-Up Guide to Info Sharing ☆29 · Nov 22, 2016 · Updated 9 years ago
- oh-my-codex (omx) — Orchestration layer for OpenAI Codex CLI. Async Claude Code delegation (no timeouts), structured workflows (autopilot… ☆68 · Apr 1, 2026 · Updated 2 months ago
- ☆53 · Updated this week
- Unofficial wrapper for read/write of GPIO pins on the Onion Omega ☆11 · Nov 7, 2015 · Updated 10 years ago
- Threat Simulator for Enterprise Networks ☆14 · May 14, 2022 · Updated 4 years ago