Meta-Chunking: Learning Efficient Text Segmentation via Logical Perception
☆274Sep 25, 2025Updated 8 months ago
Alternatives and similar repositories for Meta-Chunking
Users that are interested in Meta-Chunking are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆22Jun 10, 2025Updated 11 months ago
- This repository presents the original implementation of LumberChunker: Long-Form Narrative Document Segmentation by André V. Duarte, João…☆106Feb 9, 2026Updated 3 months ago
- PGRAG☆53Jul 16, 2024Updated last year
- [EMNLP 2025] Official codebase for Rearank: Reasoning Re-ranking Agent☆36Aug 20, 2025Updated 9 months ago
- CRUD-RAG: A Comprehensive Chinese Benchmark for Retrieval-Augmented Generation of Large Language Models☆382May 20, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [ACL 2024 Main] NewsBench: A Systematic Evaluation Framework for Assessing Editorial Capabilities of Large Language Models in Chinese Jou…☆34Jun 25, 2024Updated last year
- ☆19Jun 14, 2024Updated last year
- ☆229Apr 2, 2025Updated last year
- ☆19Nov 3, 2025Updated 6 months ago
- ☆59Mar 11, 2025Updated last year
- Code for explaining and evaluating late chunking (chunked pooling)☆512Dec 23, 2024Updated last year
- Code for KaLM-Embedding models☆118Jun 30, 2025Updated 10 months ago
- Grimoire is All You Need for Enhancing Large Language Models☆119Feb 29, 2024Updated 2 years ago
- Explore concepts like Self-Correct, Self-Refine, Self-Improve, Self-Contradict, Self-Play, and Self-Knowledge, alongside o1-like reasonin…☆174Dec 7, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- The code for paper: Hierarchical Document Refinement for Long-context Retrieval-augmented Generation [ACL2025 Oral]☆46Aug 25, 2025Updated 8 months ago
- Dense X Retrieval: What Retrieval Granularity Should We Use?☆170Jan 8, 2024Updated 2 years ago
- [ACL 2024]Controlled Text Generation for Large Language Model with Dynamic Attribute Graphs☆40Sep 24, 2024Updated last year
- This is an implementation of the paper: Searching for Best Practices in Retrieval-Augmented Generation (EMNLP2024)☆346Dec 21, 2024Updated last year
- ☆43May 9, 2024Updated 2 years ago
- Empowering RAG with a memory-based data interface for all-purpose applications!☆2,240Sep 11, 2025Updated 8 months ago
- ⚡FlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource)☆3,490Apr 10, 2026Updated last month
- The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval☆1,670Sep 3, 2024Updated last year
- Controllable Text Generation for Large Language Models: A Survey☆205Aug 27, 2024Updated last year
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- The official implementation of "LevelRAG: Enhancing Retrieval-Augmented Generation with Multi-hop Logic Planning over Rewriting Augmented…☆55Apr 12, 2025Updated last year
- A framework for high-fidelity retrieval augmented generation in industrial knowledge bases. Integrates jargon identification, context rec…☆35Mar 25, 2026Updated last month
- TrustRAG:The RAG Framework within Reliable input,Trusted output☆1,261Jan 7, 2026Updated 4 months ago
- [EMNLP 2025 Findings] Familiarity-aware Evidence Compression for Retrieval Augmented Generation☆15Aug 20, 2025Updated 9 months ago
- [EMNLP'25 findings] This is the official repo for the paper, HiRAG: Retrieval-Augmented Generation with Hierarchical Knowledge.☆542Updated this week
- KDD 2024 AQA competition 2nd place solution☆12Jul 21, 2024Updated last year
- Implementation for EACL 2024 paper "Corpus-Steered Query Expansion with Large Language Models"☆12Mar 19, 2024Updated 2 years ago
- Control LLM☆23Apr 6, 2025Updated last year
- LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA☆518Dec 31, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Psy-Insight: Mental Health Oriented Interpretable Multi-turn Bilingual Counseling Dataset for Large Language Model Finetuning☆26Jan 4, 2026Updated 4 months ago
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]☆89Jan 18, 2025Updated last year
- An awesome repository & A comprehensive survey on interpretability of LLM attention heads.☆405Mar 2, 2025Updated last year
- Official implementation of Self-Taught Agentic Long Context Understanding (ACL 2025).☆13Sep 22, 2025Updated 8 months ago
- CFT-RAG: An Entity Tree Based Retrieval Augmented Generation Algorithm With Cuckoo Filter☆24May 28, 2025Updated 11 months ago
- ☆16May 18, 2026Updated last week
- ☆19Feb 25, 2023Updated 3 years ago