joaodsmarques / LumberChunkerLinks

This repository presents the original implementation of LumberChunker: Long-Form Narrative Document Segmentation by André V. Duarte, João Marques, Miguel Graça, Miguel Freire, Lei Li and Arlindo L. Oliveira (accepted at EMNLP 2024 Findings)

☆70

Alternatives and similar repositories for LumberChunker

Users that are interested in LumberChunker are comparing it to the libraries listed below

Sorting:

IAAR-Shanghai / Meta-Chunking
Meta-Chunking: Learning Efficient Text Segmentation via Logical Perception
☆236Updated last month
Ingvarstep / open-r1-text2graph
Open replication of DeepSeek R1 for text-to-graph extraction.
☆96Updated 6 months ago
cxcscmu / RAGViz
Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]
☆86Updated 6 months ago
gomate-community / rageval
Evaluation tools for Retrieval-augmented Generation (RAG) methods.
☆162Updated 8 months ago
icip-cas / StructRAG
StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization
☆141Updated 6 months ago
QingFei1 / LongRAG
[EMNLP 2024] LongRAG: A Dual-perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering
☆107Updated 6 months ago
TebooNok / HiQA
Code implement reposity of Paper HiQA
☆101Updated 5 months ago
zjunlp / OneKE
[WWW 2025] A Dockerized Schema-Guided LLM Agent-based Knowledge Extraction System.
☆95Updated this week
Yangjiaxi / Sense
[ACL24] Official repo for "Synthesizing Text-to-SQL Data from Weak and Strong LLMs"
☆67Updated 11 months ago
ictnlp / LevelRAG
The official implementation of "LevelRAG: Enhancing Retrieval-Augmented Generation with Multi-hop Logic Planning over Rewriting Augmented…
☆37Updated 3 months ago
Alibaba-NLP / CoFE-RAG
☆37Updated 3 months ago
OpenBMB / RAGEval
☆182Updated 4 months ago
LMMApplication / RAKG
☆136Updated 2 months ago
ibm-self-serve-assets / Blended-RAG
Blended RAG: Improving RAG (Retriever-Augmented Generation) Accuracy with Semantic Search and Hybrid Query-Based Retrievers
☆69Updated 2 months ago
fate-ubw / RAGLAB
[EMNLP 2024: Demo Oral] RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation
☆302Updated 9 months ago
linancn / TianGong-AI-Unstructure
TianGong-AI-Unstructure
☆68Updated last month
TIGER-AI-Lab / LongRAG
Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs".
☆236Updated 11 months ago
RUCKBReasoning / TableLLM
TableLLM: Enabling Tabular Data Manipulation by LLMs in Real Office Usage Scenarios
☆211Updated 10 months ago
chanchimin / RQ-RAG
Codes for our paper "RQ-RAG: Learning to Refine Queries for Retrieval Augmented Generation"
☆179Updated 11 months ago
ictnlp / Auto-RAG
This is the official repository for Auto-RAG.
☆217Updated 2 weeks ago
LongxingTan / open-retrievals
All-in-One: Text Embedding, Retrieval, Reranking and RAG in Transformers
☆63Updated last week
myeon9h / PlanRAG
Repository for “PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers”, NAACL24
☆142Updated last year
TableBench / TableBench
Official repository for paper "TableBench: A Comprehensive and Complex Benchmark for Table Question Answering"
☆67Updated 2 months ago
Lightblues / AgentRE
Repo for for paper "AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction".
☆68Updated last year
YuhangWuAI / tablerag
made RAG pipeline better in table data
☆94Updated 9 months ago
shibing624 / deep-research
Python implementation of AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, w…
☆46Updated 4 months ago
gangiswag / llm-reranker
☆50Updated 6 months ago
Reason-Wang / ToolGen
[ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"
☆154Updated 4 months ago
thunlp / Adaptive-Note
☆58Updated 9 months ago
jina-ai / late-chunking
Code for explaining and evaluating late chunking (chunked pooling)
☆426Updated 7 months ago