joaodsmarques / LumberChunkerLinks
This repository presents the original implementation of LumberChunker: Long-Form Narrative Document Segmentation by André V. Duarte, João Marques, Miguel Graça, Miguel Freire, Lei Li and Arlindo L. Oliveira (accepted at EMNLP 2024 Findings)
☆70Updated 9 months ago
Alternatives and similar repositories for LumberChunker
Users that are interested in LumberChunker are comparing it to the libraries listed below
Sorting:
- Meta-Chunking: Learning Efficient Text Segmentation via Logical Perception☆236Updated last month
- Open replication of DeepSeek R1 for text-to-graph extraction.☆96Updated 6 months ago
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]☆86Updated 6 months ago
- Evaluation tools for Retrieval-augmented Generation (RAG) methods.☆162Updated 8 months ago
- StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization☆141Updated 6 months ago
- [EMNLP 2024] LongRAG: A Dual-perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering☆107Updated 6 months ago
- Code implement reposity of Paper HiQA☆101Updated 5 months ago
- [WWW 2025] A Dockerized Schema-Guided LLM Agent-based Knowledge Extraction System.☆95Updated this week
- [ACL24] Official repo for "Synthesizing Text-to-SQL Data from Weak and Strong LLMs"☆67Updated 11 months ago
- The official implementation of "LevelRAG: Enhancing Retrieval-Augmented Generation with Multi-hop Logic Planning over Rewriting Augmented…☆37Updated 3 months ago
- ☆37Updated 3 months ago
- ☆182Updated 4 months ago
- ☆136Updated 2 months ago
- Blended RAG: Improving RAG (Retriever-Augmented Generation) Accuracy with Semantic Search and Hybrid Query-Based Retrievers☆69Updated 2 months ago
- [EMNLP 2024: Demo Oral] RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation☆302Updated 9 months ago
- TianGong-AI-Unstructure☆68Updated last month
- Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs".☆236Updated 11 months ago
- TableLLM: Enabling Tabular Data Manipulation by LLMs in Real Office Usage Scenarios☆211Updated 10 months ago
- Codes for our paper "RQ-RAG: Learning to Refine Queries for Retrieval Augmented Generation"☆179Updated 11 months ago
- This is the official repository for Auto-RAG.☆217Updated 2 weeks ago
- All-in-One: Text Embedding, Retrieval, Reranking and RAG in Transformers☆63Updated last week
- Repository for “PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers”, NAACL24☆142Updated last year
- Official repository for paper "TableBench: A Comprehensive and Complex Benchmark for Table Question Answering"☆67Updated 2 months ago
- Repo for for paper "AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction".☆68Updated last year
- made RAG pipeline better in table data☆94Updated 9 months ago
- Python implementation of AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, w…☆46Updated 4 months ago
- ☆50Updated 6 months ago
- [ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"☆154Updated 4 months ago
- ☆58Updated 9 months ago
- Code for explaining and evaluating late chunking (chunked pooling)☆426Updated 7 months ago