joaodsmarques / LumberChunker
This repository presents the original implementation of LumberChunker: Long-Form Narrative Document Segmentation by André V. Duarte, João Marques, Miguel Graça, Miguel Freire, Lei Li and Arlindo L. Oliveira (accepted at EMNLP 2024 Findings)
☆64Updated 7 months ago
Alternatives and similar repositories for LumberChunker:
Users that are interested in LumberChunker are comparing it to the libraries listed below
- Meta-Chunking: Learning Efficient Text Segmentation via Logical Perception☆152Updated 3 weeks ago
- Open replication of DeepSeek R1 for text-to-graph extraction.☆94Updated 3 months ago
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]☆82Updated 3 months ago
- ☆162Updated last month
- [WWW 2025] A Dockerized Schema-Guided LLM Agent-based Knowledge Extraction System.☆65Updated last week
- ☆73Updated last week
- Evaluation tools for Retrieval-augmented Generation (RAG) methods.☆152Updated 5 months ago
- Codes for our paper "RQ-RAG: Learning to Refine Queries for Retrieval Augmented Generation"☆174Updated 8 months ago
- Repo for for paper "AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction".☆65Updated 9 months ago
- ☆144Updated this week
- StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization☆129Updated 3 months ago
- ☆34Updated 3 weeks ago
- Python implementation of AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, w…☆44Updated last month
- [EMNLP 2024: Demo Oral] RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation☆295Updated 6 months ago
- Code implement reposity of Paper HiQA☆100Updated 2 months ago
- [EMNLP 2024] LongRAG: A Dual-perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering☆102Updated 3 months ago
- ☆55Updated 6 months ago
- ☆144Updated 2 months ago
- The LLM of NL2GQL with NebulaGraph or Neo4j☆92Updated last year
- The official repository for the paper: Evaluation of Retrieval-Augmented Generation: A Survey.☆153Updated last week
- TableLLM: Enabling Tabular Data Manipulation by LLMs in Real Office Usage Scenarios☆195Updated 7 months ago
- ☆65Updated 7 months ago
- The official implementation of "LevelRAG: Enhancing Retrieval-Augmented Generation with Multi-hop Logic Planning over Rewriting Augmented…☆28Updated 3 weeks ago
- Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs".☆230Updated 8 months ago
- ☆33Updated last year
- Code for KaLM-Embedding models☆76Updated last month
- A Toolkit for Table-based Question Answering☆112Updated last year
- 中文原生检索增强生成测评基准☆115Updated last year
- [ACL24] Official repo for "Synthesizing Text-to-SQL Data from Weak and Strong LLMs"☆66Updated 8 months ago
- All-in-One: Text Embedding, Retrieval, Reranking and RAG in Transformers☆56Updated this week