joaodsmarques / LumberChunker
This repository presents the original implementation of LumberChunker: Long-Form Narrative Document Segmentation by André V. Duarte, João Marques, Miguel Graça, Miguel Freire, Lei Li and Arlindo L. Oliveira (accepted at EMNLP 2024 Findings)
☆52Updated 4 months ago
Alternatives and similar repositories for LumberChunker:
Users that are interested in LumberChunker are comparing it to the libraries listed below
- Repo for for paper "AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction".☆59Updated 6 months ago
- Meta-Chunking: Learning Efficient Text Segmentation via Logical Perception☆114Updated 3 weeks ago
- the newest version of llama3,source code explained line by line using Chinese☆22Updated 10 months ago
- TechGPT 2.0: Technology-Oriented Generative Pretrained Transformer 2.0☆105Updated 5 months ago
- [ACL24] Official repo for "Synthesizing Text-to-SQL Data from Weak and Strong LLMs"☆64Updated 6 months ago
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]☆77Updated last month
- 如需体验textin文档解析,请点击https://cc.co/16YSIy☆22Updated 7 months ago
- Imitate OpenAI with Local Models☆86Updated 5 months ago
- ☆54Updated 4 months ago
- ☆91Updated 2 months ago
- Evaluation tools for Retrieval-augmented Generation (RAG) methods.☆147Updated 3 months ago
- TianGong-AI-Unstructure☆58Updated 3 weeks ago
- ☆125Updated 3 weeks ago
- ☆32Updated 2 months ago
- ☆36Updated 9 months ago
- 中文原生检索增强生成测评基准☆109Updated 10 months ago
- The LLM of NL2GQL with NebulaGraph or Neo4j☆90Updated last year
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆28Updated 8 months ago
- A Toolkit for Table-based Question Answering☆109Updated last year
- StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization☆105Updated last month
- [ACL 2024 Findings] Learning Fine-Grained Grounded Citations for Attributed Large Language Models☆15Updated 3 months ago
- Informative Conversational Query Rewriting☆26Updated last year
- PGRAG☆47Updated 7 months ago
- [ACL 2024] IEPile: A Large-Scale Information Extraction Corpus☆184Updated last month
- [ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"☆125Updated 2 months ago
- Code implement reposity of Paper HiQA☆96Updated 7 months ago
- Official repository for paper "TableBench: A Comprehensive and Complex Benchmark for Table Question Answering"☆38Updated 4 months ago
- ☆168Updated 2 months ago
- Hammer: Robust Function-Calling for On-Device Language Models via Function Masking☆58Updated this week