joaodsmarques / LumberChunker
This repository presents the original implementation of LumberChunker: Long-Form Narrative Document Segmentation by André V. Duarte, João Marques, Miguel Graça, Miguel Freire, Lei Li and Arlindo L. Oliveira (under review for EMNLP 2024)
☆27Updated 2 weeks ago
Related projects:
- Official repository for paper "TableBench: A Comprehensive and Complex Benchmark for Table Question Answering"☆27Updated 3 weeks ago
- The newest version of Llama 3, with source code explained line by line in Chinese☆21Updated 5 months ago
- TianGong-AI-Unstructure☆48Updated this week
- [ACL24] Official repo for "Synthesizing Text-to-SQL Data from Weak and Strong LLMs"☆58Updated last month
- Chinese-native retrieval-augmented generation evaluation benchmark☆92Updated 5 months ago
- Leveraging passage embeddings for efficient listwise reranking with large language models.☆27Updated 2 months ago
- 1.4B sLLM for Chinese and English - HammerLLM🔨☆43Updated 5 months ago
- General layout analysis | Chinese document parsing | Document Layout Analysis | layout parser☆41Updated 3 months ago
- ☆57Updated 3 weeks ago
- code for piccolo embedding model from SenseTime☆93Updated 3 months ago
- To try textin document parsing, visit https://cc.co/16YSIy☆23Updated 2 months ago
- Code implementation repository for the paper HiQA☆86Updated 2 months ago
- Tool for converting LLMs from uni-directional to bi-directional by removing causal mask for tasks like classification and sentence embedd…☆39Updated 2 months ago
- The LLM of NL2GQL with NebulaGraph or Neo4j☆83Updated 9 months ago
- ☆90Updated 5 months ago
- A Toolkit for Table-based Question Answering☆94Updated 11 months ago
- Leveraging large language models for text-to-SQL synthesis, this project fine-tunes WizardLM/WizardCoder-15B-V1.0 with QLoRA on a custom …☆43Updated 9 months ago
- Evaluation for AI apps and agents☆35Updated 8 months ago
- Recursive Abstractive Processing for Tree-Organized Retrieval☆11Updated 3 months ago
- Fast LLM training codebase with dynamic strategy selection [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler]☆32Updated 8 months ago
- ☆32Updated 3 months ago
- Scripts for bge inference optimization☆18Updated 7 months ago
- Code for our paper "RQ-RAG: Learning to Refine Queries for Retrieval Augmented Generation"☆95Updated last month
- (NBCE) Naive Bayes-based Context Extension on ChatGLM-6b☆14Updated last year
- Repo for the paper "AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction".☆37Updated last month
- SmartSearch: Building a quick conversation-based search engine with LLMs.☆42Updated 4 months ago
- Psychological Counselor's Digital Twin Framework☆24Updated 2 weeks ago
- Evaluation tools for Retrieval-augmented Generation (RAG) methods.☆112Updated 2 months ago
- 1st Solution For Conversational Multi-Doc QA Workshop & International Challenge @ WSDM'24 - Xiaohongshu.Inc☆151Updated 6 months ago
- Qwen-WisdomVast is a large model trained on 1 million high-quality Chinese multi-turn SFT data, 200,000 English multi-turn SFT data, and …☆18Updated 5 months ago