hwb96 / markdown-structure-splitterLinks
一个为RAG系统设计的Markdown文档工具,提供标题结构自动抽取和文档分割两大功能。完整保留文档层级结构,解决传统切分器丢失标题层级与破坏表格完整性的问题。A hierarchy-preserving Markdown document splitter for RAG (Retrieval-Augmented Generation) systems that maintains document structure and table integrity.
☆12Updated last year
Alternatives and similar repositories for markdown-structure-splitter
Users that are interested in markdown-structure-splitter are comparing it to the libraries listed below
Sorting:
- Convert files into markdown to help RAG or LLM understand, based on markitdown and MinerU, which could provide high quality pdf parser.☆132Updated 10 months ago
- 通过paddle ocr实现pdf转markdown☆79Updated last year
- DSPy中文文档☆49Updated last year
- 基于 Dify + Langfuse 的自动化评估服务☆88Updated 8 months ago
- A LLM RAG system runs on your laptop. 大模型检索增强生成系统,可以轻松部署在笔记本电脑上,实现本地知识库智能问答。企业级SaaS版本请访问:☆308Updated last week
- dify's rag patch module☆277Updated 5 months ago
- PDF解析(文字,章节,表格,图片,参考),基于大模型(ChatGLM2-6B, RWKV)+langchain+streamlit的PDF问答,摘要,信息抽取☆213Updated 2 years ago
- A collection of RAG systems powered by LLM.☆216Updated 11 months ago
- MinerU API server☆85Updated last year
- Agentica: Effortlessly Build Intelligent, Reflective, and Collaborative Multimodal AI Agents! 构建智能的多模态AI Agent。☆244Updated this week
- RAG 系列教程源码仓库☆101Updated 9 months ago
- 在RAG技术中,嵌入向量的生成和匹配是关键环节。本文介绍了一种基于CLIP/BLIP模型的嵌入服务,该服务支持文本和图像的嵌入生成与相似度计算,为多模态信息检索提供了基础能力。☆42Updated last year
- XRAG: eXamining the Core - Benchmarking Foundational Component Modules in Advanced Retrieval-Augmented Generation☆116Updated this week
- SDK for Dify plugins☆123Updated this week
- OpenSearch-SQL code☆166Updated 8 months ago
- A method and corresponding code for automatic description generation for Text-to-SQL☆107Updated 5 months ago
- 筱可的工程实验仓库!☆109Updated 3 months ago
- gpt_server是一个用于生产级部署LLMs、Embedding、Reranker、ASR、TTS、文生图、图片编辑和文生视频的开源框架。☆244Updated last week
- LightRAG与GraphRAG在索引构建、检索测试中的耗时、模型请求次数、Token消耗金额、检索质量等方面进行对比☆159Updated last year
- MCP Agent Graph is a Multi-Agent System built on the principles of Context Engineering☆187Updated last month
- TianGong-AI-Unstructure☆69Updated this week
- Intelligent data apps and assets with LLMs☆186Updated 11 months ago
- [ACL2025 demo track] ROGRAG: A Robustly Optimized GraphRAG Framework☆194Updated last month
- Web one-click mode full process platform, including train data upload, fine-tuning, model merge, model deploy, gpu monitor etc., no need …☆19Updated 2 years ago
- 本项目主要介绍prompt工程相关用例。包括模拟智能推荐客服系统构建和问答、思维链、自洽性、思维树等相关进阶demo,旨在帮助大家理解prompt。通过一份代码实现了同时支持多种大模型(如OpenAI、阿里通义千问等)并使用FastAPI对应用进行API封装。☆51Updated last year
- ChatGPT WebUI using gradio. 给 LLM 对话和检索知识问答RAG提供一个简单好用的Web UI界面☆139Updated last year
- knowledge graph, llm, knowledge-intensive domains☆112Updated 8 months ago
- 探索 LLM 在法律行业的应用潜力☆96Updated last year
- ☆36Updated 3 months ago
- ☆46Updated 9 months ago