opendatalab / magic-doc
☆463Updated 8 months ago
Alternatives and similar repositories for magic-doc:
Users that are interested in magic-doc are comparing it to the libraries listed below
- The Open-Source Data Annotation Platform☆765Updated last month
- Data annotation toolbox supports image, audio and video data.☆1,130Updated this week
- ☆418Updated 3 weeks ago
- 万卷1.0多模态语料☆556Updated last year
- Dingo: A Comprehensive Data Quality Evaluation Tool☆109Updated this week
- Analysis of Chinese and English layouts 中英文版面分析☆187Updated last week
- 基于序列表格识别算法推理库,集成PP-Structure和modelscope等表格识别算法。☆252Updated 2 months ago
- 360LayoutAnaylsis, a series Document Analysis Models and Datasets deleveped by 360 AI Research Institute☆271Updated 6 months ago
- 添加🚀流式 Web 服务到 GraphRAG,兼容 OpenAI SDK,支持可访问的实体链接🔗,支持建议问题,兼容本地嵌入模型,修复诸多问题。Add streaming web server to GraphRAG, compatible with OpenAI SD…☆242Updated last week
- UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition☆294Updated last week
- ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents☆412Updated 2 weeks ago
- 📝 针对文档类图像做内容提取,将文档类图像一比一输出到Word或者Txt中,便于进一步使用或处理。后续计划支持输入PDF/图像,输出对应json格式、Txt格式、Word格式和Markdown格式。☆188Updated 5 months ago
- A python wrapper for the Doc2X API and comes with native texts processing (to improve PDF recall in RAG). | Doc2X API的python封装,同时附带本地的文本处…☆247Updated last month
- datasets resource☆107Updated 3 weeks ago
- GraphRAG的应用实例,项目特点在于提供了替换OpenAI模型的方法,并通过修改原有提示和切分文档的方法,提高了GraphRAG处理中文内容的能力。☆131Updated 5 months ago
- gpt_server是一个用于生产级部署LLMs或Embedding的开源框架。☆163Updated this week
- 一个适合学习、使用、自主扩展的RAG【检索增强生成】系统!可联网做AI搜索☆473Updated 7 months ago
- A High-efficiency Open-source Toolkit for Table-to-Latex Task☆223Updated 3 months ago
- 如需体验textin文档解析,请点击https://cc.co/16YSIy☆83Updated 4 months ago
- 整理目前开源的最优表格识别模型,完善前后处理,模型转换为ONNX Organize the currently open-source optimal table recognition models, improve pre-processing and post…☆622Updated this week
- Build & Optimize your RAG.☆588Updated last week
- A python native agent framework☆447Updated 4 months ago
- GraphRAG-Ollama-UI + GraphRAG4OpenWebUI 融合版(有gradio webui配置生成RAG索引,有fastapi提供RAG API服务)☆102Updated 7 months ago
- A pre-built agent for TableGPT2.☆549Updated last week
- DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception☆1,012Updated last week
- 文档方向分类☆216Updated 4 months ago
- Based on RapidOCR, extract the PDF content.☆157Updated last week
- GOT-OCR的GUI版本,提供OCR、导出PDF、批处理等功能,但不提供训练功能☆161Updated last week
- LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA☆484Updated 3 months ago
- ☆234Updated 3 months ago