magicpdf / Magic-DocLinks

conversion doc（pdf/html/doc/docx/ppt/pptx）to markdown

☆46

Alternatives and similar repositories for Magic-Doc

Users that are interested in Magic-Doc are comparing it to the libraries listed below

Sorting:

360AILAB-NLP / 360LayoutAnalysis
360LayoutAnaylsis, a series Document Analysis Models and Datasets deleveped by 360 AI Research Institute
☆292Updated 10 months ago
intsig-textin / markdown_tester
如需体验textin文档解析，请点击https://cc.co/16YSIy
☆111Updated 2 weeks ago
RapidAI / RapidLayout
Analysis of Chinese and English layouts 中英文版面分析
☆226Updated this week
linancn / TianGong-AI-Unstructure
TianGong-AI-Unstructure
☆68Updated last month
opendatalab / Miner-PDF-Benchmark
MPB (Miner-PDF-Benchmark) is an end-to-end PDF document comprehension evaluation suite designed for large-scale model data scenarios.
☆23Updated 7 months ago
shibing624 / agentica
Agentica: Effortlessly Build Intelligent, Reflective, and Collaborative Multimodal AI Agents! 构建智能的多模态AI Agent。
☆187Updated last week
LingyvKong / OneChart
[ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"
☆224Updated 3 months ago
THUDM / LongCite
LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA
☆502Updated 6 months ago
MigoXLab / dingo
Dingo: A Comprehensive AI Data Quality Evaluation Tool
☆288Updated this week
Veason-silverbullet / ViTLP
[NAACL 2024] Visually Guided Generative Text-Layout Pre-training for Document Intelligence
☆144Updated 10 months ago
riddle911 / SuperInsights
☆66Updated 9 months ago
shibing624 / deep-research
Python implementation of AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, w…
☆46Updated 3 months ago
ictnlp / FlexRAG
FlexRAG: A RAG Framework for Information Retrieval and Generation.
☆194Updated last month
CLUEbenchmark / SuperCLUE-RAG
中文原生检索增强生成测评基准
☆119Updated last year
YuhangWuAI / tablerag
made RAG pipeline better in table data
☆90Updated 9 months ago
OpenSearch-AI / OpenSearch-SQL
OpenSearch-SQL code
☆129Updated last month
xverse-ai / XVERSE-65B
XVERSE-65B: A multilingual large language model developed by XVERSE Technology Inc.
☆139Updated last year
Reason-Wang / ToolGen
[ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"
☆150Updated 3 months ago
QwenLM / vllm-gptq
A high-throughput and memory-efficient inference and serving engine for LLMs
☆136Updated 7 months ago
open-sciencelab / GraphGen
GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation
☆251Updated this week
RapidAI / RapidTable
基于序列表格识别算法推理库，集成PP-Structure和modelscope等表格识别算法。
☆332Updated this week
opendatalab / OmniDocBench
[CVPR 2025] A Comprehensive Benchmark for Document Parsing and Evaluation
☆602Updated last week
tpoisonooo / ROGRAG
[ACL2025 demo track] ROGRAG: A Robustly Optimized GraphRAG Framework
☆163Updated 2 weeks ago
riddle911 / autobid
☆61Updated 4 months ago
MetaGLM / LawGLM
探索 LLM 在法律行业的应用潜力
☆90Updated 7 months ago
shell-nlp / gpt_server
gpt_server是一个用于生产级部署LLMs、Embedding、Reranker、ASR和TTS的开源框架。
☆199Updated last week
ppaanngggg / layoutreader
A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order.
☆257Updated last month
TebooNok / HiQA
Code implement reposity of Paper HiQA
☆101Updated 4 months ago
Alpha-Innovator / StructEqTable-Deploy
A High-efficiency Open-source Toolkit for Table-to-Latex Task
☆253Updated 7 months ago
llm-factory / imitater
Imitate OpenAI with Local Models
☆87Updated 10 months ago