magicpdf / Magic-Doc
conversion doc(pdf/html/doc/docx/ppt/pptx)to markdown
☆35Updated 5 months ago
Alternatives and similar repositories for Magic-Doc:
Users that are interested in Magic-Doc are comparing it to the libraries listed below
- MPB (Miner-PDF-Benchmark) is an end-to-end PDF document comprehension evaluation suite designed for large-scale model data scenarios.☆20Updated last month
- 通用版面分析 | 中文文档解析 |Document Layout Analysis | layout paser☆45Updated 7 months ago
- Imitate OpenAI with Local Models☆85Updated 4 months ago
- 360LayoutAnaylsis, a series Document Analysis Models and Datasets deleveped by 360 AI Research Institute☆259Updated 4 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆127Updated last month
- 中文原生检索增强生成测评基准☆105Updated 9 months ago
- official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"☆135Updated 7 months ago
- Repo for Paper "Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline Summarization"☆61Updated last week
- TianGong-AI-Unstructure☆56Updated 2 weeks ago
- ☆62Updated 4 months ago
- Analysis of Chinese and English layouts 中英文版面分析☆156Updated 3 weeks ago
- ☆112Updated 2 months ago
- A demo built on Megrez-3B-Instruct, integrating a web search tool to enhance the model's question-and-answer capabilities.☆33Updated last month
- The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"☆119Updated last month
- Meta-Chunking: Learning Efficient Text Segmentation via Logical Perception☆104Updated last month
- ☆221Updated 8 months ago
- A Comprehensive Benchmark for Document Parsing and Evaluation☆201Updated this week
- ☆24Updated 3 months ago
- code for piccolo embedding model from SenseTime☆117Updated 8 months ago
- [ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"☆213Updated last month
- 如需体验textin文档解析,请点击https://cc.co/16YSIy☆70Updated 2 months ago
- Code implement reposity of Paper HiQA☆96Updated 6 months ago
- [ACL 2024] IEPile: A Large-Scale Information Extraction Corpus☆182Updated last week
- ☆159Updated last month
- Mixture-of-Experts (MoE) Language Model☆183Updated 4 months ago
- the newest version of llama3,source code explained line by line using Chinese☆22Updated 9 months ago
- 研究GOT-OCR-项目落地加速,不限语言☆57Updated 2 months ago
- XVERSE-65B: A multilingual large language model developed by XVERSE Technology Inc.☆133Updated 9 months ago
- 大模型检索增强生成技术最佳实践。☆54Updated 4 months ago
- A High-efficiency Open-source Toolkit for Table-to-Latex Task☆192Updated last month