intsig-textin / textin-ocr-frontendLinks
☆25Updated 4 months ago
Alternatives and similar repositories for textin-ocr-frontend
Users that are interested in textin-ocr-frontend are comparing it to the libraries listed below
Sorting:
- 如需体验TextIn文档解析,请访问 https://cc.co/16YSIy☆164Updated 3 months ago
- Analysis of Chinese and English layouts 中英文版面分析☆244Updated last month
- [NAACL 2024] Visually Guided Generative Text-Layout Pre-training for Document Intelligence☆146Updated last year
- 如需体验textin文档解析,请点击https://cc.co/16YSIy☆120Updated 2 months ago
- Convert files into markdown to help RAG or LLM understand, based on markitdown and MinerU, which could provide high quality pdf parser.☆126Updated 5 months ago
- 中文论文、证券类、财报类PDF数据☆34Updated last year
- Based on RapidOCR, extract the PDF content☆182Updated 4 months ago
- 修正文档扭曲/模糊/阴影等情况,使用onnx模型简单轻量部署,未来持续跟进最新最好的文档矫正方案和模型,Correct document distortion using a lightweight ONNX model for easy deployment. We wi…☆80Updated 9 months ago
- bisheng-unstructured library☆55Updated 4 months ago
- ✨🦋 illufly - 【幻蝶】基于记忆蒸馏、资料检索的自我进化智能体☆72Updated 3 months ago
- ChatBI is a BI system interacted by chat with LLM☆284Updated 6 months ago
- Data annotation component library --provided as NPM packages☆128Updated last month
- 文档方向分类☆225Updated 9 months ago
- TorchV开源的解析代码仓库☆136Updated last month
- python package to parse pdfs with different parsers☆202Updated last week
- 检测和提取各种场景图片中的表格区域,并纠正透视和旋转问题 Detect and extract table regions from images in various scenarios, and correct perspective and rotation i…☆109Updated 9 months ago
- 360LayoutAnaylsis, a series Document Analysis Models and Datasets deleveped by 360 AI Research Institute☆299Updated last year
- 基于序列表格识别算法推理库,集成PP-Structure和modelscope等表格识别算法。☆372Updated 2 weeks ago
- MinerU API Server☆20Updated last year
- Nimir 是一个基于 workflow 的标注、训练、推理一体化平台。它提供了直观的用户界面和强大的功能,通过工作流的方式将数据处理全流程有机地串联起来,实现端到端的 AI 应用开发。☆37Updated 10 months ago
- PDF解析工具:GOT的vLLM加速实现,MinerU做布局识别裁剪、GOT做表格公式解析,实现RAG中的pdf解析☆62Updated 10 months ago
- 标书大模型(Proposal-LLM Chinese version )☆271Updated 10 months ago
- 读光中英文OCR onnx 版本模型使用 | Code for using the ONNX version of DuGuang OCR in both Chinese and English☆45Updated 3 months ago
- 研究GOT-OCR-项目落地加速,不限语言☆61Updated 10 months ago
- 😆 Generate PPT by LLM follow your template. 📢 Not only use llm to generate ppt, but also according to your favorite ppt template. Just…☆92Updated last year
- SmolDocling OCR App built using SmolDocling 256M Model and Streamlit.☆160Updated 5 months ago
- 一款基于react-flow的工作流编辑器☆151Updated 10 months ago
- 识别图像中的表格+OCR识别☆22Updated last year
- Converted the Jina Tokenizer regex pattern to python.☆26Updated last year