Object Detection Model for Scanned Documents
☆94 · Mar 6, 2025 · Updated last year
Alternatives and similar repositories for Document-Layout-Analysis
Users that are interested in Document-Layout-Analysis are comparing it to the libraries listed below.
- YOLO models trained by DocLayNet - power your Document Intelligent by Layout Analysis ☆156 · Mar 10, 2026 · Updated last month
- DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis ☆431 · Feb 1, 2023 · Updated 3 years ago
- YOLOv10 trained on DocLayNet dataset. ☆80 · Nov 1, 2024 · Updated last year
- A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order. ☆320 · Aug 15, 2025 · Updated 8 months ago
- Document Layout Analysis ☆403 · Mar 27, 2026 · Updated last month
- Dataset and scripts for HRDoc ☆41 · Jun 21, 2023 · Updated 2 years ago
- [DACON] Gas Supply Demand Forecasting Model Development Competition, 3rd place ☆11 · Apr 12, 2022 · Updated 4 years ago
- ☆32 · Apr 14, 2024 · Updated 2 years ago
- [ICDAR 2023] SelfDocSeg: A self-supervised vision-based approach towards Document Segmentation (Oral) ☆42 · Oct 6, 2023 · Updated 2 years ago
- Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset ☆29 · Apr 16, 2023 · Updated 3 years ago
- ☆16 · Apr 13, 2023 · Updated 3 years ago
- Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset ☆50 · Apr 16, 2023 · Updated 3 years ago
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription. ☆13 · Sep 27, 2024 · Updated last year
- Checkbox Detection Model for Scanned Documents ☆95 · Mar 6, 2025 · Updated last year
- Reading order, Layoutreader ☆19 · May 8, 2025 · Updated 11 months ago
- Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition ☆285 · Sep 5, 2022 · Updated 3 years ago
- Table Structure Recognition ☆82 · Mar 11, 2023 · Updated 3 years ago
- ☆42 · Jun 15, 2024 · Updated last year
- DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception ☆2,134 · Apr 14, 2025 · Updated last year
- ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation ☆166 · May 14, 2025 · Updated 11 months ago
- AutoTag-YOLOv8 is an object detection project that uses the YOLOv8 model and leverages the power of SAM and DINGO models for automatic la… ☆13 · May 3, 2023 · Updated 2 years ago
- Document orientation classification ☆221 · Feb 3, 2026 · Updated 2 months ago
- The scripts for training Detectron2-based Layout Models on popular layout analysis datasets ☆218 · Sep 26, 2023 · Updated 2 years ago
- A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team… ☆1,829 · Mar 17, 2026 · Updated last month
- DocBank: A Benchmark Dataset for Document Layout Analysis ☆644 · Aug 12, 2024 · Updated last year
- Run AI Task on your Edge Device. ☆15 · Nov 19, 2025 · Updated 5 months ago
- A small tool that uses the OpenAI, Gemini API, etc.. to perform code reviews on GitLab merge requests. ☆12 · Dec 4, 2025 · Updated 4 months ago
- Korean Streaming ASR(with Denoiser and Conformer CTC) ☆43 · Apr 28, 2024 · Updated 2 years ago
- Repo ☆12 · Mar 7, 2022 · Updated 4 years ago
- ☆10 · Jun 22, 2020 · Updated 5 years ago
- MTL-TabNet: Multi-task Learning based Model for Image-based Table Recognition ☆103 · May 30, 2024 · Updated last year
- RepoCoder is a Python package that allows you to send your code for review using Large Language Models (LLMs) like Anthropic's Claude or … ☆15 · Nov 25, 2024 · Updated last year
- A Curated List of Awesome Table Structure Recognition (TSR) Research. Including models, papers, datasets and codes. Continuously updating… ☆227 · Sep 9, 2024 · Updated last year
- A Unified Toolkit for Deep Learning Based Document Image Analysis ☆5,720 · Aug 15, 2024 · Updated last year
- REST API which exposes endpoints for YOLOv8 inference, all running in a Docker Container ☆18 · Jul 17, 2024 · Updated last year
- This is my attempt at the ActivityNet Challenge 2017. Thanks to the organizers for providing the boilerplate code and annotated datasets.… ☆10 · Jul 19, 2017 · Updated 8 years ago
- Semantic Graph Representation Learning for Handwritten Mathematical Expression Recognition (ICDAR 2023) ☆14 · Aug 29, 2023 · Updated 2 years ago
- Code for AAAI 2023 Paper : “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models” ☆18 · Dec 6, 2022 · Updated 3 years ago
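- Fixes document distortion, blur, shadows, and similar defects, with simple, lightweight deployment using ONNX models; will keep following the latest and best document rectification methods and models. Correct document distortion using a lightweight ONNX model for easy deployment. We wi… ☆100 · Dec 17, 2025 · Updated 4 months ago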