Page Segmentation Code. I'm working with OCRopus and the UW-III data set to test how the page segmentation algorithms work with smaller strips of an image rather than the entire image.
☆20 · Feb 23, 2013 · Updated 13 years ago
Alternatives and similar repositories for page_segmentation
Users that are interested in page_segmentation are comparing it to the libraries listed below.
- Tools for web page segmentation. In development ☆17 · Nov 7, 2018 · Updated 7 years ago
- table understanding dataset for comparative evaluation of different table understanding algorithms ☆13 · Jun 15, 2018 · Updated 7 years ago
- Web page segmentation and noise removal ☆55 · Feb 4, 2024 · Updated 2 years ago
- This project deals with hierarchical classification of web pages based on dmoz dataset. ☆14 · Apr 10, 2014 · Updated 12 years ago
- ☆40 · Aug 18, 2021 · Updated 4 years ago
- A semantic web crawler ☆20 · Sep 20, 2010 · Updated 15 years ago
- A recommender system for GitHub repositories ☆14 · Jun 21, 2014 · Updated 11 years ago
- This repository is the official implementation of `A Semantic-based Arbitrarily-Oriented Scene Text Detector`(named STD++ as it is the im… ☆29 · Aug 14, 2019 · Updated 6 years ago
- ☆71 · Apr 3, 2018 · Updated 8 years ago
- A curated list (and summaries) of awesome research publications on topic of data extraction from photos of receipts. ☆40 · Jan 6, 2023 · Updated 3 years ago
- Universal Character Recognizer (UCR): Simple, Intuitive, Extensible, Multi-Lingual OCR engine ☆15 · Apr 23, 2021 · Updated 5 years ago
- Library with user interface elements and client-server communication classes based on Google Web Toolkit (GWT) that can be used for crowd… ☆14 · Oct 3, 2017 · Updated 8 years ago
- Layout Analysis Evaluator for the ICDAR 2017 competition on Layout Analysis for Challenging Medieval Manuscripts ☆23 · May 17, 2019 · Updated 7 years ago
- Collection of fonts for Uyghur Arabic script ☆13 · Feb 4, 2019 · Updated 7 years ago
- Since I originally wrote this a module called request has come on the scene. You might want to try that before mucking about with extrac… ☆26 · Nov 16, 2015 · Updated 10 years ago
- Generic framework for historical document processing ☆383 · Jul 9, 2021 · Updated 4 years ago
- a deep learning model for page layout analysis / segmentation. ☆101 · Nov 4, 2019 · Updated 6 years ago
- High-level Rust library that binds to Poppler to extract text from a PDF ☆11 · Dec 16, 2020 · Updated 5 years ago
- Skeleton cookiecutter based structure for easily creating base structure for fastapi and database projects. ☆14 · Oct 19, 2022 · Updated 3 years ago
- A Uyghur-language text conversion tool for displaying text in GUI programs ... ☆14 · Aug 29, 2024 · Updated last year
- Deep learning for named entity recognition on CoNLL-2003 ☆10 · Dec 23, 2016 · Updated 9 years ago
- Memory-efficient densenet+lstm+ctc implementation for Chinese text recognition ☆31 · Jun 21, 2022 · Updated 3 years ago
- Training mobilenet-v2 for object detection using the pvanet framework; paper: https://arxiv.org/abs/1611.08588 ☆13 · Feb 13, 2019 · Updated 7 years ago
- Arbitrary-Oriented Scene Text Detection via Rotation Proposals (TMM 2018) ☆435 · Oct 16, 2020 · Updated 5 years ago
- Image thumbnailing middleware for Connect.js/Express.js utilizing Smartcrop.js ☆30 · Apr 3, 2018 · Updated 8 years ago
- LOC Standards, Schemas, Stylesheets, etc. ☆11 · Sep 30, 2025 · Updated 7 months ago
- Returns true if a windows file path does not contain any invalid characters. ☆12 · Jan 27, 2023 · Updated 3 years ago
- Web-based page layout editor created for EMOP (Early Modern OCR Project). ☆11 · May 21, 2021 · Updated 5 years ago
- MXNet finetune baseline (res152) for iNaturalist Challenge at FGVC 2017 ☆30 · Jun 15, 2017 · Updated 8 years ago
- This is a simple demonstration for running Tensorflow inception v3 model on TensorRT ☆12 · Jun 5, 2018 · Updated 7 years ago
- Knowledge Graph based Question Answering benchmark. ☆10 · Feb 1, 2020 · Updated 6 years ago
- Perspective Transformation for Indoor Image Aesthetic Enhancement ☆12 · Jan 8, 2020 · Updated 6 years ago
- A JS parser for (binary) `.npy` files. ☆16 · Jan 3, 2023 · Updated 3 years ago
- An EAST-based implementation of "Self-organized Text Detection with Minimal Post-processing via Border Learning" ☆16 · Nov 7, 2018 · Updated 7 years ago
- Train small sequence models in your browser with WebGPU. ☆34 · Dec 3, 2025 · Updated 5 months ago
- Simple OCR service using deep learning ☆59 · Oct 1, 2020 · Updated 5 years ago
- Implementation of the paper "Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network" ☆16 · Nov 1, 2019 · Updated 6 years ago
- Solve ciphers with python ☆10 · Oct 24, 2018 · Updated 7 years ago
- Node.js client library for interacting with the OpenSSH Agent ☆24 · Mar 16, 2023 · Updated 3 years ago