Page Segmentation Code. I'm working with OCRopus and the UW-III data set to test how the page segmentation algorithms work with smaller strips of an image rather than the entire image.
☆20Feb 23, 2013Updated 13 years ago
Alternatives and similar repositories for page_segmentation
Users that are interested in page_segmentation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Tools for web page segmentation. In development☆17Nov 7, 2018Updated 7 years ago
- This project deals with hierarchical classification of web pages based on dmoz dataset.☆14Apr 10, 2014Updated 12 years ago
- A semantic web crawler☆20Sep 20, 2010Updated 15 years ago
- Automatically generate changelogs for your repositories by CLI.☆11May 22, 2025Updated last year
- Collects multimedia content shared through social networks.☆19Feb 18, 2015Updated 11 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A recommender system for GitHub repositories☆14Jun 21, 2014Updated 12 years ago
- This repository is the official implementation of `A Semantic-based Arbitrarily-Oriented Scene Text Detector`(named STD++ as it is the im…☆29Aug 14, 2019Updated 6 years ago
- ☆71Apr 3, 2018Updated 8 years ago
- Weighted multiple-instance learning algorithm☆18Oct 9, 2018Updated 7 years ago
- A curated list (and summaries) of awesome research publications on topic of data extraction from photos of receipts.☆41Jan 6, 2023Updated 3 years ago
- 复现论文《Pixel-Anchor: A Fast Oriented Scene Text Detector with Combined Networks》☆26Nov 26, 2018Updated 7 years ago
- common data interchange format for document processing pipelines that apply natural language processing tools to large streams of text☆35Sep 30, 2016Updated 9 years ago
- Layout Analysis Evaluator for the ICDAR 2017 competition on Layout Analysis for Challenging Medieval Manuscripts☆23May 17, 2019Updated 7 years ago
- Simple Docker Compose Drupal Setup☆10Oct 24, 2016Updated 9 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Document Classification and Post-OCR Key Value Extraction☆62Nov 6, 2019Updated 6 years ago
- Since I originally wrote this a module called request has come on the scene. You might want to try that before mucking about with extrac…☆26Nov 16, 2015Updated 10 years ago
- Generic framework for historical document processing☆383Jul 9, 2021Updated 4 years ago
- a deep learning model for page layout analysis / segmentation.☆101Nov 4, 2019Updated 6 years ago
- using pvanet framework train mobilenet-v2 for objects detection, papaer: https://arxiv.org/abs/1611.08588☆13Feb 13, 2019Updated 7 years ago
- Image thumbnailing middleware for Connect.js/Express.js utilizing Smartcrop.js☆30Apr 3, 2018Updated 8 years ago
- Open Source Booking & Room Management module for Drupal☆21Aug 9, 2019Updated 6 years ago
- Returns true if a windows file path does not contain any invalid characters.☆12Jan 27, 2023Updated 3 years ago
- Blazingly fast neighborhood attention☆15Nov 28, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Web-based page layout editor created for EMOP (Early Modern OCR Project).☆11May 21, 2021Updated 5 years ago
- Custom migration code for miscellaneous D8 fields and content☆13Nov 27, 2019Updated 6 years ago
- Tools for handling GRNTI list☆10Sep 2, 2023Updated 2 years ago
- MXNet finetune baseline (res152) for iNaturalist Challenge at FGVC 2017☆30Jun 15, 2017Updated 9 years ago
- Arduino library to generate a PWM signal over a shift register (74HC595)☆12Sep 25, 2020Updated 5 years ago
- CRNN with attention to do OCR,add Chinese recognition☆338Jun 16, 2024Updated 2 years ago
- Perspective Transformation for Indoor Image Aesthetic Enhancement☆12Jan 8, 2020Updated 6 years ago
- 2019年达观杯智能信息抽取挑战赛获奖方案☆17Dec 28, 2019Updated 6 years ago
- This repository contains notebooks showing how to perform mixed precision training in tf.keras 2.0☆12Dec 15, 2019Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A JS parser for (binary) `.npy` files.☆16Jan 3, 2023Updated 3 years ago
- Limiting concurrent operations in JavaScript.☆14Aug 28, 2018Updated 7 years ago
- Based EAST implements "Self-organized Text Detection with Minimal Post-processing via Border Learning"☆16Nov 7, 2018Updated 7 years ago
- Custom button to show loading state with an activity indicator sit next to title label.☆11Mar 2, 2016Updated 10 years ago
- Train small sequence models in your browser with WebGPU.☆35Dec 3, 2025Updated 6 months ago
- Implementation of the paper "Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network"☆16Nov 1, 2019Updated 6 years ago
- Node.js client library for interacting with the OpenSSH Agent☆24Mar 16, 2023Updated 3 years ago