Page Segmentation Code. I'm working with OCRopus and the UW-III data set to test how the page segmentation algorithms work with smaller strips of an image rather than the entire image.
☆20Feb 23, 2013Updated 13 years ago
Alternatives and similar repositories for page_segmentation
Users that are interested in page_segmentation are comparing it to the libraries listed below
Sorting:
- table understanding dataset for comparative evaluation of different table understanding algorithms☆13Jun 15, 2018Updated 7 years ago
- ☆40Aug 18, 2021Updated 4 years ago
- ☆15May 20, 2018Updated 7 years ago
- Collects multimedia content shared through social networks.☆19Feb 18, 2015Updated 11 years ago
- A recommender system for GitHub repositories☆14Jun 21, 2014Updated 11 years ago
- ☆71Apr 3, 2018Updated 7 years ago
- Weighted multiple-instance learning algorithm☆18Oct 9, 2018Updated 7 years ago
- A curated list (and summaries) of awesome research publications on topic of data extraction from photos of receipts.☆39Jan 6, 2023Updated 3 years ago
- Library with user interface elements and client-server communication classes based on Google Web Toolkit (GWT) that can be used for crowd…☆14Oct 3, 2017Updated 8 years ago
- common data interchange format for document processing pipelines that apply natural language processing tools to large streams of text☆35Sep 30, 2016Updated 9 years ago
- collection of fonts for Uyghur arabic script☆13Feb 4, 2019Updated 7 years ago
- Since I originally wrote this a module called request has come on the scene. You might want to try that before mucking about with extrac…☆26Nov 16, 2015Updated 10 years ago
- Generic framework for historical document processing☆382Jul 9, 2021Updated 4 years ago
- a deep learning model for page layout analysis / segmentation.☆101Nov 4, 2019Updated 6 years ago
- High-level Rust library that binds to Poppler to extract text from a PDF☆11Dec 16, 2020Updated 5 years ago
- Skeleton cookiecutter based structure for easily creating base structure for fastapi and database projects.☆14Oct 19, 2022Updated 3 years ago
- This is a Uyghur language text convert tool to display the text in gui programs ...☆14Aug 29, 2024Updated last year
- memory efficient densenet+lstm+ctc实现中文识别☆31Jun 21, 2022Updated 3 years ago
- using pvanet framework train mobilenet-v2 for objects detection, papaer: https://arxiv.org/abs/1611.08588☆13Feb 13, 2019Updated 7 years ago
- Returns true if a windows file path does not contain any invalid characters.☆12Jan 27, 2023Updated 3 years ago
- Blazingly fast neighborhood attention☆14Nov 28, 2023Updated 2 years ago
- Web-based page layout editor created for EMOP (Early Modern OCR Project).☆11May 21, 2021Updated 4 years ago
- Tools for handling GRNTI list☆10Sep 2, 2023Updated 2 years ago
- Arduino library to generate a PWM signal over a shift register (74HC595)☆12Sep 25, 2020Updated 5 years ago
- This is a simple demonstration for running Tensorflow inception v3 model on TensorRT☆12Jun 5, 2018Updated 7 years ago
- It's a unicode based visual CAPTCHA scheme that can be solved with 2-4 mouse clicks.☆11Feb 12, 2022Updated 4 years ago
- Perspective Transformation for Indoor Image Aesthetic Enhancement☆12Jan 8, 2020Updated 6 years ago
- A JS parser for (binary) `.npy` files.☆16Jan 3, 2023Updated 3 years ago
- Limiting concurrent operations in JavaScript.☆14Aug 28, 2018Updated 7 years ago
- Train small sequence models in your browser with WebGPU.☆33Dec 3, 2025Updated 3 months ago
- Ice segment plugin for Bluge☆12Jul 4, 2022Updated 3 years ago
- Solve ciphers with python☆10Oct 24, 2018Updated 7 years ago
- Node.js client library for interacting with the OpenSSH Agent☆23Mar 16, 2023Updated 3 years ago
- Training program for keras implement crnn in chinese-ocr project☆36May 30, 2018Updated 7 years ago
- Go Based Lightweight RAG / LLM Tool with CLI + API☆14Sep 28, 2023Updated 2 years ago
- A tool to find all duplicates in large sets of text documents.☆16Sep 29, 2021Updated 4 years ago
- Enables AI agents to use Google Maps features (geocoding, elevation, search, directions) via the Agent-to-Agent (A2A) protocol.☆17Apr 29, 2025Updated 10 months ago
- Implementation of Bidirectional Scene Text Recognition with a Single Decoder☆65Nov 24, 2024Updated last year
- Implementation of semi-supervised learning: UDA, MixMatch, Mean-teacher, focusing on NLP, powered by Pytorch☆12Jan 6, 2021Updated 5 years ago