A model(ing framework) for sample efficient OCR
☆64Apr 7, 2023Updated 2 years ago
Alternatives and similar repositories for effocr
Users who are interested in effocr are comparing it to the libraries listed below.
- ☆20Jul 22, 2021Updated 4 years ago
- uncover old chinese textual parallels based on sound☆15Feb 23, 2026Updated last month
- HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023]☆14Jul 11, 2023Updated 2 years ago
- Official repository accompanying the ICDAR 2023 paper☆13Oct 3, 2023Updated 2 years ago
- Cross-lingual learning in scene text recognition (ICASSP2024)☆18Sep 29, 2024Updated last year
- ☆10Oct 2, 2024Updated last year
- Small python package to measure OCR quality and other related metrics.☆27Feb 19, 2024Updated 2 years ago
- ☆13Jun 25, 2019Updated 6 years ago
- ☆15Mar 8, 2024Updated 2 years ago
- nnanno is a collection of tools that sample, annotate and apply computer vision to the Newspaper Navigator dataset☆17Oct 16, 2024Updated last year
- time-series row column classification☆14Jan 7, 2022Updated 4 years ago
- Chinese character variant converter. 中文异体字转换器。☆22Oct 17, 2025Updated 5 months ago
- Detect and align similar passages☆118Mar 17, 2026Updated last week
- ☆10Oct 15, 2019Updated 6 years ago
- Source code for the paper "Post-OCR Document Correction with Large Ensembles of Character Sequence-to-Sequence Models"☆39Dec 2, 2023Updated 2 years ago
- Data and code: "Answering legal questions from laymen in German civil law system", Büttner & Habernal, EACL'24☆14Mar 2, 2024Updated 2 years ago
- 🔍 Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment☆11Apr 6, 2025Updated 11 months ago
- T-Projection is a method to perform high-quality Annotation Projection of Sequence Labeling datasets.☆13Nov 21, 2023Updated 2 years ago
- RoDLA: Benchmarking the Robustness of Document Layout Analysis Models☆39Mar 26, 2025Updated last year
- Named Entity Recognition☆19Feb 13, 2026Updated last month
- Korean politics data for research and development.☆12Jun 21, 2016Updated 9 years ago
- Repository for contributions for Data Generation for Post-OCR correction of Cyrillic handwriting paper☆21Nov 27, 2023Updated 2 years ago
- ☆12Jan 29, 2021Updated 5 years ago
- Minimal code to train ELMo models in recent versions of TensorFlow☆14Apr 30, 2023Updated 2 years ago
- code for paper "Accessing higher dimensions for unsupervised word translation"☆22Jun 26, 2023Updated 2 years ago
- Portal for the course "Economic Slack" at UCSC [ECON 221]☆38Dec 13, 2025Updated 3 months ago
- Installs and manages Stata programs tracked as git repositories.☆11Sep 13, 2017Updated 8 years ago
- This file maps a given list of company names to their proper website and also maps a given list of websites to the company name.☆15Nov 16, 2018Updated 7 years ago
- using pvanet framework train mobilenet-v2 for objects detection, paper: https://arxiv.org/abs/1611.08588☆13Feb 13, 2019Updated 7 years ago
- some python scripts for Stock and Funds☆11Sep 13, 2018Updated 7 years ago
- The current version of Data by Design, an interactive history of data visualization☆13Mar 13, 2026Updated 2 weeks ago
- This repository is part of an NLP course for humanities and cultural studies. This course uses historical newspapers as a source and appl…☆19Jun 5, 2025Updated 9 months ago
- Document Layout Analysis☆404Updated this week
- A supplementary material to "The Evolution of Work in the United States"☆12Jun 23, 2021Updated 4 years ago
- Utilities and applications for the FlatGov project by Demand Progress☆16Feb 8, 2023Updated 3 years ago
- OCR-D wrapper for detectron2 based segmentation models☆17May 1, 2025Updated 10 months ago
- AES - Ancient Egyptian Sentences; Corpus of Ancient Egyptian sentences for corpus-linguistic research☆10May 18, 2021Updated 4 years ago
- A public dataset containing chord/beat annotation from a music game named 'osu!'.☆11Oct 17, 2017Updated 8 years ago
- Layout Analysis Dataset with Segmonto (LADaS)☆24Jul 12, 2025Updated 8 months ago