A model(ing framework) for sample efficient OCR
☆64Apr 7, 2023Updated 3 years ago
Alternatives and similar repositories for effocr
Users that are interested in effocr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Noise-robust de-duplication at scale☆19Apr 9, 2023Updated 3 years ago
- ☆14Feb 20, 2024Updated 2 years ago
- ☆20Jul 22, 2021Updated 4 years ago
- uncover old chinese textual parallels based on sound☆16Updated this week
- HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023]☆14Jul 11, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Official repository accompaying the ICDAR 2023 paper☆13Oct 3, 2023Updated 2 years ago
- Inital build of digital edition of Capital Volume 1 using Ed. and hypothes.is☆13Jan 20, 2023Updated 3 years ago
- Cross-lingual learning in scene text recognition (ICASSP2024)☆18Sep 29, 2024Updated last year
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…☆29Apr 17, 2024Updated 2 years ago
- ☆10Oct 2, 2024Updated last year
- Template for research repository using scons.☆14Updated this week
- Website for Harvard's Gov 50 in Fall 2023☆13Dec 5, 2023Updated 2 years ago
- ☆15Mar 8, 2024Updated 2 years ago
- My user-written commands/packages for data analysis in Stata☆20Oct 12, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This is an R wrapper for the APIs on government of India's open data platform - data.gov.in.☆18Sep 22, 2024Updated last year
- time-series row column classification☆14Jan 7, 2022Updated 4 years ago
- Slides and jupter notebooks for course on text analysis and machine learning for social science☆26Aug 18, 2021Updated 4 years ago
- ☆10Oct 15, 2019Updated 6 years ago
- Template repository for research papers.☆117Nov 2, 2022Updated 3 years ago
- Source code for the paper "Post-OCR Document Correction with Large Ensembles of Character Sequence-to-Sequence Models"☆39Dec 2, 2023Updated 2 years ago
- Data and code: "Answering legal questions from laymen in German civil law system", Büttner & Habernal, EACL'24☆14Mar 2, 2024Updated 2 years ago
- [ACL 2025] 🔍 Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment☆11Apr 6, 2025Updated last year
- T-Projection is a method to perform high-quality Annotation Projection of Sequence Labeling datasets.☆13Nov 21, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆13Nov 3, 2024Updated last year
- Python code for the procedure in Duarte, Magnolfi, Sølvsten, and Sullivan (2023) to test firm conduct.☆14Apr 11, 2026Updated last week
- ☆41Jun 15, 2024Updated last year
- ☆24Jul 25, 2024Updated last year
- Named Entity Recognition☆19Feb 13, 2026Updated 2 months ago
- Korean politics data for research and development.☆12Jun 21, 2016Updated 9 years ago
- Strips boilerplate from Project Gutenberg text files☆17Jul 28, 2021Updated 4 years ago
- ☆12Jan 29, 2021Updated 5 years ago
- Repository for contributions for Data Generation for Post-OCR correction of Cyrillic handwriting paper☆21Nov 27, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Minimal code to train ELMo models in recent versions of TensorFlow☆14Apr 30, 2023Updated 2 years ago
- Resources related to EMNLP 2021 paper "FAME: Feature-Based Adversarial Meta-Embeddings for Robust Input Representations"☆13Dec 14, 2021Updated 4 years ago
- code for paper "Accessing higher dimensions for unsupervised word translation"☆22Jun 26, 2023Updated 2 years ago
- Hanja Understanding Evaluation Dataset☆15May 2, 2022Updated 3 years ago
- Portal for the course "Economic Slack" at UCSC [ECON 221]☆38Dec 13, 2025Updated 4 months ago
- OCR a IIIF images in a manifest and generate annotations☆26Feb 11, 2025Updated last year
- This file maps a given list of company names to their proper website and also maps a give list of websites to the company name.☆15Nov 16, 2018Updated 7 years ago