A model(ing framework) for sample-efficient OCR
☆65Apr 7, 2023Updated 3 years ago
Alternatives and similar repositories for effocr
Users that are interested in effocr are comparing it to the libraries listed below.
- Noise-robust de-duplication at scale☆19Apr 9, 2023Updated 3 years ago
- ☆14Feb 20, 2024Updated 2 years ago
- ☆20Jul 22, 2021Updated 4 years ago
- Official repository accompanying the ICDAR 2023 paper☆13Oct 3, 2023Updated 2 years ago
- Cross-lingual learning in scene text recognition (ICASSP2024)☆19Sep 29, 2024Updated last year
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…☆29Apr 17, 2024Updated 2 years ago
- ☆10Oct 2, 2024Updated last year
- Python SDK for Data API and Solr API access☆12May 3, 2024Updated 2 years ago
- ☆13Jun 25, 2019Updated 6 years ago
- ☆15Mar 8, 2024Updated 2 years ago
- My user-written commands/packages for data analysis in Stata☆20Oct 12, 2020Updated 5 years ago
- dev repo for article☆33Mar 14, 2023Updated 3 years ago
- nnanno is a collection of tools that sample, annotate and apply computer vision to the Newspaper Navigator dataset☆17Oct 16, 2024Updated last year
- time-series row column classification☆14Jan 7, 2022Updated 4 years ago
- Detect and align similar passages☆122Apr 27, 2026Updated last month
- ☆10Oct 15, 2019Updated 6 years ago
- Source code for the paper "Post-OCR Document Correction with Large Ensembles of Character Sequence-to-Sequence Models"☆39Dec 2, 2023Updated 2 years ago
- Data and code: "Answering legal questions from laymen in German civil law system", Büttner & Habernal, EACL'24☆15Mar 2, 2024Updated 2 years ago
- [ACL 2025] 🔍 Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment☆11Apr 6, 2025Updated last year
- Discover internal APIs from any website. Captures XHR/fetch calls, extracts auth headers, outputs structured endpoint catalogs. Like open…☆34Feb 5, 2026Updated 4 months ago
- T-Projection is a method to perform high-quality Annotation Projection of Sequence Labeling datasets.☆13Nov 21, 2023Updated 2 years ago
- RoDLA: Benchmarking the Robustness of Document Layout Analysis Models☆39Mar 26, 2025Updated last year
- Python code for the procedure in Duarte, Magnolfi, Sølvsten, and Sullivan (2023) to test firm conduct.☆14May 28, 2026Updated 2 weeks ago
- ☆42Jun 15, 2024Updated last year
- Named Entity Recognition☆19Feb 13, 2026Updated 3 months ago
- Strips boilerplate from Project Gutenberg text files☆17Jul 28, 2021Updated 4 years ago
- ☆12Jan 29, 2021Updated 5 years ago
- Repository for contributions for Data Generation for Post-OCR correction of Cyrillic handwriting paper☆23Nov 27, 2023Updated 2 years ago
- Minimal code to train ELMo models in recent versions of TensorFlow☆14Apr 30, 2023Updated 3 years ago
- Resources related to EMNLP 2021 paper "FAME: Feature-Based Adversarial Meta-Embeddings for Robust Input Representations"☆13Dec 14, 2021Updated 4 years ago
- code for paper "Accessing higher dimensions for unsupervised word translation"☆22Jun 26, 2023Updated 2 years ago
- Portal for the course "Economic Slack" at UCSC [ECON 221]☆38Dec 13, 2025Updated 5 months ago
- [ICLR 2025] FLAT: LLM Unlearning via Loss Adjustment with Only Forget Data☆14Feb 26, 2025Updated last year
- Source code and dataset for the paper 'Saamayik: A Benchmark and Dataset for English-Sanskrit Translation'☆15Oct 11, 2025Updated 8 months ago
- OCR IIIF images in a manifest and generate annotations☆27May 1, 2026Updated last month
- R package for the identification of functionally important subnetworks☆14May 6, 2024Updated 2 years ago
- Installs and manages Stata programs tracked as git repositories.☆11Sep 13, 2017Updated 8 years ago
- Training MobileNet-v2 for object detection using the PVANet framework; paper: https://arxiv.org/abs/1611.08588 ☆13Feb 13, 2019Updated 7 years ago
- This repository is part of an NLP course for humanities and cultural studies. This course uses historical newspapers as a source and appl…☆20Jun 5, 2025Updated last year