OCR, Archive, Index and Search: Implementation agnostic OCR framework.
β226Nov 3, 2023Updated 2 years ago
Alternatives and similar repositories for ocrpy
Users that are interested in ocrpy are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 𧬠Modularised Evolutionary Algorithms For Python with Optional JIT and Multiprocessing (Ray) support. Inspired by PyTorch Lightningβ52Mar 29, 2023Updated 3 years ago
- Labelling platform for text using weak supervision.β260Jun 24, 2022Updated 3 years ago
- Python functions to obtain and clean data required for the version 2 Housing Unit Allocation. Workflow uses Census API.β20Updated this week
- eSNN - Learning similarity measure from dataβ12Nov 28, 2019Updated 6 years ago
- URSC 645 - Urban and Regional Analytics Courseβ13Apr 7, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- β32Dec 15, 2023Updated 2 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.β20Feb 7, 2023Updated 3 years ago
- skweak: A software toolkit for weak supervision applied to NLP tasksβ927Sep 2, 2024Updated last year
- Doubt your data, find bad labels.β516Jul 15, 2024Updated last year
- GLaRA: Graph-based Labeling Rule Augmentation for Weakly Supervised Named Entity Recognitionβ31Jan 31, 2022Updated 4 years ago
- Active Learning for Text Classification in Pythonβ643May 17, 2026Updated last week
- Compare different encoding methods to see how well they perform on a classification task. Determine if a reddit comment is from /r/StarWaβ¦β13Mar 14, 2022Updated 4 years ago
- Zero and Few shot named entity & relationships recognitionβ402Sep 17, 2025Updated 8 months ago
- π‘ All-in-one AI framework for semantic search, LLM orchestration and language model workflowsβ12,577May 12, 2026Updated last week
- Virtual machines for every use case on DigitalOcean β’ AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- The most accurate natural language detection library for Python, suitable for short text and mixed-language textβ1,721Apr 23, 2026Updated last month
- An easy way to extract information from documentsβ1,777May 3, 2023Updated 3 years ago
- β20Jul 22, 2021Updated 4 years ago
- Analyse your own local files with ChatGPT style interactionβ14Apr 23, 2023Updated 3 years ago
- β10Apr 2, 2024Updated 2 years ago
- Brain segmentation with TensorFlowβ12Nov 13, 2017Updated 8 years ago
- Fuzzy string matching, grouping, and evaluation.β796Jul 10, 2025Updated 10 months ago
- UnionML: the easiest way to build and deploy machine learning microservicesβ337Nov 6, 2023Updated 2 years ago
- This is official repository of the series "can python do that".β10Oct 5, 2021Updated 4 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits β’ AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Hybrid architecture media server, media service and Streamlit client app using FastAPI and Pythonβ14Jul 12, 2022Updated 3 years ago
- It's a cooler way to store simple linear models.β26Jul 15, 2024Updated last year
- Natural language Pandas queries and data generation powered by GPT-3β200Apr 13, 2024Updated 2 years ago
- A PyTorch-based open-source framework that provides methods for improving the weakly annotated data and allows researchers to efficientlyβ¦β108Sep 10, 2024Updated last year
- Simple terminal interface for chatgptβ10Dec 6, 2022Updated 3 years ago
- Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML vaβ¦β4,017Dec 28, 2025Updated 4 months ago
- Explore the DALLΒ·E 2 API in Pythonβ55Dec 14, 2022Updated 3 years ago
- SpikeX - SpaCy Pipes for Knowledge Extractionβ403Jul 30, 2021Updated 4 years ago
- FastAPI-like interface plugin for Flaskβ43Dec 3, 2025Updated 5 months ago
- Virtual machines for every use case on DigitalOcean β’ AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifactβ¦β1,471Dec 9, 2024Updated last year
- Confection: the sweetest config system for Pythonβ193Mar 27, 2026Updated last month
- Transforms PDF, Documents and Images into Enriched Structured Dataβ6,174Mar 20, 2026Updated 2 months ago
- π Semantic search for headlines and story textβ359Sep 23, 2023Updated 2 years ago
- Expressive diffeomorphic transformations based on the closed-form integration of continuous piecewise affine velocity functions.β16Aug 9, 2023Updated 2 years ago
- Brushing up on the basicsβ13Jun 27, 2016Updated 9 years ago
- A labextension to integrate pyflyby with notebooksβ14Dec 15, 2025Updated 5 months ago