maxent-ai / ocrpy
OCR, Archive, Index and Search: Implementation agnostic OCR framework.
☆222Updated last year
Related projects ⓘ
Alternatives and complementary repositories for ocrpy
- Neural Search☆325Updated 5 months ago
- Labelling platform for text using weak supervision.☆260Updated 2 years ago
- 📄 ⚙️ ETL processes for medical and scientific papers☆352Updated 11 months ago
- Gain clues from clustering!☆305Updated 4 months ago
- Doubt your data, find bad labels.☆503Updated 4 months ago
- just a bunch of useful embeddings☆466Updated 2 months ago
- Custom recipe and utilities for document processing☆198Updated 2 years ago
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti…☆242Updated last year
- spock is a framework that helps manage complex parameter configurations during research and development of Python applications☆124Updated last year
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te…☆309Updated 10 months ago
- Python package to generate image embeddings with CLIP without PyTorch/TensorFlow☆137Updated 2 years ago
- Confection: the sweetest config system for Python☆176Updated 5 months ago
- Blazing fast framework for fine-tuning similarity learning models☆643Updated last month
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy☆287Updated last year
- Educational python for Neural Networks.☆128Updated 10 months ago
- Check if you have training samples in your test set☆64Updated 2 years ago
- Information extraction from English and German texts based on predicate logic☆135Updated last year
- 📊 Semantic search for headlines and story text☆356Updated last year
- This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-s…☆209Updated 5 months ago
- A Simple Bulk Labelling Tool☆552Updated 2 months ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated 8 months ago
- Make PDFs easily☆314Updated 2 years ago
- Software that makes labeling PDFs easy.☆391Updated 6 months ago
- Creates dynamic html report from jupyter notebook.☆294Updated 8 months ago
- Vectory provides a collection of tools to track and compare embedding versions.☆70Updated last year
- Open Source Photos Platform Powered by PyTorch☆137Updated 2 years ago
- Experimental form data extraction for journalism☆76Updated 3 years ago
- Neural Search☆344Updated 5 months ago