maxent-ai / ocrpyLinks
OCR, Archive, Index and Search: Implementation agnostic OCR framework.
☆222Updated last year
Alternatives and similar repositories for ocrpy
Users that are interested in ocrpy are comparing it to the libraries listed below
Sorting:
- Neural Search☆332Updated last year
- Labelling platform for text using weak supervision.☆262Updated 2 years ago
- 📄 ⚙️ ETL processes for medical and scientific papers☆385Updated last week
- Gain clues from clustering!☆315Updated 11 months ago
- Custom recipe and utilities for document processing☆199Updated 3 years ago
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti…☆245Updated 2 years ago
- Information extraction from English and German texts based on predicate logic☆137Updated 2 years ago
- Confection: the sweetest config system for Python☆186Updated 2 months ago
- Software that makes labeling PDFs easy.☆415Updated last year
- Blazing fast framework for fine-tuning similarity learning models☆656Updated 2 months ago
- Natural language Pandas queries and data generation powered by GPT-3☆197Updated last year
- spock is a framework that helps manage complex parameter configurations during research and development of Python applications☆132Updated last year
- Doubt your data, find bad labels.☆513Updated 11 months ago
- Fuzzy string matching, grouping, and evaluation.☆765Updated last month
- SpikeX - SpaCy Pipes for Knowledge Extraction☆398Updated 3 years ago
- Python package to generate image embeddings with CLIP without PyTorch/TensorFlow☆152Updated 3 years ago
- 📊 Semantic search for headlines and story text☆360Updated last year
- Traversing links to find the deep source of information☆71Updated 2 years ago
- Super lightweight function registries for your library☆179Updated last year
- ✨ Bootstrap annotation with zero- & few-shot learning via OpenAI GPT-3☆323Updated last year
- ☆69Updated 3 years ago
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy☆313Updated last month
- A Simple Bulk Labelling Tool☆586Updated 5 months ago
- Spacy NER annotator using ipywidgets☆123Updated last year
- This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-s…☆215Updated 5 months ago
- just a bunch of useful embeddings for scikit-learn pipelines☆500Updated 2 months ago
- Conversational text Analysis using various NLP techniques☆180Updated 2 years ago
- skweak: A software toolkit for weak supervision applied to NLP tasks☆926Updated 9 months ago
- Fuzzy matching and more functionality for spaCy.☆256Updated 11 months ago
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te…☆329Updated last year