maxent-ai / ocrpyLinks
OCR, Archive, Index and Search: Implementation agnostic OCR framework.
☆222Updated last year
Alternatives and similar repositories for ocrpy
Users that are interested in ocrpy are comparing it to the libraries listed below
Sorting:
- Neural Search☆331Updated last year
- Gain clues from clustering!☆313Updated 10 months ago
- 📊 Semantic search for headlines and story text☆360Updated last year
- Labelling platform for text using weak supervision.☆262Updated 2 years ago
- Natural language Pandas queries and data generation powered by GPT-3☆196Updated last year
- 📄 ⚙️ ETL processes for medical and scientific papers☆384Updated last month
- Software that makes labeling PDFs easy.☆416Updated last year
- 🖍️ Highlight text in documents☆107Updated last month
- Blazing fast framework for fine-tuning similarity learning models☆656Updated last month
- Doubt your data, find bad labels.☆513Updated 10 months ago
- Custom recipe and utilities for document processing☆199Updated 2 years ago
- just a bunch of useful embeddings for scikit-learn pipelines☆499Updated 2 months ago
- spock is a framework that helps manage complex parameter configurations during research and development of Python applications☆132Updated last year
- This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-s…☆214Updated 4 months ago
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy☆312Updated last month
- SpikeX - SpaCy Pipes for Knowledge Extraction☆398Updated 3 years ago
- Information extraction from English and German texts based on predicate logic☆136Updated last year
- A universal package of scraper scripts for humans☆310Updated 3 years ago
- ✨ Bootstrap annotation with zero- & few-shot learning via OpenAI GPT-3☆322Updated last year
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti…☆245Updated last year
- Fuzzy string matching, grouping, and evaluation.☆763Updated 3 weeks ago
- An open-source AutoML Library based on PyTorch☆306Updated last month
- Vectory provides a collection of tools to track and compare embedding versions.☆71Updated 2 years ago
- Confection: the sweetest config system for Python☆185Updated last month
- Clean, filter and sample URLs to optimize data collection – Python & command-line – Deduplication, spam, content and language filters☆138Updated 5 months ago
- Build, present and share animated data stories in Jupyter Notebook and similar environments.☆339Updated 3 months ago
- skweak: A software toolkit for weak supervision applied to NLP tasks☆925Updated 9 months ago
- A Simple Bulk Labelling Tool☆582Updated 5 months ago
- Python package for deduplication/entity resolution using active learning☆80Updated 9 months ago
- Vector AI — A platform for building vector based applications. Encode, query and analyse data using vectors.☆314Updated last year