maxent-ai / ocrpyLinks
OCR, Archive, Index and Search: Implementation agnostic OCR framework.
☆222Updated last year
Alternatives and similar repositories for ocrpy
Users that are interested in ocrpy are comparing it to the libraries listed below
Sorting:
- Natural language Pandas queries and data generation powered by GPT-3☆197Updated last year
- Neural Search☆333Updated last year
- 📊 Semantic search for headlines and story text☆360Updated last year
- Custom recipe and utilities for document processing☆199Updated 3 years ago
- Labelling platform for text using weak supervision.☆263Updated 3 years ago
- 🖍️ Highlight text in documents☆109Updated 3 months ago
- Gain clues from clustering!☆318Updated last year
- Python package for deduplication/entity resolution using active learning☆81Updated 11 months ago
- Confection: the sweetest config system for Python☆188Updated 3 months ago
- Information extraction from English and German texts based on predicate logic☆138Updated 2 years ago
- QuickAI is a Python library that makes it extremely easy to experiment with state-of-the-art Machine Learning models.☆162Updated last month
- spock is a framework that helps manage complex parameter configurations during research and development of Python applications☆135Updated last year
- Blazing fast framework for fine-tuning similarity learning models☆656Updated 3 months ago
- Vector AI — A platform for building vector based applications. Encode, query and analyse data using vectors.☆315Updated last year
- An open-source AutoML Library based on PyTorch☆306Updated 3 weeks ago
- Python package to generate image embeddings with CLIP without PyTorch/TensorFlow☆152Updated 3 years ago
- Toolkit for developing and maintaining ML models☆155Updated last year
- This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-s…☆217Updated 6 months ago
- Topic Inference with Zeroshot models☆61Updated 2 years ago
- 📄 ⚙️ ETL processes for medical and scientific papers☆394Updated 3 weeks ago
- Conversational text Analysis using various NLP techniques☆180Updated 2 years ago
- just a bunch of useful embeddings for scikit-learn pipelines☆503Updated 4 months ago
- Woodwork is a Python library that provides robust methods for managing and communicating data typing information.☆154Updated 2 weeks ago
- ✨ Bootstrap annotation with zero- & few-shot learning via OpenAI GPT-3☆323Updated last year
- Model Agnostic Confidence Estimator (MACEST) - A Python library for calibrating Machine Learning models' confidence scores☆100Updated 2 months ago
- 🧬 A JupyterLab extension for annotating data with Prodigy☆189Updated 2 years ago
- Semantic search through a vectorized Wikipedia (SentenceBERT) with the Weaviate vector search engine☆242Updated 2 years ago
- ☆69Updated 3 years ago
- NeatText a simple NLP package for cleaning textual data and text preprocessing☆72Updated last year
- 📚 Datasets and models for instruction-tuning☆238Updated last year