MaxHalford / orc
Parsing structured information from OCR outputs
★20, updated 2 years ago
Alternatives and similar repositories for orc
Users who are interested in orc are comparing it to the libraries listed below.
- A Python library aimed at dissecting and augmenting NER training data. (★60, updated 2 years ago)
- A PyTorch-based open-source framework that provides methods for improving the weakly annotated data and allows researchers to efficiently… (★108, updated last year)
- Generate reports for spaCy models. (★29, updated 3 years ago)
- Super Simple Similarities Service (★155, updated 9 months ago)
- Full text search that feels like a numpy array (★301, updated this week)
- A framework for evaluating semantic search across custom datasets, metrics, and embedding backends. (★38, updated 8 months ago)
- Experimental form data extraction for journalism (★78, updated 5 years ago)
- Information extraction from English and German texts based on predicate logic (★141, updated 2 years ago)
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality … (★106, updated last year)
- 💫 SpaCy wrapper for ConceptNet 💫 (★95, updated last month)
- Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health. (★92, updated 4 years ago)
- spaCy match and replace, maintaining conjugation (★36, updated 3 years ago)
- XAI based human-in-the-loop framework for automatic rule-learning. (★49, updated last year)
- Augmenty is an augmentation library based on spaCy for augmenting texts. (★156, updated last year)
- ★68, updated 3 years ago
- ★84, updated 2 years ago
- Python package for deduplication/entity resolution using active learning (★83, updated last year)
- ★55, updated 2 years ago
- Plug-and-play document AI with zero-shot models. (★123, updated 2 weeks ago)
- Template-based generation of DAG cards from Metaflow classes, inspired by Google cards for machine learning models. (★30, updated 4 years ago)
- A Streamlit component for annotating text by text selecting. (★42, updated last year)
- Nesta's Skills Extractor Library (★150, updated 7 months ago)
- ★43, updated 2 years ago
- A spaCy wrapper for GliNER (★129, updated last year)
- Train huggingface models on top of Prodigy annotations (★21, updated last year)
- PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolut… (★161, updated 3 years ago)
- SPEAR: Programmatically label and build training data quickly. (★109, updated last year)
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr… (★34, updated 5 months ago)
- 🧪 Cutting-edge experimental spaCy components and features (★105, updated last year)
- Explainable Zero-Shot Topic Extraction (★65, updated last year)