MaxHalford / orcLinks
π§ Parsing structured information from OCR outputs
β19Updated last year
Alternatives and similar repositories for orc
Users that are interested in orc are comparing it to the libraries listed below
Sorting:
- Experimental form data extraction for journalismβ77Updated 4 years ago
- A Python library aimed at dissecting and augmenting NER training data.β58Updated 2 years ago
- Super Simple Similarities Serviceβ151Updated 3 months ago
- Information extraction from English and German texts based on predicate logicβ137Updated 2 years ago
- β43Updated 2 years ago
- A framework for evaluating semantic search across custom datasets, metrics, and embedding backends.β34Updated last month
- Generate reports for spaCy models.β29Updated 3 years ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality β¦β106Updated last year
- β69Updated 3 years ago
- This repository contains code and data download scripts for the paper "Intermediate Training of BERT for Product Matching" by Ralph Peeteβ¦β38Updated 2 years ago
- Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health.β91Updated 3 years ago
- β55Updated last year
- Nesta's Skills Extractor Libraryβ140Updated last month
- βοΈ Parallel and distributed training with spaCy and Rayβ54Updated last year
- ποΈ Highlight text in documentsβ109Updated 2 months ago
- Generalist and Lightweight Model for Text Classificationβ139Updated last month
- π« SpaCy wrapper for ConceptNet π«β94Updated last year
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progrβ¦β32Updated 3 months ago
- Explainable Zero-Shot Topic Extractionβ63Updated 11 months ago
- β79Updated 2 years ago
- Sentence transformers models for SpaCyβ107Updated 2 years ago
- Bag of, not words, but tricks!β68Updated last year
- π₯ Use Hugging Face text and token classification pipelines directly in spaCyβ63Updated last year
- Pipeline components that support partial_fit.β46Updated last year
- Few-shot Named Entity Recognitionβ123Updated 3 years ago
- A Streamlit component for annotating text by text selecting.β40Updated last year
- multimodal document analysisβ166Updated last year
- Augmenty is an augmentation library based on spaCy for augmenting texts.β156Updated last year
- Template-based generation of DAG cards from Metaflow classes, inspired by Google cards for machine learning models.β30Updated 3 years ago
- β81Updated 3 years ago