MaxHalford / orcLinks
π§ Parsing structured information from OCR outputs
β19Updated last year
Alternatives and similar repositories for orc
Users that are interested in orc are comparing it to the libraries listed below
Sorting:
- Information extraction from English and German texts based on predicate logicβ139Updated 2 years ago
- Super Simple Similarities Serviceβ154Updated 7 months ago
- A Python library aimed at dissecting and augmenting NER training data.β59Updated 2 years ago
- β69Updated 3 years ago
- Python package for deduplication/entity resolution using active learningβ82Updated last year
- Generate reports for spaCy models.β29Updated 3 years ago
- A PyTorch-based open-source framework that provides methods for improving the weakly annotated data and allows researchers to efficientlyβ¦β108Updated last year
- β55Updated last year
- A framework for evaluating semantic search across custom datasets, metrics, and embedding backends.β35Updated 5 months ago
- π« SpaCy wrapper for ConceptNet π«β95Updated 2 years ago
- Experimental form data extraction for journalismβ76Updated 4 years ago
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax.β59Updated last year
- βοΈ Parallel and distributed training with spaCy and Rayβ56Updated 2 years ago
- Full text search that feels like a numpy arrayβ264Updated last month
- β43Updated 2 years ago
- π₯ Use Hugging Face text and token classification pipelines directly in spaCyβ63Updated last year
- XAI based human-in-the-loop framework for automatic rule-learning.β49Updated last year
- Template-based generation of DAG cards from Metaflow classes, inspired by Google cards for machine learning models.β30Updated 3 years ago
- Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health.β92Updated 3 years ago
- β84Updated 2 years ago
- spaCy match and replace, maintaining conjugationβ35Updated 2 years ago
- SPEAR: Programmatically label and build training data quickly.β109Updated last year
- Augmenty is an augmentation library based on spaCy for augmenting texts.β156Updated last year
- Just another sentiment wrapper.β18Updated 3 years ago
- multimodal document analysisβ166Updated this week
- Vespa application making an index of the CORD-19 dataset.β39Updated 4 months ago
- Bag of, not words, but tricks!β68Updated 2 years ago
- A spaCy wrapper for GliNERβ124Updated 9 months ago
- Pipeline components that support partial_fit.β46Updated last year
- It's a cooler way to store simple linear models.β27Updated last year