MaxHalford / orc
Parsing structured information from OCR outputs
☆18, updated last year
Alternatives and similar repositories for orc:
Users who are interested in orc are comparing it to the libraries listed below.
- Parallel and distributed training with spaCy and Ray ☆53, updated last year
- Generate reports for spaCy models. ☆29, updated 2 years ago
- A Python library aimed at dissecting and augmenting NER training data. ☆57, updated last year
- Information extraction from English and German texts based on predicate logic ☆135, updated last year
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax. ☆58, updated 8 months ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models. ☆104, updated 8 months ago
- ☆68, updated 2 years ago
- Python package for deduplication/entity resolution using active learning ☆78, updated 4 months ago
- ☆54, updated last year
- ☆75, updated last year
- ☆42, updated last year
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr… ☆26, updated last month
- Train huggingface models on top of Prodigy annotations ☆21, updated 11 months ago
- Experimental form data extraction for journalism ☆77, updated 4 years ago
- A spaCy custom component that extracts and normalizes temporal expressions ☆52, updated last year
- An End-to-End Evaluation Framework for Entity Resolution Systems ☆26, updated last year
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata ☆93, updated last year
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality … ☆106, updated 10 months ago
- Template-based generation of DAG cards from Metaflow classes, inspired by Google cards for machine learning models. ☆30, updated 3 years ago
- spaCy match and replace, maintaining conjugation ☆35, updated 2 years ago
- An easy way to chunk spaCy docs. ☆18, updated 5 months ago
- Use Hugging Face text and token classification pipelines directly in spaCy ☆63, updated 10 months ago
- A spaCy wrapper for GliNER ☆101, updated 6 months ago
- SpaCy wrapper for ConceptNet ☆89, updated last year
- A PyTorch-based open-source framework that provides methods for improving the weakly annotated data and allows researchers to efficiently… ☆108, updated 4 months ago
- Just another sentiment wrapper. ☆17, updated 3 years ago
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction ☆63, updated 5 months ago
- Python API for https://vespa.ai, the open big data serving engine