MaxHalford / orc
π§ Parsing structured information from OCR outputs
β18Updated last year
Alternatives and similar repositories for orc:
Users that are interested in orc are comparing it to the libraries listed below
- Experimental form data extraction for journalismβ77Updated 4 years ago
- π« SpaCy wrapper for ConceptNet π«β89Updated last year
- π₯ Use Hugging Face text and token classification pipelines directly in spaCyβ63Updated 11 months ago
- A Python library aimed at dissecting and augmenting NER training data.β58Updated last year
- Explainable Zero-Shot Topic Extractionβ62Updated 6 months ago
- β68Updated 2 years ago
- β54Updated last year
- Generate reports for spaCy models.β29Updated 2 years ago
- spaCy extension for Visual Studio Codeβ27Updated last year
- β42Updated last year
- Streamlit demo app to demonstrate the features of transformers interpret with multiple models.β25Updated 3 years ago
- Information extraction from English and German texts based on predicate logicβ135Updated last year
- A spaCy wrapper for GliNERβ107Updated 3 weeks ago
- spaCy match and replace, maintaining conjugationβ35Updated 2 years ago
- β76Updated 2 years ago
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax.β59Updated 9 months ago
- Train huggingface models on top of Prodigy annotationsβ21Updated last year
- Generalist and Lightweight Model for Text Classificationβ65Updated this week
- KeypartX is a graph-based approach to represent perception (text in general) by key parts of speech.Updated last year
- Small python package to measure OCR quality and other related metrics.β21Updated last year
- Data Programming by Demonstration (DPBD) for Document Classificationβ35Updated 3 years ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP modelsβ¦β37Updated 2 years ago
- βοΈ Parallel and distributed training with spaCy and Rayβ53Updated last year
- GLiNER model in a FastAPI microservice.β36Updated 2 months ago
- Python API for https://vespa.ai, the open big data serving engineβ113Updated this week
- Python package for deduplication/entity resolution using active learningβ76Updated 5 months ago
- β13Updated last year
- π€ HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)β17Updated 11 months ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidataβ157Updated 2 years ago
- Query and visualize knowledge graphsβ49Updated 6 months ago