MaxHalford / orcLinks
π§ Parsing structured information from OCR outputs
β20Updated 2 years ago
Alternatives and similar repositories for orc
Users that are interested in orc are comparing it to the libraries listed below
Sorting:
- A Python library aimed at dissecting and augmenting NER training data.β60Updated 2 years ago
- Information extraction from English and German texts based on predicate logicβ141Updated 2 years ago
- Super Simple Similarities Serviceβ155Updated 9 months ago
- Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health.β92Updated 4 years ago
- Python package for deduplication/entity resolution using active learningβ83Updated last year
- A framework for evaluating semantic search across custom datasets, metrics, and embedding backends.β38Updated 8 months ago
- Generate reports for spaCy models.β29Updated 3 years ago
- β68Updated 3 years ago
- π« SpaCy wrapper for ConceptNet π«β95Updated last month
- Experimental form data extraction for journalismβ78Updated 5 years ago
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax.β59Updated last year
- β84Updated 2 years ago
- Full text search that feels like a numpy arrayβ301Updated last week
- XAI based human-in-the-loop framework for automatic rule-learning.β49Updated last year
- π₯ Use Hugging Face text and token classification pipelines directly in spaCyβ63Updated last year
- spaCy match and replace, maintaining conjugationβ36Updated 3 years ago
- β43Updated 2 years ago
- A PyTorch-based open-source framework that provides methods for improving the weakly annotated data and allows researchers to efficientlyβ¦β108Updated last year
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality β¦β106Updated last year
- A spaCy wrapper for GliNERβ129Updated last year
- Generalist and Lightweight Model for Text Classificationβ169Updated 2 weeks ago
- Source code and data for Like a Good Nearest Neighborβ30Updated last year
- β55Updated 2 years ago
- Nesta's Skills Extractor Libraryβ150Updated 8 months ago
- Sentence transformers models for SpaCyβ108Updated 2 years ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidataβ170Updated 3 years ago
- Framework for building and maintaining self-updating prompts for LLMsβ65Updated last year
- SPEAR: Programmatically label and build training data quickly.β109Updated last year
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.β111Updated last year
- Augmenty is an augmentation library based on spaCy for augmenting texts.β156Updated last year