Layout-Parser / annotation-service
☆15 Updated 3 years ago
Related projects
Alternatives and complementary repositories for annotation-service
- Train floret vectors ☆18 Updated last year
- ReconNER, Debug annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality of your data. ☆34 Updated 4 years ago
- Finds linguistic patterns effortlessly ☆33 Updated last year
- sequence tagging with spaCy and crfsuite ☆18 Updated last year
- Use Hugging Face text and token classification pipelines directly in spaCy ☆62 Updated 8 months ago
- Dehyphenation of broken text (mainly German), i.e., extracted from a PDF ☆38 Updated 2 years ago
- Named entity recognition for the legal domain ☆40 Updated 3 years ago
- ☆53 Updated 10 months ago
- Source code and data for Like a Good Nearest Neighbor ☆28 Updated 9 months ago
- This repository contains the DFKI Product Corpus, a dataset of 174 documents annotated for product and company named entities, and the re… ☆12 Updated 2 months ago
- ☆70 Updated last year
- Metadata Extractor & Loader (MEL) – The NLP-NER Toolkit (TNNT) ☆22 Updated last year
- An example of how to use spaCy for extremely large files without running into memory issues ☆36 Updated 2 years ago
- Code for "CyberWallE at SemEval-2020 Task 11: An Analysis of Feature Engineering for Ensemble Models for Propaganda Detection" (V. Blasch… ☆9 Updated 4 years ago
- Annotation Management for Prodigy that supports multiple users working in many projects ☆15 Updated 6 years ago
- Python based Wikidata framework for easy dataframe extraction ☆39 Updated last year
- ☆29 Updated 2 years ago
- Text classification automl ☆21 Updated 3 years ago
- ☆17 Updated last year
- Python tools for Tesseract OCR training ☆25 Updated 2 years ago
- This is a prototype of a multi-lingual suite for named-entity recognition in Python. ☆21 Updated 7 months ago
- This library builds a graph-representation of the content of PDFs. The graph is then clustered, resulting page segments are classified an… ☆22 Updated 4 years ago
- 𧬠A VS Code extension for annotating data with Prodigyβ30Updated 2 years ago
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissionsβ19Updated last year
- spaCy match and replace, maintaining conjugationβ34Updated last year
- π Python Package to reconstruct the original continuous text from PDFs with language modelsβ33Updated last year
- Source code for the paper "Post-OCR Document Correction with Large Ensembles of Character Sequence-to-Sequence Models"β35Updated 11 months ago
- Converter from UD-trees to BART representationβ36Updated 8 months ago
- A PyPI package for easy text annotation in a Jupyter Notebook.β28Updated 3 years ago
- spaCy entry points for Curated Transformersβ25Updated last month