Layout-Parser / annotation-service
β16Updated 3 years ago
Alternatives and similar repositories for annotation-service:
Users that are interested in annotation-service are comparing it to the libraries listed below
- πΈ Train floret vectorsβ18Updated 2 years ago
- Small python package to measure OCR quality and other related metrics.β21Updated last year
- Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities.β17Updated 8 months ago
- Metadata Extractor & Loader (MEL) β The NLP-NER Toolkit (TNNT)β23Updated 2 years ago
- Finds linguistic patterns effortlesslyβ36Updated last year
- π₯ Use Hugging Face text and token classification pipelines directly in spaCyβ63Updated last year
- β17Updated last year
- An example of how to use spaCy for extremely large files without running into memory issuesβ36Updated 2 years ago
- Named entity recognition for the legal domainβ42Updated 3 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.β22Updated 2 years ago
- β54Updated last year
- Source code for the paper "Post-OCR Document Correction with Large Ensembles of Character Sequence-to-Sequence Models"β37Updated last year
- This is a prototype of a multi-lingual suite for named-entity recognition in Python.β21Updated last year
- Summary Explorer is a tool to visually explore the state-of-the-art in text summarization.β44Updated 11 months ago
- Next-generation Punkt sentence boundary detection with zero dependenciesβ16Updated last month
- A simple library for training named entity recognition model from partially annotated dataβ23Updated last year
- This repository contains code and data download instructions for the workshop paper "Improving Hierarchical Product Classification using β¦β17Updated 4 years ago
- β30Updated 2 years ago
- Python based Wikidata framework for easy dataframe extractionβ44Updated last year
- Post-processing OCR errors with seq2seq modelsβ28Updated 4 years ago
- spaCy match and replace, maintaining conjugationβ35Updated 2 years ago
- Legal document similarity - Code, data, and models for the ICAIL 2021 paper "Evaluating Document Representations for Content-based Legal β¦β32Updated 4 years ago
- A Named-Entity Recogniser based on Grobid.β52Updated 7 months ago
- 𧬠A VS Code extension for annotating data with Prodigyβ30Updated 3 years ago
- spaCy pipeline component for generating spaCy KnowledgeBase Alias Candidates for Entity Linkingβ85Updated 2 years ago
- β13Updated last year
- ReconNER, Debug annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality of your data.β35Updated 4 years ago
- Given a text, wrap it into phrases and send them to Yandex's search engine. If it yields a "did you mean:", substitute the original phrasβ¦β11Updated 6 years ago
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchiβ¦β33Updated 11 months ago
- Code for "CyberWallE at SemEval-2020 Task 11: An Analysis of Feature Engineering for Ensemble Models for Propaganda Detection" (V. Blaschβ¦β9Updated 4 years ago