Layout-Parser / annotation-service
☆18 Updated 3 years ago
Alternatives and similar repositories for annotation-service
Users who are interested in annotation-service are comparing it to the libraries listed below.
- Train floret vectors ☆18 Updated 2 years ago
- ☆55 Updated last year
- Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities. ☆18 Updated 10 months ago
- METS/ALTO OCR enhancing tool by the National Library of Luxembourg (BnL) ☆53 Updated 2 years ago
- An example of how to use spaCy for extremely large files without running into memory issues ☆36 Updated 2 years ago
- Metadata Extractor & Loader (MEL) - The NLP-NER Toolkit (TNNT) ☆23 Updated 2 years ago
- Legal document similarity - Code, data, and models for the ICAIL 2021 paper "Evaluating Document Representations for Content-based Legal … ☆32 Updated 4 years ago
- A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by the Incorporated… ☆26 Updated 2 years ago
- TopicScan: Visualization and validation interface for NMF Topic Modeling ☆23 Updated 4 years ago
- Use Hugging Face text and token classification pipelines directly in spaCy ☆63 Updated last year
- Small Python package to measure OCR quality and other related metrics. ☆23 Updated last year
- Given a text, wrap it into phrases and send them to Yandex's search engine. If it yields a "did you mean:", substitute the original phras… ☆11 Updated 6 years ago
- Finds linguistic patterns effortlessly ☆36 Updated last year
- ReconNER, Debug annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality of your data. ☆35 Updated 4 years ago
- ☆17 Updated 2 years ago
- Post-processing OCR errors with seq2seq models ☆28 Updated 4 years ago
- MoodCat classifies the mood of English sentences. ☆14 Updated 3 years ago
- Dehyphenation of broken text (mainly German), i.e., extracted from a PDF ☆39 Updated 3 years ago
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissions ☆19 Updated 2 years ago
- Rust Python bindings for symspell ☆19 Updated last year
- 🧬 A VS Code extension for annotating data with Prodigy ☆30 Updated 3 years ago
- ☆13 Updated last year
- Next-generation Punkt sentence boundary detection with zero dependencies ☆17 Updated 2 months ago
- Generate reports for spaCy models. ☆29 Updated 3 years ago
- Fastlaw's purpose is to replace generic word embeddings for work on supervised machine learning NLP-tasks with legal texts. ☆38 Updated 6 years ago
- Named entity recognition for the legal domain ☆42 Updated 4 years ago
- Analyze XML extracted from PDFs (e.g. from TET or PDFMiner) ☆20 Updated 7 years ago
- Source code and data for Like a Good Nearest Neighbor ☆29 Updated 5 months ago
- spaCy match and replace, maintaining conjugation ☆35 Updated 2 years ago
- spaCy pipeline component for generating spaCy KnowledgeBase Alias Candidates for Entity Linking ☆85 Updated 2 years ago