openredact / expose-textLinks
This is a prototype of a Python module for simple modification of document files.
β18Updated 3 years ago
Alternatives and similar repositories for expose-text
Users that are interested in expose-text are comparing it to the libraries listed below
Sorting:
- This is a prototype of a semi-automatic data anonymization app for German documents.β20Updated 2 years ago
- π Dehyphenation of broken text (mainly German), i.e., extracted from a PDFβ39Updated 3 years ago
- Repository for "Towards Robust Named Entity Recognition for Historic German"β18Updated 4 years ago
- GC4LM: A Colossal (Biased) language model for Germanβ13Updated 4 years ago
- Language Model and Text Classification for German Language using Deep Learningβ18Updated 6 years ago
- German lemmatization with IWNLP as extension for spaCyβ24Updated last year
- Python port for IWNLP.Lemmatizerβ17Updated last year
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Doβ¦β80Updated 11 months ago
- Use spaCy for NLP and output to the FoLiA XML format.β12Updated last year
- BERT and ELECTRA models trained on Europeana Newspapersβ38Updated 3 years ago
- A spaCy custom component that extracts and normalizes temporal expressionsβ54Updated 2 years ago
- Analyze Argumentation and Rhetorical Aspects in Scientific Writing.β19Updated 2 years ago
- Identifying Historical People, Places and other Entities: Shared Task on Named Entity Recognition and Linking on Historical Newspapers atβ¦β22Updated 10 months ago
- A part-of-speech tagger with support for domain adaptation and external resources.β23Updated 2 years ago
- Mining Legal Arguments in Court Decisions - Data and softwareβ68Updated 2 years ago
- FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.gβ¦β112Updated 4 months ago
- This is a prototype of a multi-lingual suite for named-entity recognition in Python.β21Updated last year
- This repository contains all manually labeled data from the GermEval-2018 shared task.β30Updated 6 years ago
- β64Updated 2 years ago
- Get annotation suggestions for the INCEpTION text annotation platform from spaCy, Sentence BERT, scikit-learn and more. Runs as a web-serβ¦β46Updated 8 months ago
- This repository contains the Framester resource, the main outcome of the framester project.β33Updated 5 years ago
- Plan and train German transformer models.β23Updated 4 years ago
- German Morphological Analyzerβ47Updated 3 years ago
- Named entity recognition for the legal domainβ42Updated 4 years ago
- Legal Reference Extractionβ32Updated last month
- CLI for loading Wikidata subsets (or all of it) into Elasticsearchβ70Updated 3 years ago
- Code to create the dataset from "A New Aligned Simple German Corpusβ10Updated last year
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissionsβ19Updated 2 years ago
- German sentiment scores with SentiWS as extension for spaCyβ37Updated 2 years ago
- EpiTator annotates epidemiological information in text documents. It is the natural language processing framework that powers GRITS and Eβ¦β41Updated 2 years ago