openredact / expose-text
This is a prototype of a Python module for simple modification of document files.
★18 · Updated 3 years ago
Alternatives and similar repositories for expose-text
Users who are interested in expose-text are comparing it to the libraries listed below.
- This is a prototype of a semi-automatic data anonymization app for German documents. ★22 · Updated 2 years ago
- Dehyphenation of broken text (mainly German), i.e., extracted from a PDF ★39 · Updated 3 years ago
- Analyze Argumentation and Rhetorical Aspects in Scientific Writing. ★19 · Updated 2 years ago
- German lemmatization with IWNLP as extension for spaCy ★25 · Updated 2 years ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do… ★81 · Updated last year
- Python tools for interacting with Wikidata ★156 · Updated 2 years ago
- GC4LM: A Colossal (Biased) language model for German ★13 · Updated 4 years ago
- This repository contains all manually labeled data from the GermEval-2018 shared task. ★29 · Updated 7 years ago
- BERT and ELECTRA models trained on Europeana Newspapers ★38 · Updated 3 years ago
- A minimal, pure Python library to interface with CoNLL-U format files. ★152 · Updated this week
- 🧪 Cutting-edge experimental spaCy components and features ★102 · Updated last year
- Toolkit to compile a comparable/parallel corpus from European Parliament proceedings ★16 · Updated 5 years ago
- Language Model and Text Classification for German Language using Deep Learning ★18 · Updated 7 years ago
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissions ★19 · Updated 2 years ago
- Named entity recognition for the legal domain ★42 · Updated 4 years ago
- Legal Reference Extraction ★35 · Updated 6 months ago
- A tokenizer and sentence splitter for German and English web and social media texts. ★148 · Updated 10 months ago
- Next-generation Punkt sentence boundary detection with zero dependencies ★20 · Updated 2 months ago
- Repository for "Towards Robust Named Entity Recognition for Historic German" ★18 · Updated 4 years ago
- Use spaCy for NLP and output to the FoLiA XML format. ★12 · Updated last year
- A small tool that EXPLains spACY parse results. See what I did there? ★83 · Updated 3 years ago
- Python port for IWNLP.Lemmatizer ★17 · Updated 2 years ago
- ★64 · Updated 2 years ago
- CLI for loading Wikidata subsets (or all of it) into Elasticsearch ★70 · Updated 3 years ago
- CONLL-U to Pandas DataFrame ★31 · Updated 7 years ago
- A spaCy custom component that extracts and normalizes temporal expressions ★55 · Updated 2 years ago
- An annotated corpus of argumentative microtexts ★40 · Updated 3 years ago
- Linguistic and stylistic complexity measures for (literary) texts ★84 · Updated last year
- Plan and train German transformer models. ★23 · Updated 4 years ago
- A part-of-speech tagger with support for domain adaptation and external resources. ★23 · Updated 3 years ago