openredact / expose-text
This is a prototype of a Python module for simple modification of document files.
☆18 Updated 3 years ago
Alternatives and similar repositories for expose-text
Users who are interested in expose-text compare it to the libraries listed below.
- This is a prototype of a semi-automatic data anonymization app for German documents. ☆20 Updated 2 years ago
- German lemmatization with IWNLP as extension for spaCy ☆24 Updated last year
- 📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF ☆39 Updated 3 years ago
- BERT and ELECTRA models trained on Europeana Newspapers ☆38 Updated 3 years ago
- Language Model and Text Classification for German Language using Deep Learning ☆18 Updated 7 years ago
- Analyze Argumentation and Rhetorical Aspects in Scientific Writing. ☆19 Updated 2 years ago
- Repository for "Towards Robust Named Entity Recognition for Historic German" ☆18 Updated 4 years ago
- Use spaCy for NLP and output to the FoLiA XML format. ☆12 Updated last year
- TextComplexityDE dataset consists of 1000 sentences in the German language with subjective complexity rating, collected from German learn… ☆13 Updated 3 years ago
- Toolkit to compile a comparable/parallel corpus from European Parliament proceedings ☆16 Updated 5 years ago
- Plan and train German transformer models. ☆23 Updated 4 years ago
- GC4LM: A Colossal (Biased) language model for German ☆13 Updated 4 years ago
- CONLL-U to Pandas DataFrame ☆31 Updated 7 years ago
- A spaCy custom component that extracts and normalizes temporal expressions ☆54 Updated 2 years ago
- Legal Reference Extraction ☆33 Updated last month
- Mining Legal Arguments in Court Decisions - Data and software ☆68 Updated 2 years ago
- Python port for IWNLP.Lemmatizer ☆17 Updated last year
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissions ☆19 Updated 2 years ago
- This repository contains all manually labeled data from the GermEval-2018 shared task. ☆30 Updated 6 years ago
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata ☆94 Updated 2 years ago
- The Wikinflection Corpus, from the paper "Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus" (Metheni… ☆12 Updated last year
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do… ☆80 Updated 11 months ago
- German Morphological Analyzer ☆47 Updated 3 years ago
- CLI for loading Wikidata subsets (or all of it) into Elasticsearch ☆70 Updated 3 years ago
- A part-of-speech tagger with support for domain adaptation and external resources. ☆23 Updated 2 years ago
- Linguistic converter / merging tool for multi-level annotated corpora. Graph-based (using Python and NetworkX). ☆51 Updated 2 years ago
- CoNLL 2018 Shared Task Team UDPipe-Future ☆39 Updated 4 years ago
- Python tools for interacting with Wikidata ☆153 Updated last year
- The Potsdam Twitter Sentiment Corpus ☆17 Updated 5 years ago
- Compiled tools, datasets, and other resources for historical text normalization. ☆18 Updated 6 years ago