openredact / expose-textLinks
This is a prototype of a Python module for simple modification of document files.
โ18Updated 3 years ago
Alternatives and similar repositories for expose-text
Users that are interested in expose-text are comparing it to the libraries listed below
Sorting:
- This is a prototype of a semi-automatic data anonymization app for German documents.โ21Updated 2 years ago
- ๐ Dehyphenation of broken text (mainly German), i.e., extracted from a PDFโ39Updated 3 years ago
- CLI for loading Wikidata subsets (or all of it) into Elasticsearchโ70Updated 3 years ago
- BERT and ELECTRA models trained on Europeana Newspapersโ38Updated 3 years ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Doโฆโ81Updated last year
- This repository contains all manually labeled data from the GermEval-2018 shared task.โ29Updated 6 years ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidataโ164Updated 2 years ago
- A tokenizer and sentence splitter for German and English web and social media texts.โ147Updated 8 months ago
- ๐งช Cutting-edge experimental spaCy components and featuresโ100Updated last year
- Linguistic and stylistic complexity measures for (literary) textsโ82Updated last year
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidataโ94Updated 2 years ago
- CONLL-U to Pandas DataFrameโ31Updated 7 years ago
- Named entity recognition for the legal domainโ42Updated 4 years ago
- Legal Reference Extractionโ34Updated 3 months ago
- A spaCy custom component that extracts and normalizes temporal expressionsโ55Updated 2 years ago
- German lemmatization with IWNLP as extension for spaCyโ24Updated 2 years ago
- Information extraction from English and German texts based on predicate logicโ138Updated 2 years ago
- Mining Legal Arguments in Court Decisions - Data and softwareโ68Updated 2 years ago
- Anonymization of legal cases (Fr) based on Flair embeddingsโ88Updated 4 years ago
- A minimal, pure Python library to interface with CoNLL-U format files.โ151Updated 2 years ago
- Use spaCy for NLP and output to the FoLiA XML format.โ12Updated last year
- A Dataset of German Legal Documents for Named Entity Recognitionโ172Updated 2 years ago
- Analyze Argumentation and Rhetorical Aspects in Scientific Writing.โ19Updated 2 years ago
- Python tools for interacting with Wikidataโ154Updated last year
- GC4LM: A Colossal (Biased) language model for Germanโ13Updated 4 years ago
- spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interfaceโ260Updated 11 months ago
- Repository for "Towards Robust Named Entity Recognition for Historic German"โ18Updated 4 years ago
- spaCy + UDPipeโ163Updated 3 years ago
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissionsโ19Updated 2 years ago
- A part-of-speech tagger with support for domain adaptation and external resources.โ23Updated 2 years ago