openredact / anonymizer
A Python module that provides multiple anonymization techniques for text (This is only a prototype)
☆21Updated 8 months ago
Alternatives and similar repositories for anonymizer:
Users that are interested in anonymizer are comparing it to the libraries listed below
- This is a prototype of a Python module for simple modification of document files.☆17Updated 3 years ago
- This is a prototype of a semi-automatic data anonymization app for German documents.☆20Updated last year
- 📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF☆38Updated 2 years ago
- This is a prototype of a multi-lingual suite for named-entity recognition in Python.☆21Updated 8 months ago
- An OCR evaluation tool☆64Updated last month
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissions☆19Updated last year
- A tool for quickly adding labels to unlabeled datasets☆20Updated last year
- Generate reports for spaCy models.☆29Updated 2 years ago
- A Streamlit app to add structured tags to a dataset card☆22Updated 2 years ago
- Python package for deduplication/entity resolution using active learning☆78Updated 4 months ago
- An easy way to chunk spaCy docs.☆18Updated 5 months ago
- Finds linguistic patterns effortlessly☆34Updated last year
- Two-Step Approach to OCR Post-Correction☆14Updated 7 months ago
- Named Entity Recognition☆17Updated 2 months ago
- Open Legal Data Platform☆102Updated last week
- GermaNet API for Python☆53Updated 6 years ago
- A spaCy custom component that extracts and normalizes temporal expressions☆52Updated last year
- GC4LM: A Colossal (Biased) language model for German☆13Updated 3 years ago
- Blazing fast language detection using fastText model☆23Updated 2 years ago
- cologne-phonetics implementation in python☆15Updated last year
- Scrapes some Finnish word definitions from English Wiktionary.☆7Updated last year
- Parse and convert numbers written in French, English, Spanish, Portuguese, German and Catalan into their digit representation.☆104Updated 2 months ago
- ☆22Updated 11 months ago
- Visualise, evaluate, and manage annotated data☆33Updated 2 years ago
- SeqScore: Scoring for named entity recognition and other sequence labeling tasks☆22Updated 2 weeks ago
- 🤗 Push your spaCy pipelines to the Hugging Face Hub☆43Updated 7 months ago
- Legal Reference Extraction☆29Updated 5 months ago
- Execute arbitrary SQL queries on 🤗 Datasets☆31Updated 11 months ago
- Prune your sklearn models☆19Updated 2 months ago
- Language detection using Spacy and Fasttext☆54Updated last year