openredact / anonymizerLinks
A Python module that provides multiple anonymization techniques for text (This is only a prototype)
☆26Updated last year
Alternatives and similar repositories for anonymizer
Users that are interested in anonymizer are comparing it to the libraries listed below
Sorting:
- Python package for deduplication/entity resolution using active learning☆83Updated last year
- This is a prototype of a Python module for simple modification of document files.☆18Updated 4 years ago
- Fact checking baseline combining dense retrieval and textual entailment☆30Updated 5 months ago
- Annotated corpus + evaluation metrics for text anonymisation☆70Updated 3 weeks ago
- Robust and fast topic models with sentence-transformers.☆89Updated this week
- Named entity recognition for the legal domain☆43Updated 4 years ago
- Next-generation Punkt sentence boundary detection with zero dependencies☆27Updated 2 months ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆156Updated last year
- Stanford CRFM's initiative to assess potential compliance with the draft EU AI Act☆93Updated 2 years ago
- Generalist and Lightweight Model for Text Classification☆169Updated 2 weeks ago
- Materials for "IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation" 🇮🇹☆30Updated last year
- A spaCy wrapper for GliNER☆129Updated last year
- This is a prototype of a multi-lingual suite for named-entity recognition in Python.☆21Updated last year
- A lightweight Python library for constructing, processing, and visualizing constituent trees.☆68Updated last week
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax.☆59Updated last year
- multimodal document analysis☆166Updated 2 months ago
- A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.☆110Updated last year
- 📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF☆39Updated 3 years ago
- ☆39Updated 2 years ago
- Data and evaluation code for the paper WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NER (EMNLP 2…☆70Updated 3 years ago
- A python package to simulate typographical errors.☆38Updated 2 years ago
- Efficiently find the best-suited language model (LM) for your NLP task☆134Updated 6 months ago
- Information extraction from English and German texts based on predicate logic☆141Updated 2 years ago
- Nearly Inference Free Embeddings: make your RAG queries 500x faster☆69Updated 2 months ago
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆81Updated 2 years ago
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes☆213Updated 4 months ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆81Updated last year
- Using short models to classify long texts☆21Updated 2 years ago
- ☆64Updated last year
- Knowledge pills on Neural Search☆27Updated 2 years ago