Kekkodf / pypantera
A Python Package for NLP obfuscation using Differential Privacy
☆23Updated 4 months ago
Alternatives and similar repositories for pypantera:
Users that are interested in pypantera are comparing it to the libraries listed below
- Code for Findings of ACL 2021 "Differential Privacy for Text Analytics via Natural Text Sanitization"☆27Updated 3 years ago
- DP-BART for Privatized Text Rewriting under Local Differential Privacy☆16Updated 5 months ago
- ☆72Updated 2 years ago
- A codebase that makes differentially private training of transformers easy.☆171Updated 2 years ago
- ☆18Updated 3 years ago
- FairGrad, is an easy to use general purpose approach to enforce fairness for gradient descent based methods.☆14Updated last year
- A Synthetic Dataset for Personal Attribute Inference (NeurIPS'24 D&B)☆39Updated 4 months ago
- A package to run embedded topic modelling with ETM. Adapted from the original at: https://github.com/adjidieng/ETM☆95Updated last year
- A fast algorithm to optimally compose privacy guarantees of differentially private (DP) mechanisms to arbitrary accuracy.☆73Updated last year
- A Unified Framework for Quantifying Privacy Risk in Synthetic Data according to the GDPR☆82Updated 3 weeks ago
- Differentially-private transformers using HuggingFace and Opacus☆133Updated 7 months ago
- Annotated corpus + evaluation metrics for text anonymisation☆55Updated last year
- ☆35Updated last year
- ☆53Updated last year
- M4: Multi-generator, Multi-domain, and Multi-lingual Black-Box Machine-Generated Text Detection☆22Updated 11 months ago
- ☆25Updated last year
- [WWW 2020] Discriminative Topic Mining via Category-Name Guided Text Embedding☆50Updated 4 years ago
- Transformer-based model for learning authorship representations.☆35Updated 7 months ago
- A simple toolkit to process TREC files in Python.☆167Updated 7 months ago
- The Art and Science of Empirical Computer Science (Fall 2022)☆21Updated last year
- ☆49Updated last year
- A survey of privacy problems in Large Language Models (LLMs). Contains summary of the corresponding paper along with relevant code☆67Updated 10 months ago
- ☆27Updated 4 years ago
- ☆128Updated last year
- StereoSet: Measuring stereotypical bias in pretrained language models☆180Updated 2 years ago
- ☆39Updated 5 years ago
- Distribution of word meanings in Wikipedia for English, Italian, French, German and Spanish.☆10Updated 4 years ago
- ☆28Updated 6 months ago
- IR module for experimaestro☆11Updated last week
- Code for the WWW'23 paper "Sanitizing Sentence Embeddings (and Labels) for Local Differential Privacy"☆10Updated 2 years ago