gillesdami / anonymizationLinks
Text anonymization in many languages using Faker
☆10Updated 5 years ago
Alternatives and similar repositories for anonymization
Users that are interested in anonymization are comparing it to the libraries listed below
Sorting:
- Vector AI — A platform for building vector based applications. Encode, query and analyse data using vectors.☆318Updated last year
- Fuzzy string matching, grouping, and evaluation.☆788Updated 7 months ago
- Repository for Project Insight: NLP as a Service☆319Updated 2 years ago
- Simplifies use of the Dedupe library via Pandas☆136Updated 2 years ago
- Vector Hub - Library for easy discovery, and consumption of State-of-the-art models to turn data into vectors. (text2vec, image2vec, vide…☆561Updated last year
- Type inference for Machine Learning pipelines☆25Updated 2 months ago
- A simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data…☆243Updated last year
- ☄️ Parallel and distributed training with spaCy and Ray☆56Updated 2 years ago
- Fuzzy matching and more functionality for spaCy.☆258Updated last year
- Data Processing and Machine learning methods for the Open Skills Project☆174Updated last year
- Metafeature Extraction for Unstructured Data☆103Updated 10 months ago
- All the goto functions you need to handle NLP use-cases, integrated in NLPretext☆141Updated 10 months ago
- DagsHub client libraries☆101Updated last week
- 🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)☆141Updated 2 years ago
- GPU-Powered Topic Modelling☆70Updated 3 years ago
- semantically distinct key phrase extraction using hilbert hashes.☆51Updated 3 years ago
- Semantic search through a vectorized Wikipedia (SentenceBERT) with the Weaviate vector search engine☆245Updated 2 years ago
- 🍳 Recipes for the Prodigy, our fully scriptable annotation tool☆504Updated last year
- ☆27Updated 5 years ago
- SpikeX - SpaCy Pipes for Knowledge Extraction☆402Updated 4 years ago
- Data search & enrichment library for Machine Learning → Easily find and add relevant features to your ML & AI pipeline from hundreds of p…☆349Updated 2 months ago
- Question Answering annotation platform - Plateforme d'annotation☆90Updated last year
- Toolkit to help understand "what lies" in word embeddings. Also benchmarking!☆474Updated 3 years ago
- NLP Cloud serves high performance pre-trained or custom models for NER, sentiment-analysis, classification, summarization, paraphrasing, …☆87Updated last year
- Models and Pipelines for the Spark NLP library☆113Updated 4 years ago
- ML pipeline orchestration and model deployments on Kubernetes.☆435Updated 2 years ago
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti…☆244Updated 2 years ago
- Search for PII in Python☆30Updated 2 years ago
- MLRun template functions and examples☆39Updated 2 weeks ago
- skweak: A software toolkit for weak supervision applied to NLP tasks☆926Updated last year