LeapBeyond / scrubadub_spacy
Clean personally identifiable information from dirty dirty text using spaCy.
★41 · Updated last year
Alternatives and similar repositories for scrubadub_spacy:
Users who are interested in scrubadub_spacy are comparing it to the libraries listed below:
- 💫 SpaCy wrapper for ConceptNet 💫 ★90 · Updated last year
- A PyPI package for easy text annotation in a Jupyter Notebook. ★28 · Updated 3 years ago
- Bag of, not words, but tricks! ★68 · Updated last year
- Information extraction from English and German texts based on predicate logic ★135 · Updated last year
- spaCy match and replace, maintaining conjugation ★35 · Updated 2 years ago
- Explainable Zero-Shot Topic Extraction ★62 · Updated 7 months ago
- Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health. ★91 · Updated 3 years ago
- Package that returns a company embedding given a company name ★45 · Updated 4 years ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality … ★106 · Updated last year
- A spaCy custom component that extracts and normalizes temporal expressions ★54 · Updated 2 years ago
- REMERGE - Multi-Word Expression discovery algorithm ★14 · Updated 2 years ago
- 🧪 Cutting-edge experimental spaCy components and features ★97 · Updated 10 months ago
- Dataframe Integration with spaCy. ★103 · Updated 4 years ago
- ★30 · Updated 2 years ago
- ☄️ Parallel and distributed training with spaCy and Ray ★53 · Updated last year
- Generate reports for spaCy models. ★29 · Updated 2 years ago
- ★70 · Updated 2 years ago
- XAI based human-in-the-loop framework for automatic rule-learning. ★48 · Updated 8 months ago
- Topic Inference with Zeroshot models ★61 · Updated last year
- A comprehensive tool for linguistic analysis of communities ★49 · Updated 3 years ago
- Running Prodigy for a team of annotators ★53 · Updated 4 years ago
- A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by The Incorporated… ★26 · Updated 2 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts. ★151 · Updated 9 months ago
- Asent is a python library for performing efficient and transparent sentiment analysis using spaCy. ★118 · Updated 11 months ago
- Regular spotlights of underrated NLP and Data Science GitHub repositories ★35 · Updated 4 years ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework ★80 · Updated 2 years ago
- sequence tagging with spaCy and crfsuite ★19 · Updated 2 years ago
- Python package for deduplication/entity resolution using active learning ★76 · Updated 6 months ago
- A collection of machine learning model cards and datasheets. ★73 · Updated 9 months ago
- Python text processing, pattern matching, and NLP framework ★63 · Updated last year