benouinirachid / patterns-finder
Simple, Fast, Powerful and Easily extensible python package for extracting patterns from text, with over than 60 predefined Regular Expressions.
☆23Updated last year
Related projects: ⓘ
- Python package for deduplication/entity resolution using active learning☆77Updated 3 weeks ago
- It's a cooler way to store simple linear models.☆28Updated 2 months ago
- Prune your sklearn models☆19Updated last year
- A data labelling tool based on Streamlit.☆23Updated 3 years ago
- A scikit-learn compatible estimator based on business-rules with interactive dashboard included☆28Updated 3 years ago
- STriP Net: Semantic Similarity of Scientific Papers (S3P) Network☆84Updated 2 years ago
- ☆65Updated 2 years ago
- Generate reports for spaCy models.☆28Updated 2 years ago
- ☄️ Parallel and distributed training with spaCy and Ray☆54Updated last year
- Bag of, not words, but tricks!☆68Updated 10 months ago
- Pipeline components that support partial_fit.☆42Updated 2 months ago
- NeatText a simple NLP package for cleaning textual data and text preprocessing☆66Updated 9 months ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated 6 months ago
- Python text processing, pattern matching, and NLP framework☆61Updated last year
- A different, but useful, textcat approach.☆15Updated 2 months ago
- this repo might get accepted☆29Updated 3 years ago
- A Python library for creating adversarial splits☆13Updated 2 years ago
- Just another sentiment wrapper.☆17Updated 2 years ago
- Easy PDF to text to spaCy text extraction in Python.☆33Updated 11 months ago
- spaCy match and replace, maintaining conjugation☆34Updated last year
- semantically distinct key phrase extraction using hilbert hashes.☆46Updated 2 years ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata☆151Updated last year
- Template-based generation of DAG cards from Metaflow classes, inspired by Google cards for machine learning models.☆29Updated 2 years ago
- A comprehensive tool for linguistic analysis of communities☆48Updated 2 years ago
- A python package to simulate typographical errors.☆30Updated 9 months ago
- Tutorial for Topic Modelling using PySpark and Spark NLP☆16Updated 4 years ago
- A Python library aimed at dissecting and augmenting NER training data.☆56Updated last year
- A Fuzzy Matching Approach for Clustering Strings☆26Updated last year
- Automatically transform all categorical, date-time, NLP variables to numeric in a single line of code for any data set any size.☆63Updated 7 months ago
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata☆90Updated last year