CODAIT / text-extensions-for-pandas
Natural language processing support for Pandas dataframes.
☆217 Updated 7 months ago
Alternatives and similar repositories for text-extensions-for-pandas
Users who are interested in text-extensions-for-pandas are comparing it to the libraries listed below.
- A simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data… ☆243 Updated last year
- 🧬 A JupyterLab extension for annotating data with Prodigy ☆189 Updated 2 years ago
- Data Analysis Baseline Library ☆133 Updated 11 months ago
- Notebooks configured to be run with Binder, usually found on my blog. ☆42 Updated 2 years ago
- Bag of, not words, but tricks! ☆68 Updated last year
- Dataframe Integration with spaCy. ☆103 Updated 4 years ago
- Here are the notebooks used during the spacy youtube series. ☆103 Updated 4 years ago
- All the goto functions you need to handle NLP use-cases, integrated in NLPretext ☆141 Updated 6 months ago
- Doubt your data, find bad labels. ☆514 Updated last year
- Python package for publishing Jupyter Notebooks as Medium blogposts ☆148 Updated 2 years ago
- A collection of machine learning model cards and datasheets. ☆80 Updated last week
- SpikeX - SpaCy Pipes for Knowledge Extraction ☆399 Updated 4 years ago
- ☄️ Parallel and distributed training with spaCy and Ray ☆56 Updated 2 years ago
- 🐦 Quickly annotate data from the comfort of your Jupyter notebook ☆279 Updated 2 years ago
- Text analysis with networks. ☆288 Updated last week
- Super Simple Similarities Service ☆154 Updated 5 months ago
- A Python module to convert natural language numerics into ints and floats. ☆231 Updated last year
- The LinkedIn Fairness Toolkit (LiFT) is a Scala/Spark library that enables the measurement of fairness in large scale machine learning wo… ☆171 Updated 2 years ago
- Spacy NER annotator using ipywidgets ☆122 Updated last year
- Simple & Easy-to-use python modules to perform Quick Exploratory Data Analysis for any structured dataset! ☆105 Updated 2 years ago
- NeatText a simple NLP package for cleaning textual data and text preprocessing ☆72 Updated last year
- Clean personally identifiable information from dirty dirty text using spaCy. ☆41 Updated 2 years ago
- A data labelling tool based on Streamlit. ☆23 Updated 4 years ago
- Tabular feature encoding pipelines for machine learning with options for string parsing, missing data infill, and stochastic perturbation… ☆164 Updated 3 months ago
- Quote extraction for modular journalism (JournalismAI collab 2021) ☆230 Updated 3 years ago
- Toolkit to help understand "what lies" in word embeddings. Also benchmarking! ☆477 Updated 2 years ago
- STriP Net: Semantic Similarity of Scientific Papers (S3P) Network ☆86 Updated 3 years ago
- 🧪 Simple data science experimentation & tracking with jupyter, papermill, and mlflow. ☆183 Updated last year
- 🐍 Material for PyData Global 2021 Presentation: Effective Testing for Machine Learning Projects ☆82 Updated 3 years ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality … ☆106 Updated last year