CODAIT / text-extensions-for-pandasLinks
Natural language processing support for Pandas dataframes.
☆217Updated 6 months ago
Alternatives and similar repositories for text-extensions-for-pandas
Users that are interested in text-extensions-for-pandas are comparing it to the libraries listed below
Sorting:
- A simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data…☆243Updated last year
- 🧬 A JupyterLab extension for annotating data with Prodigy☆189Updated 2 years ago
- Dataframe Integration with spaCy.☆103Updated 4 years ago
- Bag of, not words, but tricks!☆68Updated last year
- Data Analysis Baseline Library☆133Updated 10 months ago
- All the goto functions you need to handle NLP use-cases, integrated in NLPretext☆140Updated 5 months ago
- A collection of machine learning model cards and datasheets.☆79Updated last month
- Quote extraction for modular journalism (JournalismAI collab 2021)☆230Updated 3 years ago
- ☄️ Parallel and distributed training with spaCy and Ray☆56Updated 2 years ago
- Here are the notebooks used during the spacy youtube series.☆103Updated 4 years ago
- A Python module to convert natural language numerics into ints and floats.☆231Updated 11 months ago
- Models and Pipelines for the Spark NLP library☆113Updated 4 years ago
- Python package for publishing Jupyter Notebooks as Medium blogposts☆148Updated 2 years ago
- STriP Net: Semantic Similarity of Scientific Papers (S3P) Network☆86Updated 3 years ago
- The LinkedIn Fairness Toolkit (LiFT) is a Scala/Spark library that enables the measurement of fairness in large scale machine learning wo…☆171Updated 2 years ago
- Super Simple Similarities Service☆154Updated 5 months ago
- SpikeX - SpaCy Pipes for Knowledge Extraction☆399Updated 4 years ago
- Doubt your data, find bad labels.☆515Updated last year
- NeatText a simple NLP package for cleaning textual data and text preprocessing☆72Updated last year
- Notebooks configured to be run with Binder, usually found on my blog.☆42Updated 2 years ago
- Clean personally identifiable information from dirty dirty text using spaCy.☆41Updated 2 years ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated last year
- A data labelling tool based on Streamlit.☆23Updated 4 years ago
- 🐦 Quickly annotate data from the comfort of your Jupyter notebook☆280Updated 2 years ago
- Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health.☆92Updated 3 years ago
- 🧪 Simple data science experimentation & tracking with jupyter, papermill, and mlflow.☆182Updated last year
- Toolkit to help understand "what lies" in word embeddings. Also benchmarking!☆471Updated 2 years ago
- Mastering spaCy, published by Packt☆136Updated last week
- 📈 The panel-highcharts package makes it easy to use HighCharts in Python, Notebooks and with HoloViz Panel.☆159Updated 2 years ago
- Package that returns a company embedding given a company name☆47Updated 5 years ago