CODAIT / text-extensions-for-pandasLinks
Natural language processing support for Pandas dataframes.
☆216Updated 4 months ago
Alternatives and similar repositories for text-extensions-for-pandas
Users that are interested in text-extensions-for-pandas are comparing it to the libraries listed below
Sorting:
- A simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data…☆242Updated last year
- Data Analysis Baseline Library☆132Updated 8 months ago
- 🧬 A JupyterLab extension for annotating data with Prodigy☆189Updated 2 years ago
- Bag of, not words, but tricks!☆68Updated last year
- Dataframe Integration with spaCy.☆103Updated 4 years ago
- All the goto functions you need to handle NLP use-cases, integrated in NLPretext☆140Updated 3 months ago
- Notebooks configured to be run with Binder, usually found on my blog.☆42Updated 2 years ago
- Python package for publishing Jupyter Notebooks as Medium blogposts☆147Updated last year
- A Python module to convert natural language numerics into ints and floats.☆228Updated 9 months ago
- ☄️ Parallel and distributed training with spaCy and Ray☆54Updated last year
- Quote extraction for modular journalism (JournalismAI collab 2021)☆229Updated 3 years ago
- 🐦 Quickly annotate data from the comfort of your Jupyter notebook☆278Updated 2 years ago
- Doubt your data, find bad labels.☆513Updated last year
- A collection of machine learning model cards and datasheets.☆77Updated last month
- SpikeX - SpaCy Pipes for Knowledge Extraction☆399Updated 3 years ago
- Fuzzy matching and more functionality for spaCy.☆256Updated last year
- Information extraction from English and German texts based on predicate logic☆137Updated 2 years ago
- Super Simple Similarities Service☆150Updated 3 months ago
- NeatText a simple NLP package for cleaning textual data and text preprocessing☆72Updated last year
- Toolkit to help understand "what lies" in word embeddings. Also benchmarking!☆472Updated 2 years ago
- Here are the notebooks used during the spacy youtube series.☆103Updated 4 years ago
- data⎰describe: Pythonic EDA Accelerator for Data Science☆301Updated 2 years ago
- Tabular feature encoding pipelines for machine learning with options for string parsing, missing data infill, and stochastic perturbation…☆165Updated last week
- 🐍 Material for PyData Global 2021 Presentation: Effective Testing for Machine Learning Projects☆80Updated 3 years ago
- Spacy NER annotator using ipywidgets☆124Updated last year
- ☆152Updated 4 years ago
- 🧪 Simple data science experimentation & tracking with jupyter, papermill, and mlflow.☆183Updated last year
- A monolingual and cross-lingual meta-embedding generation and evaluation framework☆80Updated 3 years ago
- Clean personally identifiable information from dirty dirty text using spaCy.☆41Updated last year
- STriP Net: Semantic Similarity of Scientific Papers (S3P) Network☆86Updated 3 years ago