neomatrix369 / nlp_profiler
A simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data, NLP Profiler will return either high-level insights or low-level/granular statistical information about the text in that column.
☆242Updated 8 months ago
Alternatives and similar repositories for nlp_profiler:
Users that are interested in nlp_profiler are comparing it to the libraries listed below
- All the goto functions you need to handle NLP use-cases, integrated in NLPretext☆139Updated 9 months ago
- Repository for Project Insight: NLP as a Service☆303Updated last year
- SummVis is an interactive visualization tool for text summarization.☆251Updated 2 years ago
- SpikeX - SpaCy Pipes for Knowledge Extraction☆397Updated 3 years ago
- A Python module to convert natural language numerics into ints and floats.☆225Updated 3 months ago
- NeuralQA: A Usable Library for Question Answering on Large Datasets with BERT☆231Updated last year
- Deploy transformers serverless on AWS Lambda☆121Updated 3 years ago
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti…☆243Updated last year
- A comprehensive reference for all topics related to building and maintaining microservices☆67Updated 2 years ago
- Doubt your data, find bad labels.☆508Updated 6 months ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated 10 months ago
- Natural language processing support for Pandas dataframes.☆217Updated last year
- Bag of, not words, but tricks!☆68Updated last year
- Explainable Zero-Shot Topic Extraction☆62Updated 4 months ago
- NeatText a simple NLP package for cleaning textual data and text preprocessing☆69Updated last year
- Spacy NER annotator using ipywidgets☆120Updated 9 months ago
- Regular spotlights of underrated NLP and Data Science GitHub repositories☆35Updated 4 years ago
- Toolkit to help understand "what lies" in word embeddings. Also benchmarking!☆472Updated last year
- 🧬 A JupyterLab extension for annotating data with Prodigy☆190Updated last year
- Google USE (Universal Sentence Encoder) for spaCy☆180Updated last year
- Extensive tutorials for the Advanced NLP Workshop in Open Data Science Conference Europe 2020. We will leverage machine learning, deep le…☆133Updated 4 years ago
- Fuzzy matching and more functionality for spaCy.☆255Updated 6 months ago
- Framework for fine-tuning pretrained transformers for Named-Entity Recognition (NER) tasks☆157Updated last year
- 📄 A repo containing notes and discussions for our weekly NLP/ML paper discussions.☆150Updated 4 years ago
- Real data science interview assignments☆94Updated 4 years ago
- Creating class-based TF-IDF matrices☆82Updated 2 years ago
- A library to synthesize text datasets using Large Language Models (LLM)☆151Updated 2 years ago
- Katana project is a FastAPI template for ASAP 🚀 ML API deployment☆111Updated 11 months ago