neomatrix369 / nlp_profiler
A simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data, NLP Profiler will return either high-level insights or low-level/granular statistical information about the text in that column.
☆242Updated 11 months ago
Alternatives and similar repositories for nlp_profiler:
Users that are interested in nlp_profiler are comparing it to the libraries listed below
- All the goto functions you need to handle NLP use-cases, integrated in NLPretext☆140Updated 3 weeks ago
- Toolkit to help understand "what lies" in word embeddings. Also benchmarking!☆472Updated 2 years ago
- SpikeX - SpaCy Pipes for Knowledge Extraction☆398Updated 3 years ago
- Repository for Project Insight: NLP as a Service☆304Updated 2 years ago
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti…☆245Updated last year
- Doubt your data, find bad labels.☆510Updated 9 months ago
- Natural language processing support for Pandas dataframes.☆216Updated last month
- Deploy transformers serverless on AWS Lambda☆122Updated 3 years ago
- NeuralQA: A Usable Library for Question Answering on Large Datasets with BERT☆231Updated last year
- A comprehensive reference for all topics related to building and maintaining microservices☆67Updated 2 years ago
- A library to synthesize text datasets using Large Language Models (LLM)☆151Updated 2 years ago
- SummVis is an interactive visualization tool for text summarization.☆252Updated 2 years ago
- Real data science interview assignments☆94Updated 4 years ago
- 📰Natural language processing (NLP) newsletter☆301Updated 4 years ago
- Few-shot Named Entity Recognition☆123Updated 3 years ago
- Fuzzy matching and more functionality for spaCy.☆256Updated 9 months ago
- skweak: A software toolkit for weak supervision applied to NLP tasks☆922Updated 7 months ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework☆80Updated 2 years ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated last year
- NeatText a simple NLP package for cleaning textual data and text preprocessing☆71Updated last year
- Explainable Zero-Shot Topic Extraction☆62Updated 7 months ago
- A Python module to convert natural language numerics into ints and floats.☆227Updated 6 months ago
- 🧬 A JupyterLab extension for annotating data with Prodigy☆189Updated last year
- 📄 A repo containing notes and discussions for our weekly NLP/ML paper discussions.☆150Updated 4 years ago
- Use fastai-v2 with HuggingFace's pretrained transformers☆111Updated 4 years ago
- A Python package implementing a new interpretable machine learning model for text classification (with visualization tools for Explainabl…☆342Updated 3 months ago
- Models and Pipelines for the Spark NLP library☆112Updated 3 years ago
- Making BERT stretchy. Semantic Elasticsearch with Sentence Transformers☆159Updated 4 years ago
- Framework for fine-tuning pretrained transformers for Named-Entity Recognition (NER) tasks☆158Updated 2 years ago
- Google USE (Universal Sentence Encoder) for spaCy☆183Updated 2 years ago