A simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data, NLP Profiler will return either high-level insights or low-level/granular statistical information about the text in that column.
☆243May 12, 2024Updated last year
Alternatives and similar repositories for nlp_profiler
Users that are interested in nlp_profiler are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆19Oct 10, 2020Updated 5 years ago
- Making BERT stretchy. Semantic Elasticsearch with Sentence Transformers☆161Sep 25, 2020Updated 5 years ago
- ☆13Aug 13, 2020Updated 5 years ago
- Fuzzy string matching, grouping, and evaluation.☆792Jul 10, 2025Updated 8 months ago
- Interactive tree-maps with SBERT & Hierarchical Clustering (HAC)☆30Dec 31, 2024Updated last year
- ☆152Sep 17, 2020Updated 5 years ago
- Vector AI — A platform for building vector based applications. Encode, query and analyse data using vectors.☆320Mar 1, 2024Updated 2 years ago
- ☆17Sep 22, 2020Updated 5 years ago
- NeuralQA: A Usable Library for Question Answering on Large Datasets with BERT☆234Jun 7, 2023Updated 2 years ago
- A embed able annotation tool for end to end cross document co-reference☆42Mar 5, 2023Updated 3 years ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆359Feb 22, 2022Updated 4 years ago
- Vector Hub - Library for easy discovery, and consumption of State-of-the-art models to turn data into vectors. (text2vec, image2vec, vide…☆561Aug 20, 2024Updated last year
- Awesome Artificial Intelligence, Machine Learning and Deep Learning as we learn it. Study notes and a curated list of awesome resources o…☆1,651Mar 9, 2026Updated 2 weeks ago
- Load Testing ML Microservices for Robustness and Scalability☆14Feb 8, 2022Updated 4 years ago
- DeText: A Deep Neural Text Understanding Framework for Ranking and Classification Tasks☆1,265Mar 2, 2023Updated 3 years ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework☆79Apr 29, 2022Updated 3 years ago
- AI apps/benchmark for legaltech☆114Sep 22, 2021Updated 4 years ago
- A comprehensive reference for all topics related to Natural Language Processing☆2,040Oct 12, 2025Updated 5 months ago
- Toolkit to help understand "what lies" in word embeddings. Also benchmarking!☆473Feb 6, 2023Updated 3 years ago
- Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the…☆2,088Aug 15, 2024Updated last year
- skweak: A software toolkit for weak supervision applied to NLP tasks☆926Sep 2, 2024Updated last year
- Top2Vec learns jointly embedded topic, document and word vectors.☆3,108Nov 14, 2024Updated last year
- code and data for paper "One-shot Text Field Labeling using Attention and BeliefPropagation for Structure Information Extraction"☆61Aug 9, 2020Updated 5 years ago
- Visualizations and helpers to improve and debug machine learning models for Rasa Open Source☆310Feb 10, 2022Updated 4 years ago
- SpikeX - SpaCy Pipes for Knowledge Extraction☆403Jul 30, 2021Updated 4 years ago
- A library that incorporates state-of-the-art explainers for text-based machine learning models and visualizes the result with a built-in …☆432Feb 5, 2024Updated 2 years ago
- ☆15Dec 20, 2020Updated 5 years ago
- Contains relevant notebooks for the hands-on NLP workshop for the Analytics India Magazine Plugin Conference -2020 Edition☆71May 29, 2020Updated 5 years ago
- A library to synthesize text datasets using Large Language Models (LLM)☆152Jan 17, 2023Updated 3 years ago
- Build and deploy a serverless data pipeline on AWS with no effort.☆111Feb 8, 2023Updated 3 years ago
- Repository for Project Insight: NLP as a Service☆320Feb 14, 2023Updated 3 years ago
- A repo where we have slides, notebooks, etc. related to ml-on-code☆20Nov 30, 2020Updated 5 years ago
- Leveraging BERT and c-TF-IDF to create easily interpretable topics.☆7,467Feb 20, 2026Updated last month
- Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.☆1,750Dec 20, 2023Updated 2 years ago
- Developing a Knowledge Graph-based Question and Answering program to extract information from huge dataset☆94Jun 5, 2023Updated 2 years ago
- Text preprocessing, representation and visualization from zero to hero.☆2,909Aug 29, 2023Updated 2 years ago
- Compute Sentence Embeddings Fast!☆625Mar 2, 2023Updated 3 years ago
- A word2vec negative sampling implementation with correct CBOW update.☆261Nov 8, 2021Updated 4 years ago
- BERT Probe: A python package for probing attention based robustness to character and word based adversarial evaluation. Also, with recipe…☆18Jun 24, 2022Updated 3 years ago