A simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data, NLP Profiler will return either high-level insights or low-level/granular statistical information about the text in that column.
☆243May 12, 2024Updated last year
Alternatives and similar repositories for nlp_profiler
Users that are interested in nlp_profiler are comparing it to the libraries listed below
Sorting:
- ☆19Oct 10, 2020Updated 5 years ago
- Making BERT stretchy. Semantic Elasticsearch with Sentence Transformers☆161Sep 25, 2020Updated 5 years ago
- ☆13Aug 13, 2020Updated 5 years ago
- Fuzzy string matching, grouping, and evaluation.☆791Jul 10, 2025Updated 7 months ago
- Interactive tree-maps with SBERT & Hierarchical Clustering (HAC)☆30Dec 31, 2024Updated last year
- ☆153Sep 17, 2020Updated 5 years ago
- Vector AI — A platform for building vector based applications. Encode, query and analyse data using vectors.☆320Mar 1, 2024Updated 2 years ago
- A embed able annotation tool for end to end cross document co-reference☆41Mar 5, 2023Updated 3 years ago
- ☆17Sep 22, 2020Updated 5 years ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆359Feb 22, 2022Updated 4 years ago
- AI apps/benchmark for legaltech☆112Sep 22, 2021Updated 4 years ago
- NeuralQA: A Usable Library for Question Answering on Large Datasets with BERT☆234Jun 7, 2023Updated 2 years ago
- DeText: A Deep Neural Text Understanding Framework for Ranking and Classification Tasks☆1,267Mar 2, 2023Updated 3 years ago
- Vector Hub - Library for easy discovery, and consumption of State-of-the-art models to turn data into vectors. (text2vec, image2vec, vide…☆561Aug 20, 2024Updated last year
- Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the…☆2,083Aug 15, 2024Updated last year
- Load Testing ML Microservices for Robustness and Scalability☆14Feb 8, 2022Updated 4 years ago
- Top2Vec learns jointly embedded topic, document and word vectors.☆3,108Nov 14, 2024Updated last year
- skweak: A software toolkit for weak supervision applied to NLP tasks☆926Sep 2, 2024Updated last year
- Toolkit to help understand "what lies" in word embeddings. Also benchmarking!☆475Feb 6, 2023Updated 3 years ago
- A comprehensive reference for all topics related to Natural Language Processing☆2,039Oct 12, 2025Updated 4 months ago
- SpikeX - SpaCy Pipes for Knowledge Extraction☆403Jul 30, 2021Updated 4 years ago
- code and data for paper "One-shot Text Field Labeling using Attention and BeliefPropagation for Structure Information Extraction"☆61Aug 9, 2020Updated 5 years ago
- Contains relevant notebooks for the hands-on NLP workshop for the Analytics India Magazine Plugin Conference -2020 Edition☆71May 29, 2020Updated 5 years ago
- Awesome Artificial Intelligence, Machine Learning and Deep Learning as we learn it. Study notes and a curated list of awesome resources o…☆1,642Feb 9, 2026Updated 3 weeks ago
- Code release for "A Time-Aware Transformer Based Model for Suicide Ideation Detection on Social Media", EMNLP 2020.☆54Nov 16, 2020Updated 5 years ago
- A library that incorporates state-of-the-art explainers for text-based machine learning models and visualizes the result with a built-in …☆432Feb 5, 2024Updated 2 years ago
- Text preprocessing, representation and visualization from zero to hero.☆2,915Aug 29, 2023Updated 2 years ago
- Compute Sentence Embeddings Fast!☆624Mar 2, 2023Updated 3 years ago
- On Generating Extended Summaries of Long Documents☆78Jan 26, 2021Updated 5 years ago
- Developing a Knowledge Graph-based Question and Answering program to extract information from huge dataset☆94Jun 5, 2023Updated 2 years ago
- Leveraging BERT and c-TF-IDF to create easily interpretable topics.☆7,426Feb 20, 2026Updated last week
- Text analysis with networks.☆293Jan 16, 2026Updated last month
- This course covers how you can use NLP to do stuff.☆267Oct 12, 2020Updated 5 years ago
- FastFormers - highly efficient transformer models for NLU☆709Mar 21, 2025Updated 11 months ago
- fast Rust-based SVMlight parser☆11Feb 19, 2021Updated 5 years ago
- A word2vec negative sampling implementation with correct CBOW update.☆261Nov 8, 2021Updated 4 years ago
- The official implementation of "The Shapley Value of Classifiers in Ensemble Games" (CIKM 2021).☆224Jan 1, 2026Updated 2 months ago
- ✍️ A carefully curated list of NLP paper summaries☆1,478Dec 4, 2021Updated 4 years ago
- QED: A Framework and Dataset for Explanations in Question Answering☆119Aug 3, 2021Updated 4 years ago