drob-xx / TopicTuner
HDBSCAN Tuning for BERTopic Models
☆42 Updated last year
Related projects
Alternatives and complementary repositories for TopicTuner
- ☆53 Updated 10 months ago
- Package to extract connotation frames ☆80 Updated 11 months ago
- Blazing fast topic modelling for short texts. ☆31 Updated last month
- Creating class-based TF-IDF matrices ☆82 Updated 2 years ago
- Powerful topic model visualization in Python ☆103 Updated 3 months ago
- REMERGE - Multi-Word Expression discovery algorithm ☆14 Updated 2 years ago
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy. ☆72 Updated last year
- ☆21 Updated this week
- Dataset and code for directed sentiment analysis in news text. ☆16 Updated 3 years ago
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i… ☆46 Updated 7 months ago
- Robust and fast topic models with sentence-transformers. ☆23 Updated this week
- ☆147 Updated 5 months ago
- A BERT-based application for reusable text classification at scale ☆37 Updated last year
- 💫 SpaCy wrapper for ConceptNet 💫 ☆88 Updated last year
- Augmenty is an augmentation library based on spaCy for augmenting texts. ☆151 Updated 6 months ago
- A collection of topic diversity measures for topic modeling ☆45 Updated 3 years ago
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further lang… ☆118 Updated 7 months ago
- Repository for the paper "MultiNERD: A Multilingual, Multi-Genre and Fine-Grained Dataset for Named Entity Recognition (and Disambiguatio… ☆41 Updated 9 months ago
- [LREC 2022] An off-the-shelf pre-trained Tweet NLP Toolkit (NER, tokenization, lemmatization, POS tagging, dependency parsing) + Tweeban… ☆102 Updated 10 months ago
- A spaCy custom component that extracts and normalizes temporal expressions ☆52 Updated last year
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality … ☆106 Updated 8 months ago
- Bi-encoder Based Entity Linking Tutorial. You can run the experiment in only 5 minutes. Experiments on a Colab Pro GPU are also supported! ☆33 Updated 3 years ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy ☆62 Updated 8 months ago
- Semantically Structured Sentence Embeddings ☆67 Updated last month
- Train, evaluate, and use different unsupervised topic modelling algorithms using a RESTful API. ☆36 Updated last year
- A package to run embedded topic modelling with ETM. Adapted from the original at: https://github.com/adjidieng/ETM ☆91 Updated last year
- Code and experiments for *BERTopic: Neural topic modeling with a class-based TF-IDF procedure* ☆70 Updated 11 months ago
- Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al. ☆96 Updated last year
- Set of vectorizers that extract keyphrases with part-of-speech patterns from a collection of text documents and convert them into a docum… ☆254 Updated 2 weeks ago
- KeypartX is a graph-based approach to represent perception (text in general) by key parts of speech. ☆0 Updated last year