MaartenGr / BERTopic
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
☆6,716Updated 3 weeks ago
Alternatives and similar repositories for BERTopic:
Users that are interested in BERTopic are comparing it to the libraries listed below
- Top2Vec learns jointly embedded topic, document and word vectors.☆3,032Updated 5 months ago
- Minimal keyword extraction with BERT☆3,839Updated last month
- Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conve…☆4,176Updated last week
- State-of-the-Art Text Embeddings☆16,611Updated last week
- A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coher…☆1,228Updated 3 months ago
- Efficient few-shot learning with Sentence Transformers☆2,477Updated 3 weeks ago
- OCTIS: Comparing Topic Models is Simple! A python package to optimize and evaluate topic models (accepted at EACL2021 demo track)☆764Updated 9 months ago
- Data augmentation for NLP☆4,556Updated 10 months ago
- Unsupervised text tokenizer for Neural Network-based text generation.☆10,836Updated last month
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets☆4,474Updated last week
- Python package of Tomoto, the Topic Modeling Tool☆579Updated 9 months ago
- Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages☆7,456Updated last week
- Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the…☆2,030Updated 8 months ago
- BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)☆7,373Updated last year
- Super easy library for BERT based NLP models☆1,895Updated 8 months ago
- A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.☆1,788Updated 2 months ago
- A Collection of BM25 Algorithms in Python☆1,156Updated 6 months ago
- Easy to use extractive text summarization with BERT☆1,428Updated last year
- The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic …☆3,541Updated last week
- Python Keyphrase Extraction module☆1,582Updated last year
- ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)☆3,354Updated last month
- 🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy☆1,383Updated 3 months ago
- 🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP☆12,651Updated last year
- Model explainability that works seamlessly with 🤗 transformers. Explain your transformers model in just 2 lines of code.☆1,340Updated last year
- Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)☆5,886Updated 2 years ago
- Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.☆1,816Updated this week
- Github repo with tutorials to fine tune transformers for diff NLP tasks☆848Updated last year
- This repository contains demos I made with the Transformers library by HuggingFace.☆10,821Updated 2 weeks ago
- [EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821☆3,550Updated 6 months ago
- Single-document unsupervised keyword extraction☆1,721Updated 2 months ago