MaartenGr / BERTopic
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
☆6,580Updated this week
Alternatives and similar repositories for BERTopic:
Users that are interested in BERTopic are comparing it to the libraries listed below
- Top2Vec learns jointly embedded topic, document and word vectors.☆3,018Updated 4 months ago
- State-of-the-Art Text Embeddings☆16,265Updated this week
- Minimal keyword extraction with BERT☆3,777Updated last month
- A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coher…☆1,225Updated last month
- OCTIS: Comparing Topic Models is Simple! A python package to optimize and evaluate topic models (accepted at EACL2021 demo track)☆753Updated 7 months ago
- Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conve…☆4,162Updated 9 months ago
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets☆4,381Updated this week
- The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic …☆3,528Updated last month
- Data augmentation for NLP☆4,525Updated 8 months ago
- BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)☆7,237Updated last year
- Efficient few-shot learning with Sentence Transformers☆2,415Updated 2 months ago
- Single-document unsupervised keyword extraction☆1,692Updated 2 weeks ago
- Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.☆1,749Updated last year
- A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)☆1,120Updated 6 months ago
- Super easy library for BERT based NLP models☆1,888Updated 7 months ago
- 🪐 End-to-end NLP workflows from prototype to production☆1,362Updated 5 months ago
- Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"☆6,302Updated 3 weeks ago
- [EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821☆3,519Updated 5 months ago
- 🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy☆1,370Updated last month
- Easy to use extractive text summarization with BERT☆1,421Updated last year
- Python package of Tomoto, the Topic Modeling Tool☆572Updated 7 months ago
- Language-Agnostic SEntence Representations☆3,628Updated 10 months ago
- A curated list of pretrained sentence and word embedding models☆2,252Updated 3 years ago
- A very simple framework for state-of-the-art Natural Language Processing (NLP)☆14,112Updated this week
- A Collection of BM25 Algorithms in Python☆1,118Updated 5 months ago
- Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.☆1,771Updated 2 weeks ago
- Github repo with tutorials to fine tune transformers for diff NLP tasks☆843Updated 11 months ago
- ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)☆3,271Updated 4 months ago
- Beautiful visualizations of how language differs among document types.