MaartenGr / BERTopic
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
☆6,788Updated 2 weeks ago
Alternatives and similar repositories for BERTopic
Users who are interested in BERTopic are comparing it to the libraries listed below.
- Top2Vec learns jointly embedded topic, document and word vectors.☆3,050Updated 6 months ago
- Minimal keyword extraction with BERT☆3,870Updated 2 months ago
- State-of-the-Art Text Embeddings☆16,812Updated last week
- Efficient few-shot learning with Sentence Transformers☆2,486Updated last month
- OCTIS: Comparing Topic Models is Simple! A python package to optimize and evaluate topic models (accepted at EACL2021 demo track)☆766Updated 10 months ago
- BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)☆7,430Updated last year
- A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coher…☆1,230Updated 3 months ago
- Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conve…☆4,185Updated last month
- ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)☆3,408Updated last month
- Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the…☆2,035Updated 9 months ago
- The implementation of DeBERTa☆2,093Updated last year
- Data augmentation for NLP☆4,569Updated 11 months ago
- TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs…☆3,168Updated 10 months ago
- Python package of Tomoto, the Topic Modeling Tool☆580Updated 9 months ago
- A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.☆1,816Updated 3 months ago
- Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.☆1,752Updated last year
- Single-document unsupervised keyword extraction☆1,727Updated 2 months ago
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets☆4,509Updated this week
- 💥 Fast State-of-the-Art Tokenizers optimized for Research and Production☆9,726Updated this week
- Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.☆1,836Updated last week
- A very simple framework for state-of-the-art Natural Language Processing (NLP)☆14,182Updated last week
- 🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy☆1,387Updated last week
- A Collection of BM25 Algorithms in Python☆1,173Updated 7 months ago
- BERT score for text generation☆1,746Updated 10 months ago
- [EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821☆3,560Updated 7 months ago
- Longformer: The Long-Document Transformer☆2,129Updated 2 years ago
- Python Keyphrase Extraction module☆1,582Updated last year
- Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"☆6,366Updated last month
- Unsupervised text tokenizer for Neural Network-based text generation.☆10,942Updated 2 months ago
- MTEB: Massive Text Embedding Benchmark☆2,554Updated this week