MaartenGr / BERTopic
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
☆6,178Updated this week
Related projects ⓘ
Alternatives and complementary repositories for BERTopic
- Top2Vec learns jointly embedded topic, document and word vectors.☆2,944Updated this week
- Minimal keyword extraction with BERT☆3,552Updated 4 months ago
- State-of-the-Art Text Embeddings☆15,368Updated this week
- A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coher…☆1,203Updated 10 months ago
- OCTIS: Comparing Topic Models is Simple! A python package to optimize and evaluate topic models (accepted at EACL2021 demo track)☆731Updated 3 months ago
- Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conve…☆4,109Updated 5 months ago
- Data augmentation for NLP☆4,454Updated 4 months ago
- BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)☆6,952Updated last year
- Efficient few-shot learning with Sentence Transformers☆2,239Updated 2 months ago
- Single-document unsupervised keyword extraction☆1,648Updated 10 months ago
- Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.☆1,741Updated 11 months ago
- 🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy☆1,352Updated 5 months ago
- Python package of Tomoto, the Topic Modeling Tool☆560Updated 3 months ago
- Beyond Accuracy: Behavioral Testing of NLP models with CheckList☆2,010Updated 10 months ago
- A very simple framework for state-of-the-art Natural Language Processing (NLP)☆13,946Updated this week
- [EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821☆3,426Updated last month
- TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs…☆2,974Updated 3 months ago
- A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)☆1,097Updated 2 months ago
- Super easy library for BERT based NLP models☆1,866Updated 3 months ago
- The implementation of DeBERTa☆1,991Updated last year
- Python Keyphrase Extraction module☆1,565Updated last year
- 💥 Fast State-of-the-Art Tokenizers optimized for Research and Production☆9,052Updated this week
- skweak: A software toolkit for weak supervision applied to NLP tasks☆920Updated 2 months ago
- Unsupervised text tokenizer for Neural Network-based text generation.☆10,295Updated 2 weeks ago
- End-to-end neural table-text understanding models.☆1,147Updated 3 months ago
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets☆3,986Updated this week
- ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)☆3,072Updated this week
- Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"☆6,178Updated 2 months ago
- NLP, before and after spaCy☆2,217Updated last year
- A BERT model for scientific text.☆1,524Updated 2 years ago