MaartenGr / BERTopic
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
☆6,648 · Updated last week
Alternatives and similar repositories for BERTopic:
Users who are interested in BERTopic are comparing it to the libraries listed below.
- Top2Vec learns jointly embedded topic, document and word vectors. ☆3,024 · Updated 5 months ago
- Efficient few-shot learning with Sentence Transformers ☆2,443 · Updated this week
- Minimal keyword extraction with BERT ☆3,819 · Updated 3 weeks ago
- State-of-the-Art Text Embeddings ☆16,444 · Updated this week
- A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coher… ☆1,227 · Updated 2 months ago
- OCTIS: Comparing Topic Models is Simple! A python package to optimize and evaluate topic models (accepted at EACL2021 demo track) ☆754 · Updated 8 months ago
- Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conve… ☆4,169 · Updated 10 months ago
- Data augmentation for NLP ☆4,539 · Updated 9 months ago
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets ☆4,439 · Updated last week
- BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.) ☆7,306 · Updated last year
- The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic … ☆3,536 · Updated 2 weeks ago
- 🤗 Evaluate: A library for easily evaluating machine learning models and datasets. ☆2,180 · Updated 3 months ago
- TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs… ☆3,138 · Updated 8 months ago
- 🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy ☆1,379 · Updated 2 months ago
- A BERT model for scientific text. ☆1,583 · Updated 3 years ago
- [EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821 ☆3,535 · Updated 6 months ago
- Model explainability that works seamlessly with 🤗 transformers. Explain your transformers model in just 2 lines of code. ☆1,332 · Updated last year
- The implementation of DeBERTa ☆2,072 · Updated last year
- skweak: A software toolkit for weak supervision applied to NLP tasks ☆922 · Updated 7 months ago
- Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages ☆7,428 · Updated this week
- Dense Passage Retriever is a set of tools and models for open-domain Q&A tasks. ☆1,777 · Updated 2 years ago
- A very simple framework for state-of-the-art Natural Language Processing (NLP) ☆14,134 · Updated last week
- 🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (i… ☆8,608 · Updated this week
- Data augmentation for NLP, presented at EMNLP 2019 ☆1,630 · Updated 2 years ago
- ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23) ☆3,321 · Updated last week
- Jupyter notebooks for the Natural Language Processing with Transformers book ☆4,285 · Updated 7 months ago
- Papers & presentation materials from Hugging Face's internal science day ☆2,045 · Updated 4 years ago
- Single-document unsupervised keyword extraction ☆1,707 · Updated last month
- Beyond Accuracy: Behavioral Testing of NLP models with CheckList ☆2,030 · Updated last year
- Open source annotation tool for machine learning practitioners. ☆9,911 · Updated 4 months ago