MaartenGr / BERTopic
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
☆6,138Updated this week
Related projects ⓘ
Alternatives and complementary repositories for BERTopic
- Top2Vec learns jointly embedded topic, document and word vectors.☆2,936Updated 5 months ago
- Minimal keyword extraction with BERT☆3,541Updated 3 months ago
- State-of-the-Art Text Embeddings☆15,236Updated this week
- Efficient few-shot learning with Sentence Transformers☆2,226Updated last month
- A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coher…☆1,203Updated 9 months ago
- Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conve…☆4,101Updated 5 months ago
- OCTIS: Comparing Topic Models is Simple! A python package to optimize and evaluate topic models (accepted at EACL2021 demo track)☆729Updated 3 months ago
- BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)☆6,919Updated last year
- Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the…☆1,985Updated 2 months ago
- Data augmentation for NLP☆4,445Updated 4 months ago
- TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs…☆2,958Updated 3 months ago
- Beyond Accuracy: Behavioral Testing of NLP models with CheckList☆2,010Updated 10 months ago
- The implementation of DeBERTa☆1,986Updated last year
- Super easy library for BERT based NLP models☆1,861Updated 2 months ago
- 🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy☆1,350Updated 5 months ago
- Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.☆1,739Updated 10 months ago
- Notebooks using the Hugging Face libraries 🤗☆3,645Updated this week
- Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages☆7,284Updated last week
- Papers & presentation materials from Hugging Face's internal science day☆2,035Updated 4 years ago
- Easy to use extractive text summarization with BERT☆1,397Updated last year
- An open-source NLP research library, built on PyTorch.☆11,756Updated last year
- Python package of Tomoto, the Topic Modeling Tool☆559Updated 3 months ago
- Jupyter notebooks for the Natural Language Processing with Transformers book☆3,883Updated 2 months ago
- A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.☆1,608Updated 3 months ago
- Single-document unsupervised keyword extraction☆1,644Updated 10 months ago
- 💥 Fast State-of-the-Art Tokenizers optimized for Research and Production☆9,038Updated this week
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets☆3,953Updated this week
- Github repo with tutorials to fine tune transformers for diff NLP tasks☆818Updated 7 months ago
- Model explainability that works seamlessly with 🤗 transformers. Explain your transformers model in just 2 lines of code.☆1,289Updated last year
- [EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821☆3,411Updated 3 weeks ago