MaartenGr / BERTopic
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
☆6,314Updated this week
Alternatives and similar repositories for BERTopic:
Users that are interested in BERTopic are comparing it to the libraries listed below
- Top2Vec learns jointly embedded topic, document and word vectors.☆2,973Updated 2 months ago
- Minimal keyword extraction with BERT☆3,656Updated 6 months ago
- Efficient few-shot learning with Sentence Transformers☆2,311Updated this week
- A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coher…☆1,211Updated last year
- State-of-the-Art Text Embeddings☆15,772Updated last week
- BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)☆7,075Updated last year
- Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conve…☆4,137Updated 7 months ago
- Data augmentation for NLP☆4,491Updated 6 months ago
- Single-document unsupervised keyword extraction☆1,668Updated last year
- OCTIS: Comparing Topic Models is Simple! A python package to optimize and evaluate topic models (accepted at EACL2021 demo track)☆742Updated 5 months ago
- The implementation of DeBERTa☆2,026Updated last year
- 1 line for thousands of State of The Art NLP models in hundreds of languages The fastest and most accurate way to solve text problems.☆886Updated last month
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets☆4,182Updated this week
- 🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy☆1,357Updated this week
- Jupyter notebooks for the Natural Language Processing with Transformers book☆3,972Updated 4 months ago
- Model explainability that works seamlessly with 🤗 transformers. Explain your transformers model in just 2 lines of code.☆1,316Updated last year
- Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the…☆1,996Updated 5 months ago
- 🤗 Evaluate: A library for easily evaluating machine learning models and datasets.☆2,082Updated last week
- Beyond Accuracy: Behavioral Testing of NLP models with CheckList☆2,021Updated last year
- A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.☆1,673Updated 5 months ago
- A Collection of BM25 Algorithms in Python☆1,082Updated 3 months ago
- Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages☆7,337Updated this week
- Beautiful visualizations of how language differs among document types.☆2,272Updated 3 months ago
- The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic …☆3,510Updated 3 weeks ago
- ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)☆3,183Updated 2 months ago
- Github repo with tutorials to fine tune transformers for diff NLP tasks☆831Updated 9 months ago
- Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.☆1,749Updated last year
- 🐍💯pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box.☆826Updated 4 months ago
- Papers & presentation materials from Hugging Face's internal science day☆2,041Updated 4 years ago
- A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)☆1,105Updated 4 months ago