Minimal keyword extraction with BERT
☆4,116Feb 3, 2026Updated 3 weeks ago
Alternatives and similar repositories for KeyBERT
Users that are interested in KeyBERT are comparing it to the libraries listed below
Sorting:
- Leveraging BERT and c-TF-IDF to create easily interpretable topics.☆7,412Feb 20, 2026Updated last week
- Single-document unsupervised keyword extraction☆1,825Feb 11, 2026Updated 2 weeks ago
- Python Keyphrase Extraction module☆1,588Jul 12, 2023Updated 2 years ago
- State-of-the-Art Text Embeddings☆18,298Feb 20, 2026Updated last week
- Top2Vec learns jointly embedded topic, document and word vectors.☆3,106Nov 14, 2024Updated last year
- ☆448Oct 26, 2022Updated 3 years ago
- Fuzzy string matching, grouping, and evaluation.☆791Jul 10, 2025Updated 7 months ago
- Efficient few-shot learning with Sentence Transformers☆2,688Dec 11, 2025Updated 2 months ago
- A very simple framework for state-of-the-art Natural Language Processing (NLP)☆14,359Oct 27, 2025Updated 4 months ago
- A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coher…☆1,265Jul 24, 2025Updated 7 months ago
- Set of vectorizers that extract keyphrases with part-of-speech patterns from a collection of text documents and convert them into a docum…☆267Nov 8, 2024Updated last year
- Data augmentation for NLP☆4,645Jun 24, 2024Updated last year
- EmbedRank: Unsupervised Keyphrase Extraction using Sentence Embeddings (official implementation)☆440Apr 7, 2023Updated 2 years ago
- Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and a…☆24,295Updated this week
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets☆4,875Updated this week
- Open source annotation tool for machine learning practitioners.☆10,546Feb 17, 2026Updated last week
- Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conve…☆4,231Aug 25, 2025Updated 6 months ago
- Deep Keyphrase Extraction using BERT☆261Feb 21, 2022Updated 4 years ago
- [EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821☆3,641Oct 16, 2024Updated last year
- ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)☆3,782Oct 14, 2025Updated 4 months ago
- Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.☆1,752Dec 20, 2023Updated 2 years ago
- skweak: A software toolkit for weak supervision applied to NLP tasks☆926Sep 2, 2024Updated last year
- 💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows☆12,210Updated this week
- 🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP☆12,817Jan 23, 2024Updated 2 years ago
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.☆32,170Sep 30, 2025Updated 5 months ago
- Active Learning for Text Classification in Python☆639Feb 1, 2026Updated 3 weeks ago
- BertViz: Visualize Attention in Transformer Models☆7,921Jan 8, 2026Updated last month
- Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀☆1,687Oct 23, 2024Updated last year
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities☆22,033Jan 23, 2026Updated last month
- A library for efficient similarity search and clustering of dense vectors.☆39,195Updated this week
- Python implementation of TextRank algorithms ("textgraphs") for phrase extraction☆2,209Feb 15, 2026Updated last week
- An open-source NLP research library, built on PyTorch.☆11,889Nov 22, 2022Updated 3 years ago
- Retrieval and Retrieval-augmented LLMs☆11,329Dec 15, 2025Updated 2 months ago
- OCTIS: Comparing Topic Models is Simple! A python package to optimize and evaluate topic models (accepted at EACL2021 demo track)☆797Feb 20, 2026Updated last week
- Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the…☆2,083Aug 15, 2024Updated last year
- [ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings☆2,022Jan 15, 2025Updated last year
- Unsupervised text tokenizer for Neural Network-based text generation.☆11,668Feb 22, 2026Updated last week
- Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.☆2,023Feb 21, 2026Updated last week
- A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.☆2,087Oct 16, 2025Updated 4 months ago