MaartenGr / KeyBERT
Minimal keyword extraction with BERT
☆3,839Updated last month
Alternatives and similar repositories for KeyBERT:
Users that are interested in KeyBERT are comparing it to the libraries listed below
- Single-document unsupervised keyword extraction☆1,721Updated last month
- Leveraging BERT and c-TF-IDF to create easily interpretable topics.☆6,716Updated 3 weeks ago
- Top2Vec learns jointly embedded topic, document and word vectors.☆3,031Updated 5 months ago
- Easy to use extractive text summarization with BERT☆1,428Updated last year
- Python Keyphrase Extraction module☆1,582Updated last year
- Efficient few-shot learning with Sentence Transformers☆2,477Updated 3 weeks ago
- State-of-the-Art Text Embeddings☆16,611Updated last week
- A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coher…☆1,227Updated 3 months ago
- A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.☆1,788Updated 2 months ago
- OCTIS: Comparing Topic Models is Simple! A python package to optimize and evaluate topic models (accepted at EACL2021 demo track)☆760Updated 9 months ago
- 🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy☆1,383Updated 3 months ago
- Python package of Tomoto, the Topic Modeling Tool☆579Updated 8 months ago
- Fuzzy string matching, grouping, and evaluation.☆761Updated 2 months ago
- Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conve…☆4,174Updated last week
- BERT score for text generation☆1,738Updated 9 months ago
- A Collection of BM25 Algorithms in Python☆1,156Updated 6 months ago
- Dense Passage Retriever - is a set of tools and models for open domain Q&A task.☆1,787Updated 2 years ago
- The implementation of DeBERTa☆2,082Updated last year
- Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.☆1,813Updated last week
- MTEB: Massive Text Embedding Benchmark☆2,469Updated this week
- NeuSpell: A Neural Spelling Correction Toolkit☆692Updated last year
- [EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821☆3,550Updated 6 months ago
- A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)☆1,130Updated 8 months ago
- Python implementation of TextRank algorithms ("textgraphs") for phrase extraction☆2,174Updated 9 months ago
- ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)☆3,354Updated 3 weeks ago
- Compute Sentence Embeddings Fast!☆623Updated 2 years ago
- Set of vectorizers that extract keyphrases with part-of-speech patterns from a collection of text documents and convert them into a docum…☆262Updated 5 months ago
- ☆1,270Updated 2 years ago
- Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024☆1,978Updated last week
- 🦙 Integrating LLMs into structured NLP pipelines☆1,240Updated 3 months ago