Lbl2Vec learns jointly embedded label, document and word vectors to retrieve documents with predefined topics from an unlabeled document corpus.
☆187Jan 31, 2024Updated 2 years ago
Alternatives and similar repositories for Lbl2Vec
Users that are interested in Lbl2Vec are comparing it to the libraries listed below.
- Learning from Neighbors: Unsupervised Text Classification☆17Sep 27, 2022Updated 3 years ago
- Weakly-supervised Text Classification Based on Keyword Graph☆23Jan 8, 2023Updated 3 years ago
- Set of vectorizers that extract keyphrases with part-of-speech patterns from a collection of text documents and convert them into a docum…☆267Nov 8, 2024Updated last year
- An offshoot of the Awesome-Public-Datasets repo I'm cultivating☆15Dec 3, 2019Updated 6 years ago
- ☆19Jul 25, 2023Updated 2 years ago
- Active Learning for Text Classification in Python☆640Apr 17, 2026Updated 3 weeks ago
- Experimental code used in pre-training the KBIR and KeyBART models☆27Jul 8, 2022Updated 3 years ago
- Code and datasets for EMNLP 2022 paper: Beyond prompting: Making Pre-trained Language Models Better Zero-shot Learners by Clustering Repr…☆19Jan 1, 2024Updated 2 years ago
- Topic modeling streamlit app.☆13Sep 7, 2024Updated last year
- 🌏 Modular retrievers for zero-shot multilingual IR.☆30Mar 6, 2024Updated 2 years ago
- This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-s…☆220Jan 20, 2025Updated last year
- Top2Vec learns jointly embedded topic, document and word vectors.☆3,106Nov 14, 2024Updated last year
- Creating class-based TF-IDF matrices☆91Oct 14, 2022Updated 3 years ago
- ☆10Sep 27, 2021Updated 4 years ago
- Code & Data for Comparative Opinion Summarization via Collaborative Decoding (Iso et al; Findings of ACL 2022)☆23Mar 3, 2025Updated last year
- Efficient few-shot learning with Sentence Transformers☆2,728Apr 17, 2026Updated 3 weeks ago
- ☆22Aug 24, 2023Updated 2 years ago
- Official PyTorch implementation of RIO☆19Jul 29, 2021Updated 4 years ago
- ☆25Dec 8, 2022Updated 3 years ago
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti…☆244Jun 19, 2023Updated 2 years ago
- A Fast, Adaptive, Stable, and Transferable Topic Model (NeurIPS 2024)☆156Jul 29, 2025Updated 9 months ago
- ACL 2023 Dual-Alignment Pre-training for Cross-lingual Sentence Embedding☆24Aug 21, 2024Updated last year
- Explainable Zero-Shot Topic Extraction☆65Aug 19, 2024Updated last year
- spaCy entry points for Curated Transformers☆32Mar 27, 2026Updated last month
- Checkout the new version at the link!☆22Dec 11, 2020Updated 5 years ago
- ☆23Jul 23, 2021Updated 4 years ago
- A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coher…☆1,267Jul 24, 2025Updated 9 months ago
- Leveraging BERT and c-TF-IDF to create easily interpretable topics.☆7,578Feb 20, 2026Updated 2 months ago
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata☆96Feb 5, 2026Updated 3 months ago
- Topic Inference with Zeroshot models☆61Jun 12, 2023Updated 2 years ago
- ☆59Apr 24, 2021Updated 5 years ago
- Search Engines with Autoregressive Language models☆295Apr 4, 2023Updated 3 years ago
- Use spaCy for NLP and output to the FoLiA XML format.☆12Feb 27, 2024Updated 2 years ago
- Simple Telegram bot to annotate and verify automatic speech recognition datasets☆12Mar 30, 2021Updated 5 years ago
- Source code for SIGIR 2022 paper.☆16Apr 25, 2022Updated 4 years ago
- ☆13Aug 13, 2020Updated 5 years ago
- Package for controllable summarization☆79Dec 7, 2022Updated 3 years ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…☆41Jan 5, 2022Updated 4 years ago
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale☆157Dec 20, 2023Updated 2 years ago