sebischair / Lbl2Vec
Lbl2Vec learns jointly embedded label, document and word vectors to retrieve documents with predefined topics from an unlabeled document corpus.
☆184Updated last year
Alternatives and similar repositories for Lbl2Vec:
Users that are interested in Lbl2Vec are comparing it to the libraries listed below
- Set of vectorizers that extract keyphrases with part-of-speech patterns from a collection of text documents and convert them into a docum…☆258Updated 4 months ago
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: …☆329Updated last year
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti…☆244Updated last year
- Clustering sentence embeddings to extract message intent☆172Updated 3 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆151Updated 10 months ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework☆80Updated 2 years ago
- Creating class-based TF-IDF matrices☆83Updated 2 years ago
- Few-shot Named Entity Recognition☆123Updated 2 years ago
- 💫 SpaCy wrapper for ConceptNet 💫☆90Updated last year
- This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-s…☆214Updated 2 months ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated last year
- All the goto functions you need to handle NLP use-cases, integrated in NLPretext☆140Updated this week
- [DEPRECATED] Adapt Transformer-based language models to new text domains☆87Updated last year
- Full named-entity (i.e., not tag/token) evaluation metrics based on SemEval’13☆171Updated 4 months ago
- A Python library for calculating a large variety of metrics from text☆332Updated 3 months ago
- Framework for fine-tuning pretrained transformers for Named-Entity Recognition (NER) tasks☆157Updated 2 years ago
- ☆158Updated 9 months ago
- 🧪 Cutting-edge experimental spaCy components and features☆97Updated 11 months ago
- SpanMarker for Named Entity Recognition☆422Updated 2 months ago
- Nesta's Skills Extractor Library☆129Updated 4 months ago
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i…☆46Updated 11 months ago
- A python package for benchmarking interpretability techniques on Transformers.☆213Updated 5 months ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆106Updated 10 months ago
- A Corpus of 475,000 Industrial Occupations☆66Updated 4 years ago
- Active Learning for Text Classification in Python☆608Updated last week
- Spacy NER annotator using ipywidgets☆121Updated 11 months ago
- A library to synthesize text datasets using Large Language Models (LLM)☆151Updated 2 years ago
- just a bunch of useful embeddings for scikit-learn pipelines☆486Updated 2 months ago
- 🔍 A statutory article retrieval dataset in French. (ACL 2022)☆39Updated last year
- A Python library aimed at dissecting and augmenting NER training data.☆58Updated last year