sebischair / Lbl2Vec
Lbl2Vec learns jointly embedded label, document and word vectors to retrieve documents with predefined topics from an unlabeled document corpus.
☆175Updated 7 months ago
Related projects: ⓘ
- Set of vectorizers that extract keyphrases with part-of-speech patterns from a collection of text documents and convert them into a docum…☆251Updated 4 months ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆149Updated 3 months ago
- Clustering sentence embeddings to extract message intent☆166Updated 2 years ago
- Active Learning for Text Classification in Python☆548Updated this week
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti…☆240Updated last year
- This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-s…☆208Updated 3 months ago
- Creating class-based TF-IDF matrices☆81Updated last year
- All the goto functions you need to handle NLP use-cases, integrated in NLPretext☆139Updated 5 months ago
- A library to synthesize text datasets using Large Language Models (LLM)☆151Updated last year
- Few-shot Named Entity Recognition☆121Updated 2 years ago
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: …☆321Updated last year
- A Python library for calculating a large variety of metrics from text☆309Updated this week
- SpanMarker for Named Entity Recognition☆384Updated last month
- Language model fine-tuning on NER with an easy interface and cross-domain evaluation. "T-NER: An All-Round Python Library for Transformer…☆373Updated last year
- Zero and Few shot named entity & relationships recognition☆340Updated this week
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated 6 months ago
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy☆283Updated 10 months ago
- Sentence transformers models for SpaCy☆104Updated last year
- Google USE (Universal Sentence Encoder) for spaCy☆176Updated last year
- A monolingual and cross-lingual meta-embedding generation and evaluation framework☆79Updated 2 years ago
- Text analysis with networks.☆283Updated 4 months ago
- Concept Modeling: Topic Modeling on Images and Text☆192Updated last year
- A python package for benchmarking interpretability techniques on Transformers.☆207Updated 2 months ago
- just a bunch of useful embeddings☆458Updated last week
- BERT classification model for processing texts longer than 512 tokens. Text is first divided into smaller chunks and after feeding them t…☆113Updated 3 months ago
- Asent is a python library for performing efficient and transparent sentiment analysis using spaCy.☆114Updated 5 months ago
- Fuzzy matching and more functionality for spaCy.☆249Updated 2 months ago
- ☆58Updated 3 years ago
- Framework for fine-tuning pretrained transformers for Named-Entity Recognition (NER) tasks☆155Updated last year
- A Corpus of 475,000 Industrial Occupations☆63Updated 3 years ago