ruanchaves / hashformers
Hashformers is a framework for hashtag segmentation with Transformers and Large Language Models (LLMs).
☆70Updated 7 months ago
Alternatives and similar repositories for hashformers:
Users that are interested in hashformers are comparing it to the libraries listed below
- spaCy match and replace, maintaining conjugation☆35Updated 2 years ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework☆80Updated 2 years ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models…☆37Updated 3 years ago
- Explainable Zero-Shot Topic Extraction☆62Updated 7 months ago
- ☆43Updated last year
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated last year
- A Python library aimed at dissecting and augmenting NER training data.☆58Updated last year
- Creating class-based TF-IDF matrices☆83Updated 2 years ago
- An easy-to-use Python module that helps you to extract the BERT embeddings for a large text dataset (Bengali/English) efficiently.☆36Updated last year
- Topic Inference with Zeroshot models☆61Updated last year
- KitanaQA: Adversarial training and data augmentation for neural question-answering models☆57Updated last year
- No Teacher BART distillation experiment for NLI tasks☆26Updated 4 years ago
- Source code and data for Like a Good Nearest Neighbor☆28Updated 3 months ago
- classy is a simple-to-use library for building high-performance Machine Learning models in NLP.☆86Updated this week
- Few-shot Named Entity Recognition☆123Updated 3 years ago
- ☆22Updated 3 years ago
- On Generating Extended Summaries of Long Documents☆78Updated 4 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆153Updated 10 months ago
- Code and data form the paper BERT Got a Date: Introducing Transformers to Temporal Tagging☆66Updated 2 years ago
- The NewSHead dataset is a multi-doc headline dataset used in NHNet for training a headline summarization model.☆37Updated 3 years ago
- A library to synthesize text datasets using Large Language Models (LLM)☆151Updated 2 years ago
- Sentence transformers models for SpaCy☆107Updated 2 years ago
- 💫 SpaCy wrapper for ConceptNet 💫☆92Updated last year
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod…☆15Updated last year
- Just another sentiment wrapper.☆17Updated 3 years ago
- A PyTorch-based open-source framework that provides methods for improving the weakly annotated data and allows researchers to efficiently…☆108Updated 7 months ago
- Regular spotlights of underrated NLP and Data Science GitHub repositories☆35Updated 4 years ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…☆40Updated 3 years ago
- spaCy pipeline component for generating spaCy KnowledgeBase Alias Candidates for Entity Linking☆85Updated 2 years ago
- IR-BERT at TREC 2020: Leveraging BERT for Semantic Search in Background Linking☆14Updated 3 years ago