VinAIResearch / BERTweet
BERTweet: A pre-trained language model for English Tweets (EMNLP-2020)
☆575Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for BERTweet
- Repository for TweetEval☆357Updated 2 years ago
- A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coher…☆1,203Updated 10 months ago
- Language model fine-tuning on NER with an easy interface and cross-domain evaluation. "T-NER: An All-Round Python Library for Transformer…☆377Updated last year
- Compute Sentence Embeddings Fast!☆618Updated last year
- ☆344Updated 3 years ago
- A comparison and discussion of different NLP methods for 5-class sentiment classification on the SST-5 dataset.☆167Updated 3 months ago
- Pretrained BERT model for analysing COVID-19 Twitter data☆184Updated last year
- Officially supported AllenNLP models☆528Updated last year
- Applying BERT to named entity recognition in English and Russian.☆160Updated last year
- NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations☆777Updated 6 months ago
- Based on the Pytorch-Transformers library by HuggingFace. To be used as a starting point for employing Transformer models in text classif…☆306Updated 4 years ago
- Catalog of abusive language data (PLoS 2020)☆304Updated 5 months ago
- Models to perform neural summarization (extractive and abstractive) using machine learning transformers and a tool to convert abstractive…☆428Updated last year
- A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)☆1,097Updated 2 months ago
- This dataset contains 108,463 human-labeled and 656k noisily labeled pairs that feature the importance of modeling structure, context, an…☆555Updated 2 years ago
- Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenizati…☆661Updated 8 months ago
- TextAugment: Text Augmentation Library☆402Updated 9 months ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆363Updated 2 years ago
- New dataset☆299Updated 3 years ago
- Code for using and evaluating SpanBERT.☆891Updated last year
- Fixes contractions such as `you're` to `you are`☆312Updated 2 years ago
- The website for the CMU Language Technologies Institute low resource NLP bootcamp 2020☆598Updated 4 years ago
- A Python package implementing a new interpretable machine learning model for text classification (with visualization tools for Explainabl…☆336Updated last year
- skweak: A software toolkit for weak supervision applied to NLP tasks☆920Updated 2 months ago
- Topic Modeling in Embedding Spaces☆546Updated last year
- A small repo showing how to easily use BERT (or other transformers) for inference☆98Updated 4 years ago
- Autoregressive Entity Retrieval☆764Updated last year
- DeEpLearning models for MultIlingual haTespeech (DELIMIT): Benchmarking multilingual models across 9 languages and 16 datasets.☆107Updated last year
- ☆454Updated 3 years ago
- Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing☆736Updated last month