VinAIResearch/BERTweet


BERTweet: A pre-trained language model for English Tweets (EMNLP-2020)

☆609

Alternatives and similar repositories for BERTweet

Users that are interested in BERTweet are comparing it to the libraries listed below.

cardiffnlp / tweeteval
Repository for TweetEval
☆401 · Jul 8, 2022 · Updated 4 years ago
digitalepidemiologylab / covid-twitter-bert
Pretrained BERT model for analysing COVID-19 Twitter data
☆184 · Mar 25, 2023 · Updated 3 years ago
cambridge-wtwt / acl2020-wtwt-tweets
☆36 · Oct 1, 2020 · Updated 5 years ago
mit-ccc / TweebankNLP
[LREC 2022] An off-the-shelf pre-trained Tweet NLP Toolkit (NER, tokenization, lemmatization, POS tagging, dependency parsing) + Tweeban…
☆107 · Jan 24, 2024 · Updated 2 years ago
ucinlp / covid19-data
☆21 · Jul 28, 2022 · Updated 3 years ago
danielpreotiuc / complaints-social-media
Research on Complaints in Social Media (ACL 2019)
☆15 · Aug 15, 2019 · Updated 6 years ago
MilaNLProc / contextualized-topic-models
A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coher…
☆1,272 · Jul 24, 2025 · Updated last year
facebookresearch / SentAugment
SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…
☆359 · Feb 22, 2022 · Updated 4 years ago
datquocnguyen / VnDT
VnDT: A Vietnamese Dependency Treebank
☆24 · Nov 6, 2021 · Updated 4 years ago
sakibsh / ANTiVax
A novel dataset containing over 15 Million COVID-19 vaccine-related tweets and 15 Thousand labeled tweet for vaccine misinformation detec…
☆34 · Aug 24, 2024 · Updated last year
sebsk / CS224N-Project
☆38 · Jul 19, 2020 · Updated 6 years ago
ddangelov / Top2Vec
Top2Vec learns jointly embedded topic, document and word vectors.
☆3,102 · Nov 14, 2024 · Updated last year
GateNLP / semeval2019-hyperpartisan-bertha-von-suttner
SemEval 2019 Hyperpartisan News Detection - team Bertha von Suttner contribution
☆23 · Aug 15, 2019 · Updated 6 years ago
prrao87 / tweet-stance-prediction
Applying NLP transfer learning techniques to predict Tweet stance toward a topic
☆106 · Feb 10, 2019 · Updated 7 years ago
kglandt / stance-detection-in-covid-19-tweets
☆27 · Jul 30, 2021 · Updated 4 years ago
INESCTEC / Tweet2Story
Repository for the Tweet2Story framework for the extraction of narratives from tweets.
☆13 · Feb 13, 2022 · Updated 4 years ago
VinAIResearch / PhoBERT
PhoBERT: Pre-trained language models for Vietnamese (EMNLP-2020 Findings)
☆793 · Jul 23, 2024 · Updated 2 years ago
MaartenGr / BERTopic
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
☆7,756 · May 13, 2026 · Updated 2 months ago
s4zong / extract_COVID19_events_from_Twitter
Annotated corpus and code for "Extracting COVID-19 Events from Twitter".
☆44 · May 19, 2022 · Updated 4 years ago
firojalam / COVID-19-tweets-for-check-worthiness
COVID-19 Infodemic Twitter dataset
☆13 · Sep 5, 2021 · Updated 4 years ago
pysentimiento / pysentimiento
A Python multilingual toolkit for Sentiment Analysis and Social NLP tasks
☆658 · Jul 9, 2024 · Updated 2 years ago
cbaziotis / ekphrasis
Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenizati…
☆675 · Jun 2, 2025 · Updated last year
chuchun8 / MTSD
☆16 · Jun 6, 2021 · Updated 5 years ago
MilaNLProc / twitter-demographer
A python package to enrich Twitter Data
☆75 · Jun 1, 2023 · Updated 3 years ago
huggingface / sentence-transformers
State-of-the-Art Embeddings, Retrieval, and Reranking
☆18,944 · Updated this week
AkshitaJha / NLP_CSS_2017
☆10 · Jul 27, 2018 · Updated 7 years ago
GU-DataLab / stance-detection-KE-MLM
Official resource of the paper "Knowledge Enhanced Masked Language Model for Stance Detection", NAACL 2021
☆40 · Oct 26, 2021 · Updated 4 years ago
DocNow / hydrator
Turn Tweet IDs into Twitter JSON & CSV from your desktop!
☆439 · Apr 18, 2023 · Updated 3 years ago
studio-ousia / bpr
Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering
☆175 · Jun 6, 2021 · Updated 5 years ago
swisscom / ai-research-keyphrase-extraction
EmbedRank: Unsupervised Keyphrase Extraction using Sentence Embeddings (official implementation)
☆440 · Apr 7, 2023 · Updated 3 years ago
asahi417 / tner
Language model fine-tuning on NER with an easy interface and cross-domain evaluation. "T-NER: An All-Round Python Library for Transformer…
☆397 · May 11, 2023 · Updated 3 years ago
bhoov / exbert
A Visual Analysis Tool to Explore Learned Representations in Transformers Models
☆607 · Feb 7, 2024 · Updated 2 years ago
timoschick / pet
This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"
☆1,625 · Jun 12, 2023 · Updated 3 years ago
VinAIResearch / ViText2SQL
ViText2SQL: A dataset for Vietnamese Text-to-SQL semantic parsing (EMNLP-2020 Findings)
☆39 · Jul 22, 2024 · Updated 2 years ago
PAIR-code / lit
The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic …
☆3,658 · Jul 7, 2026 · Updated 2 weeks ago
echen102 / COVID-19-TweetIDs
The repository contains an ongoing collection of tweets IDs associated with the novel coronavirus COVID-19 (SARS-CoV-2), which commenced …
☆723 · Feb 22, 2023 · Updated 3 years ago
shehel / BERT_propaganda_detection
Propaganda detection using fine-tuned BERT
☆20 · Jul 21, 2022 · Updated 4 years ago
VinAIResearch / COVID19Tweet
WNUT-2020 Task 2: Identification of informative COVID-19 English Tweets
☆30 · Jul 22, 2024 · Updated 2 years ago
appvision-ai / fast-bert
Super easy library for BERT based NLP models
☆1,918 · Aug 19, 2024 · Updated last year