cardiffnlp/xlm-t

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/cardiffnlp/xlm-t)

cardiffnlp / xlm-t

Repository for XLM-T, a framework for evaluating multilingual language models on Twitter data

☆163

Alternatives and similar repositories for xlm-t

Users that are interested in xlm-t are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

cardiffnlp / tweeteval
View on GitHub
Repository for TweetEval
☆401Jul 8, 2022Updated 4 years ago
MinhDucBui / Multi3Hate
View on GitHub
☆15Jan 6, 2025Updated last year
cardiffnlp / timelms
View on GitHub
TimeLMs: Diachronic Language Models from Twitter
☆113Mar 5, 2024Updated 2 years ago
ltgoslo / simple_elmo_training
View on GitHub
Minimal code to train ELMo models in recent versions of TensorFlow
☆14Jun 16, 2026Updated last month
Ankush7890 / ssfinetuning
View on GitHub
A package for fine tuning of pretrained NLP transformers using Semi Supervised Learning
☆14Oct 27, 2021Updated 4 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
boschresearch / adversarial_meta_embeddings
View on GitHub
Resources related to EMNLP 2021 paper "FAME: Feature-Based Adversarial Meta-Embeddings for Robust Input Representations"
☆13Dec 14, 2021Updated 4 years ago
ielab / CharacterBERT-DR
View on GitHub
The offcial repository for 'CharacterBERT and Self-Teaching for Improving the Robustness of Dense Retrievers on Queries with Typos', SIGI…
☆16May 4, 2022Updated 4 years ago
nlp-uoregon / trankit
View on GitHub
Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing
☆795Jul 22, 2025Updated last year
tigerchen52 / LOVE
View on GitHub
ACL22 paper: Imputing Out-of-Vocabulary Embeddings with LOVE Makes Language Models Robust with Little Cost
☆41Nov 15, 2023Updated 2 years ago
cisnlp / GlotWeb
View on GitHub
[WWW 2026] 🕸 GlotWeb: Web Indexing for Minority Languages
☆17Apr 14, 2026Updated 3 months ago
catalpa-cl / inceptalytics
View on GitHub
An easy-to-use API for analyzing INCEpTION annotation projects.
☆17Oct 17, 2023Updated 2 years ago
jwieting / paraphrastic-representations-at-scale
View on GitHub
☆74Jul 2, 2021Updated 5 years ago
OFAI / million-post-corpus
View on GitHub
Annotated data set consisting of user comments posted to a German-language newspaper website
☆18Jun 28, 2018Updated 8 years ago
asahi417 / kex
View on GitHub
Kex is a python library for unsupervised keyword extraction from a document, providing an easy interface and benchmarks on 15 public data…
☆55Feb 17, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
cisnlp / ofa
View on GitHub
[NAACL 2024] A Framework aims to wisely initialize unseen subword embeddings in PLMs for efficient large-scale continued pretraining
☆18Nov 26, 2023Updated 2 years ago
MiuLab / GenDef
View on GitHub
Probing task; contextual embeddings -> textual definitions (EMNLP19)
☆12Apr 22, 2021Updated 5 years ago
drbenvincent / bayesian2afc
View on GitHub
Vincent, B. T. (2015) A tutorial on Bayesian models of Perception, Journal of Mathematical Psychology.
☆14Oct 17, 2017Updated 8 years ago
proycon / spacy2folia
View on GitHub
Use spaCy for NLP and output to the FoLiA XML format.
☆12Feb 27, 2024Updated 2 years ago
Social-AI-Studio / HatReD
View on GitHub
Dataset and code implementation for the paper "Decoding the Underlying Meaning of Multimodal Hateful Memes" (IJCAI'23).
☆22Jun 15, 2023Updated 3 years ago
t-systems-on-site-services-gmbh / german-elmo-model
View on GitHub
This is a german ELMo deep contextualized word representation. It is trained on a special German Wikipedia Text Corpus.
☆28Dec 15, 2019Updated 6 years ago
mayhewsw / pytorch-truecaser
View on GitHub
A simple neural truecaser written in pytorch and allennlp.
☆35Jun 17, 2024Updated 2 years ago
sonoisa / clip-japanese
View on GitHub
日本語CLIPモデル
☆13Sep 15, 2025Updated 10 months ago
TamSiuhin / BotPercent
View on GitHub
implementation of "BotPercent: Estimating Bot Populations in Twitter Communities" at EMNLP 2023, findings
☆22Feb 2, 2023Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
makcedward / nlpatl
View on GitHub
☆18Feb 28, 2022Updated 4 years ago
google-research / byt5
View on GitHub
☆547Feb 13, 2024Updated 2 years ago
amazon-science / irgr
View on GitHub
☆12Jul 6, 2023Updated 3 years ago
eg-nlp-community / nlp-reading-group
View on GitHub
☆12Jun 6, 2020Updated 6 years ago
tnhaider / poetry-emotion
View on GitHub
Poetry Corpora Annotated on Aesthetic Emotions
☆13Aug 2, 2022Updated 3 years ago
MohammadForouhesh / latent-aspect-detection
View on GitHub
Code and models for the paper "Latent Aspect Detection from Online Unsolicited Customer Reviews"
☆15May 16, 2022Updated 4 years ago
Influence-Disinformation-Networks / PNAS-Narrative-Networks
View on GitHub
This repository contains additional data used for the paper Automatic detection of influential actors in disinformation networks, PNAS, t…
☆18Dec 29, 2020Updated 5 years ago
abap34 / medCon2021-1st-place-solution
View on GitHub
1st place solution of 🦾😢 in https://www.kaggle.com/c/ai-medical-contest-2021/
☆10Apr 2, 2021Updated 5 years ago
jzbjyb / lm-calibration
View on GitHub
☆34Nov 17, 2021Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
stefan-it / german-gpt2
View on GitHub
German GPT-2 model
☆32Aug 17, 2021Updated 4 years ago
MilaNLProc / twitter-demographer
View on GitHub
A python package to enrich Twitter Data
☆75Jun 1, 2023Updated 3 years ago
tatHi / maxmatch_dropout
View on GitHub
☆10Sep 13, 2022Updated 3 years ago
boun-tabi / SQuAD-TR
View on GitHub
☆11Jun 8, 2024Updated 2 years ago
claws-lab / petgen
View on GitHub
A PyTorch implementation of the ACM SIGKDD 2021 paper titled "PETGEN: Personalized Text Generation Attack on Deep Sequence Embedding-base…
☆17Dec 19, 2023Updated 2 years ago
cindyxinyiwang / expand-via-lexicon-based-adaptation
View on GitHub
Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"
☆29Apr 2, 2022Updated 4 years ago
uds-lsv / GermEval-2018-Data
View on GitHub
This repository contains all manually labeled data from the GermEval-2018 shared task.
☆29Sep 28, 2018Updated 7 years ago