Repository for XLM-T, a framework for evaluating multilingual language models on Twitter data
☆161Feb 8, 2023Updated 3 years ago
Alternatives and similar repositories for xlm-t
Users that are interested in xlm-t are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Repository for TweetEval☆395Jul 8, 2022Updated 3 years ago
- TweetNLP for all the NLP enthusiasts working on Twitter! The Python library tweetnlp provides a collection of useful tools to analyze/und…☆385Apr 2, 2025Updated last year
- ☆14Jan 6, 2025Updated last year
- The official implementation of "Distilling Relation Embeddings from Pre-trained Language Models, EMNLP 2021 main conference", a high-qual…☆47Dec 2, 2024Updated last year
- Multilingual emotion analysis research☆21Apr 8, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Minimal code to train ELMo models in recent versions of TensorFlow☆14Apr 30, 2023Updated 2 years ago
- A package for fine tuning of pretrained NLP transformers using Semi Supervised Learning☆14Oct 27, 2021Updated 4 years ago
- The offcial repository for 'CharacterBERT and Self-Teaching for Improving the Robustness of Dense Retrievers on Queries with Typos', SIGI…☆16May 4, 2022Updated 3 years ago
- Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing☆794Jul 22, 2025Updated 8 months ago
- ACL22 paper: Imputing Out-of-Vocabulary Embeddings with LOVE Makes Language Models Robust with Little Cost☆42Nov 15, 2023Updated 2 years ago
- An easy-to-use API for analyzing INCEpTION annotation projects.☆17Oct 17, 2023Updated 2 years ago
- ☆75Jul 2, 2021Updated 4 years ago
- Use spaCy for NLP and output to the FoLiA XML format.☆12Feb 27, 2024Updated 2 years ago
- [NAACL 2024] A Framework aims to wisely initialize unseen subword embeddings in PLMs for efficient large-scale continued pretraining☆18Nov 26, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Kex is a python library for unsupervised keyword extraction from a document, providing an easy interface and benchmarks on 15 public data…☆54Feb 17, 2022Updated 4 years ago
- Implementation of "SMaLL-100: Introducing Shallow Multilingual Machine Translation Model for Low-Resource Languages" paper, accepted to E…☆25Nov 4, 2022Updated 3 years ago
- A framework to identify relations between ideas in temporal text corpora.☆28Apr 2, 2018Updated 8 years ago
- Probing task; contextual embeddings -> textual definitions (EMNLP19)☆11Apr 22, 2021Updated 4 years ago
- Vincent, B. T. (2015) A tutorial on Bayesian models of Perception, Journal of Mathematical Psychology.☆14Oct 17, 2017Updated 8 years ago
- A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coher…☆1,266Jul 24, 2025Updated 8 months ago
- Dataset and code implementation for the paper "Decoding the Underlying Meaning of Multimodal Hateful Memes" (IJCAI'23).☆19Jun 15, 2023Updated 2 years ago
- A simple neural truecaser written in pytorch and allennlp.☆33Jun 17, 2024Updated last year
- Annotated data set consisting of user comments posted to a German-language newspaper website☆17Jun 28, 2018Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- implementation of "BotPercent: Estimating Bot Populations in Twitter Communities" at EMNLP 2023, findings☆22Feb 2, 2023Updated 3 years ago
- ☆18Feb 28, 2022Updated 4 years ago
- ☆12Jul 6, 2023Updated 2 years ago
- ☆542Feb 13, 2024Updated 2 years ago
- Poetry Corpora Annotated on Aesthetic Emotions☆12Aug 2, 2022Updated 3 years ago
- ☆12Jun 6, 2020Updated 5 years ago
- ☆35Nov 17, 2021Updated 4 years ago
- ☆10Jun 8, 2024Updated last year
- ☆10Sep 13, 2022Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- German GPT-2 model☆32Aug 17, 2021Updated 4 years ago
- A python package to enrich Twitter Data☆75Jun 1, 2023Updated 2 years ago
- 1st place solution of 🦾😢 in https://www.kaggle.com/c/ai-medical-contest-2021/☆10Apr 2, 2021Updated 5 years ago
- This repository contains additional data used for the paper Automatic detection of influential actors in disinformation networks, PNAS, t…☆18Dec 29, 2020Updated 5 years ago
- This repository contains all manually labeled data from the GermEval-2018 shared task.☆29Sep 28, 2018Updated 7 years ago
- A PyTorch implementation of the ACM SIGKDD 2021 paper titled "PETGEN: Personalized Text Generation Attack on Deep Sequence Embedding-base…☆17Dec 19, 2023Updated 2 years ago
- A french litbank corpus☆10Jan 22, 2026Updated 2 months ago