cardiffnlp / timelmsLinks
TimeLMs: Diachronic Language Models from Twitter
☆108Updated last year
Alternatives and similar repositories for timelms
Users that are interested in timelms are comparing it to the libraries listed below
Sorting:
- A monolingual and cross-lingual meta-embedding generation and evaluation framework☆80Updated 3 years ago
- A python package for benchmarking interpretability techniques on Transformers.☆213Updated 8 months ago
- Collection of NLP model explanations and accompanying analysis tools☆144Updated 2 years ago
- A curated list of awesome datasets with human label variation (un-aggregated labels) in Natural Language Processing and Computer Vision, …☆85Updated last year
- Creating class-based TF-IDF matrices☆84Updated 2 years ago
- A library to synthesize text datasets using Large Language Models (LLM)☆152Updated 2 years ago
- An instruction-based benchmark for text improvements.☆141Updated 2 years ago
- A Python package to compute HONEST, a score to measure hurtful sentence completions in language models. Published at NAACL 2021.☆21Updated 2 months ago
- Research code for "What to Pre-Train on? Efficient Intermediate Task Selection", EMNLP 2021☆35Updated 3 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆93Updated 2 years ago
- ☆76Updated 3 years ago
- This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”☆85Updated 3 years ago
- A python package to run inference with HuggingFace language and vision-language checkpoints wrapping many convenient features.☆27Updated 9 months ago
- Code and data form the paper BERT Got a Date: Introducing Transformers to Temporal Tagging☆66Updated 3 years ago
- Google's BigBird (Jax/Flax & PyTorch) @ 🤗Transformers☆49Updated 2 years ago
- ☆13Updated last year
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…☆40Updated 3 years ago
- This repository contains the code for "Generating Datasets with Pretrained Language Models".☆188Updated 3 years ago
- ☆40Updated last year
- [NAACL 2021] This is the code for our paper `Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self…☆203Updated 2 years ago
- ☆66Updated last year
- The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization☆156Updated 2 years ago
- ☆87Updated 3 years ago
- Apps built using Inspired Cognition's Critique.☆58Updated 2 years ago
- A Python Commonsense Knowledge Inference Toolkit☆64Updated last year
- [DEPRECATED] Adapt Transformer-based language models to new text domains☆87Updated last year
- Ensembling Hugging Face transformers made easy☆63Updated 2 years ago
- The NewSHead dataset is a multi-doc headline dataset used in NHNet for training a headline summarization model.☆37Updated 3 years ago
- Models for automatically transforming toxic text to neutral☆34Updated last year
- Code associated with the paper "Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists"☆49Updated 3 years ago