twitter-research / lmsoc
Code for reproducing our paper: LMSOC: An Approach for Socially Sensitive Pretraining
☆13Updated 3 years ago
Alternatives and similar repositories for lmsoc:
Users that are interested in lmsoc are comparing it to the libraries listed below
- ☆14Updated 6 months ago
- Converter from UD-trees to BART representation☆36Updated last year
- Making a bridge between NLP models and Brain data☆18Updated 4 years ago
- interactive explorer for language models☆9Updated 5 years ago
- KnowMAN: Weakly Supervised Multinomial Adversarial Networks☆12Updated 3 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆22Updated 2 years ago
- Code repo for "Transformer on a Diet" paper☆31Updated 4 years ago
- ☆16Updated last year
- ☆16Updated 6 years ago
- Code for "Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking" (https://arxiv.org/abs/2…☆13Updated 2 years ago
- This is the official PyTorch repo for "UNIREX: A Unified Learning Framework for Language Model Rationale Extraction" (ICML 2022).☆24Updated 2 years ago
- Fast IdEntification of State-of-The-Art models using adaptive bandit algorithms☆14Updated 2 years ago
- GLaRA: Graph-based Labeling Rule Augmentation for Weakly Supervised Named Entity Recognition☆31Updated 3 years ago
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi…☆33Updated 11 months ago
- Code for our ACL '20 paper "Representation Engineering with Natural Language Explanations"☆29Updated 4 years ago
- ☆30Updated 3 years ago
- ☆22Updated 3 years ago
- ☆8Updated 9 months ago
- ☆12Updated 6 years ago
- Keras Implementation of Flair's Contextualized Embeddings☆27Updated 3 years ago
- ☆14Updated 7 years ago
- NLG Best Practices for Data-Efficient Modeling How to Train Production-Ready Models with Little Data☆10Updated 3 years ago
- ☆19Updated 2 years ago
- NER System Developed at CMU☆11Updated 7 years ago
- The repository for the paper "When Do You Need Billions of Words of Pretraining Data?"☆21Updated 4 years ago
- ☆14Updated 9 months ago
- A collection of utilities for writing labeling functions, transformation functions, and slicing functions.☆20Updated 5 years ago
- Generate BERT vocabularies and pretraining examples from Wikipedias☆18Updated 4 years ago
- GisPy: A Tool for Measuring Gist Inference Score in Text https://aclanthology.org/2022.wnu-1.5/☆12Updated 9 months ago
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆45Updated last year