eyalbd2 / RL-based-Language-ModelingLinks

☆13

Alternatives and similar repositories for RL-based-Language-Modeling

Users that are interested in RL-based-Language-Modeling are comparing it to the libraries listed below

Sorting:

stas00 / porting
Helper scripts and notes that were used while porting various nlp models
☆45Updated 3 years ago
jungokasai / beam_with_patience
☆46Updated 3 years ago
spyysalo / wiki-bert-pipeline
Generate BERT vocabularies and pretraining examples from Wikipedias
☆18Updated 5 years ago
anthonywchen / MOCHA
Code & data for EMNLP 2020 paper "MOCHA: A Dataset for Training and Evaluating Reading Comprehension Metrics".
☆16Updated 3 years ago
lucidrains / learning-to-expire-pytorch
An implementation of Transformer with Expire-Span, a circuit for learning which memories to retain
☆34Updated 4 years ago
JunShern / few-shot-adaptation
Exploring Few-Shot Adaptation of Language Models with Tables
☆24Updated 2 years ago
cambridgeltl / parameter-factorization
Factorization of the neural parameter space for zero-shot multi-lingual and multi-task transfer
☆39Updated 4 years ago
IBM / model-recycling
Ranking of fine-tuned HF models as base models.
☆35Updated 3 months ago
RobertCsordas / transformer_generalization
The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We s…
☆67Updated 2 years ago
rowanz / turingadvice
Evaluating Machines by their Real-World Language Use
☆33Updated 2 years ago
sunlab-osu / ReasonBERT
Code and pre-trained models for "ReasonBert: Pre-trained to Reason with Distant Supervision", EMNLP'2021
☆29Updated 2 years ago
allenai / sledgehammer
☆47Updated 5 years ago
prajjwal1 / fluence
A deep learning library based on Pytorch focussed on low resource language research and robustness
☆70Updated 3 years ago
HendrikStrobelt / LMdiff
A diff tool for language models
☆43Updated last year
google-research-datasets / QAmeleon
QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…
☆34Updated last year
nreimers / se-pytorch-xla
☆21Updated 3 years ago
martiansideofthemoon / hurdles-longform-qa
Official repository with code and data accompanying the NAACL 2021 paper "Hurdles to Progress in Long-form Question Answering" (https://a…
☆46Updated 3 years ago
carolinlawrence / gradient-rollback
Code for gradient rollback, which explains predictions of neural matrix factorization models, as for example used for knowledge base comp…
☆21Updated 4 years ago
allenai / EmbeddingRecycling
Embedding Recycling for Language models
☆39Updated 2 years ago
krandiash / gpt3-nli
Training a model without a dataset for natural language inference (NLI)
☆25Updated 5 years ago
acmi-lab / pretraining-with-nonsense
Pretraining summarization models using a corpus of nonsense
☆13Updated 3 years ago
gsarti / lambda-bert
A 🤗-style implementation of BERT using lambda layers instead of self-attention
☆69Updated 4 years ago
jungokasai / twist_decoding
☆29Updated 3 years ago
frankxu2004 / knnlm-why
Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"
☆58Updated 2 years ago
JeremyAlain / imitation_learning_from_language_feedback
This repository contains some of the code used in the paper "Training Language Models with Langauge Feedback at Scale"
☆27Updated 2 years ago
elephantmipt / bert-distillation
Distillation of BERT model with catalyst framework
☆78Updated 2 years ago
yandex-research / graph-glove
PyTorch code for the EMNLP 2020 paper "Embedding Words in Non-Vector Space with Unsupervised Graph Learning"
☆41Updated 4 years ago
nng555 / ssmba
☆62Updated 3 years ago
ofirpress / sandwich_transformer
This repository contains the code for running the character-level Sandwich Transformers from our ACL 2020 paper on Improving Transformer …
☆55Updated 4 years ago
awasthiabhijeet / Learning-From-Rules
Implementation of experiments in paper "Learning from Rules Generalizing Labeled Exemplars" to appear in ICLR2020 (https://openreview.net…
☆50Updated 2 years ago