zhjohnchan / awesome-reinforcement-learning-in-nlpLinks

A curated list of reinforcement learning in NLP. :-)

☆20

Alternatives and similar repositories for awesome-reinforcement-learning-in-nlp

Users that are interested in awesome-reinforcement-learning-in-nlp are comparing it to the libraries listed below

Sorting:

eyalbd2 / RL-based-Language-Modeling
☆13Updated 6 years ago
divyanshuaggarwal / IndicXNLI
Code Repository for the IndicXNLI paper.
☆15Updated last year
allenai / EmbeddingRecycling
Embedding Recycling for Language models
☆38Updated last year
jaketae / ensemble-transformers
Ensembling Hugging Face transformers made easy
☆63Updated 2 years ago
rajaswa / DRIFT
DRIFT is a tool for Diachronic Analysis of Scientific Literature.
☆115Updated 2 years ago
youngerous / pytorch-lightning-nlp-template
Lightning template for easy prototyping⚡️
☆13Updated 2 years ago
UKPLab / MetaQA
MetaQA: Combining Expert Agents for Multi-Skill Question Answering
☆22Updated 3 years ago
rovle / gpt3-in-context-fitting
Experiments on GPT-3's ability to fit numerical models in-context.
☆14Updated 2 years ago
benpry / chain-of-thought-metaphor
This repo contains code for the paper "Psychologically-informed chain-of-thought prompts for metaphor understanding in large language mod…
☆14Updated 2 years ago
HanGuo97 / soft-Q-learning-for-text-generation
☆68Updated 2 years ago
google-research-datasets / Hinglish-TOP-Dataset
Consists of the largest (10K) human annotated code-switched semantic parsing dataset & 170K generated utterance using the CST5 augmentati…
☆39Updated 2 years ago
anthonywchen / MOCHA
Code & data for EMNLP 2020 paper "MOCHA: A Dataset for Training and Evaluating Reading Comprehension Metrics".
☆16Updated 3 years ago
AI4Bharat / indic-bart
Pre-trained, multilingual sequence-to-sequence models for Indian languages
☆48Updated 2 years ago
keyonvafa / sequential-rationales
Rationales for Sequential Predictions
☆40Updated 3 years ago
JoaoLages / RATransformers
RATransformers 🐭- Make your transformer (like BERT, RoBERTa, GPT-2 and T5) Relation Aware!
☆41Updated 2 years ago
EleutherAI / semantic-memorization
☆44Updated 7 months ago
gabeorlanski / stackoverflow-encourages-cheating
Code for the NLP4Prog workshop paper "Reading StackOverflow Encourages Cheating: Adding Question TextImproves Extractive Code Generation"
☆21Updated 3 years ago
tuvuumass / task-transferability
Data and code for our paper "Exploring and Predicting Transferability across NLP Tasks", to appear at EMNLP 2020.
☆50Updated 4 years ago
IBM / commonsense-rl
Knowledge-Aware RL agents with Commonsense Reasoning
☆79Updated 3 years ago
stas00 / porting
Helper scripts and notes that were used while porting various nlp models
☆46Updated 3 years ago
MilaNLProc / language-invariant-properties
☆21Updated 3 years ago
neubig / coderx
A highly sophisticated sequence-to-sequence model for code generation
☆40Updated 3 years ago
PierreColombo / RankingNLPSystems
What are the best Systems? New Perspectives on NLP Benchmarking
☆13Updated 2 years ago
McGill-NLP / MLQuestions
☆19Updated 3 years ago
martiansideofthemoon / hurdles-longform-qa
Official repository with code and data accompanying the NAACL 2021 paper "Hurdles to Progress in Long-form Question Answering" (https://a…
☆46Updated 2 years ago
microsoft / Litmus
AI Assistant for Building Reliable, High-performing and Fair Multilingual NLP Systems
☆46Updated 2 years ago
midas-research / bhaav
Dataset of sentences from Hindi stories tagged with different emotion tags
☆11Updated 5 years ago
sgugger / hf_examples
NLP Examples using the 🤗 libraries
☆41Updated 4 years ago
HendrikStrobelt / LMdiff
A diff tool for language models
☆42Updated last year
zphang / minimal-opt
☆67Updated 2 years ago