A curated list of reinforcement learning in NLP. :-)
☆21Oct 30, 2021Updated 4 years ago
Alternatives and similar repositories for awesome-reinforcement-learning-in-nlp
Users who are interested in awesome-reinforcement-learning-in-nlp are comparing it to the libraries listed below.
- A framework for evaluating Machine Translation models.☆12Apr 21, 2026Updated 2 months ago
- Yet another Python project template.☆13Sep 13, 2024Updated last year
- This repo contains code for the paper "Psychologically-informed chain-of-thought prompts for metaphor understanding in large language mod…☆14Apr 28, 2023Updated 3 years ago
- ☆15Oct 19, 2020Updated 5 years ago
- ☆18Oct 22, 2024Updated last year
- ☆20Jan 16, 2024Updated 2 years ago
- A library of translation-based text similarity measures☆25Dec 11, 2023Updated 2 years ago
- CS194-196 Course Project☆14Feb 20, 2025Updated last year
- Natural Universal Trigger Search (NUTS)☆21Apr 17, 2021Updated 5 years ago
- Modified version of fairseq, including new implementations for criterions using reinforcement learning methods.☆11Aug 14, 2019Updated 6 years ago
- Code for the paper "Continual Model-Based Reinforcement Learning with Hypernetworks"☆15Jul 28, 2021Updated 4 years ago
- Extend bert-nmt to context-aware translation.☆11May 24, 2021Updated 5 years ago
- ☆12Jun 30, 2025Updated 11 months ago
- ☆16Jun 9, 2026Updated 2 weeks ago
- ☆15Oct 5, 2025Updated 8 months ago
- Implements several Markov chain Monte Carlo (MCMC) algorithms for the latent Dirichlet allocation (LDA) model☆11Feb 11, 2020Updated 6 years ago
- Distributed Prioritized Experience Replay☆10Aug 8, 2018Updated 7 years ago
- An implementation of "Subspace Representations for Soft Set Operations and Sentence Similarities" (NAACL 2024)☆10May 31, 2024Updated 2 years ago
- Pallet loading problem solver with recursive partitioning approach for the packing of different rectangles in a rectangle.☆13Oct 1, 2012Updated 13 years ago
- Distributed DRL by Ray and TensorFlow Tutorial.☆10Dec 26, 2019Updated 6 years ago
- VisBERT: Demo web app for "How Does BERT Answer Questions?"☆11Jul 22, 2023Updated 2 years ago
- online judge as a service☆20Aug 12, 2014Updated 11 years ago
- A transformer model to predict pathogenic mutations☆12Jun 25, 2025Updated last year
- Materials for the "Recommender Systems through the lens of Decision Theory" tutorial delivered at the 30th Web Conference (WWW '21).☆11Apr 13, 2021Updated 5 years ago
- ☆10Dec 17, 2020Updated 5 years ago
- Evaluating and improving the faithfulness of the interpretations offered by Neural Module Networks☆13Jun 12, 2023Updated 3 years ago
- A platform for Applied Reinforcement Learning (Applied RL)☆14Jan 19, 2019Updated 7 years ago
- Open-sourced evaluation suite from the Monitoring Monitorability paper☆84Jun 11, 2026Updated 2 weeks ago
- A survey on machine learning for combinatorial optimization.☆13Dec 27, 2021Updated 4 years ago
- Research code for "Towards multi-task learning of speech and speaker recognition" at https://arxiv.org/pdf/2302.12773.pdf☆12Dec 2, 2024Updated last year
- ☆13Jun 11, 2021Updated 5 years ago
- A simple enigma machine in Go☆11Nov 14, 2022Updated 3 years ago
- Transformer-based approaches for efficient docstring generation from Python code.☆17Feb 16, 2026Updated 4 months ago
- ROCK Framework for Commonsense Causality Reasoning (CCR)☆10Jun 28, 2023Updated 3 years ago
- An easy to understand implementation of the paper "Model-Based Reinforcement Learning for Atari"☆18Sep 27, 2019Updated 6 years ago
- Official Implementation of "Transferring Inductive Biases Through Knowledge Distillation"☆15Jun 3, 2020Updated 6 years ago
- ☆13Dec 6, 2018Updated 7 years ago
- PyTorch implementation of R2D2 (Recurrent Replay Distributed DQN)☆13Nov 14, 2019Updated 6 years ago
- ☆12Oct 18, 2020Updated 5 years ago