lvwerra / rl-implementations
This repo contains a set of notebooks to reproduce reinforcement learning algorithms.
☆15Updated 2 years ago
Alternatives and similar repositories for rl-implementations:
Users that are interested in rl-implementations are comparing it to the libraries listed below
- ☆31Updated 2 years ago
- Causal Analysis of Agent Behavior for AI Safety☆17Updated last year
- Official code for the paper "Context-Aware Language Modeling for Goal-Oriented Dialogue Systems"☆34Updated 2 years ago
- Repo to reproduce the First-Explore paper results☆37Updated 4 months ago
- ☆34Updated 2 years ago
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…☆34Updated last year
- Automatically generate simple meta-learning tasks from a very large space☆15Updated last year
- Official repository for the paper "Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks"☆59Updated 3 years ago
- Experiments on GPT-3's ability to fit numerical models in-context.☆14Updated 2 years ago
- A web based platform for collecting human actions in reinforcement learning environments☆28Updated last year
- A lightweight PyTorch implementation of the Transformer-XL architecture proposed by Dai et al. (2019)☆37Updated 2 years ago
- ☆36Updated last year
- ☆28Updated 2 years ago
- Implementation of "Analysing Mathematical Reasoning Abilities of Neural Models"☆29Updated 2 years ago
- Supplementary Data for Evolving Reinforcement Learning Algorithms☆46Updated 4 years ago
- [ICML 2024] Official code release accompanying the paper "diff History for Neural Language Agents" (Piterbarg, Pinto, Fergus)☆20Updated 8 months ago
- The Intermediate Goal of the project is to train a GPT like architecture to learn to summarise reddit posts from human preferences, as th…☆12Updated 3 years ago
- ☆31Updated last week
- Amos optimizer with JEstimator lib.☆82Updated 11 months ago
- Train very large language models in Jax.☆204Updated last year
- Minimum Description Length probing for neural network representations☆19Updated 2 months ago
- Unity Machine Learning Agents Toolkit☆46Updated last year
- Embedding Recycling for Language models☆38Updated last year
- A library to create and manage configuration files, especially for machine learning projects.☆77Updated 3 years ago
- ☆23Updated 3 years ago
- Web version of “Neuroevolution of Self-Interpretable Agents” (https://arxiv.org/abs/2003.08165)☆21Updated 3 years ago
- Clean RL implementation using MLX☆30Updated last year
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing☆48Updated 3 years ago
- Code for Fooling Contrastive Language-Image Pre-trainined Models with CLIPMasterPrints☆16Updated 6 months ago
- ☆16Updated last year