lvwerra / rl-implementationsLinks
This repo contains a set of notebooks to reproduce reinforcement learning algorithms.
☆15Updated 2 years ago
Alternatives and similar repositories for rl-implementations
Users that are interested in rl-implementations are comparing it to the libraries listed below
Sorting:
- ☆31Updated 2 years ago
- Official code for the paper "Context-Aware Language Modeling for Goal-Oriented Dialogue Systems"☆34Updated 2 years ago
- Implementation of "Analysing Mathematical Reasoning Abilities of Neural Models"☆29Updated 2 years ago
- A lightweight PyTorch implementation of the Transformer-XL architecture proposed by Dai et al. (2019)☆37Updated 2 years ago
- Official repository for the paper "Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks"☆59Updated 3 years ago
- Causal Analysis of Agent Behavior for AI Safety☆18Updated last year
- A simple OpenAI Gym environment for Neural Architecture Search (NAS)☆30Updated 5 years ago
- Parallel data preprocessing for NLP and ML.☆34Updated 7 months ago
- ☆13Updated last week
- Repo to reproduce the First-Explore paper results☆37Updated 5 months ago
- Code associated to papers on superposition (in ML interpretability)☆28Updated 2 years ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32Updated last year
- Experiments with generating opensource language model assistants☆97Updated 2 years ago
- Neuro-Symbolic Visual Question Answering on Sort-of-CLEVR using PyTorch☆56Updated 3 years ago
- Amos optimizer with JEstimator lib.☆82Updated last year
- Meta-learning inductive biases in the form of useful conserved quantities.☆37Updated 2 years ago
- Experiments on GPT-3's ability to fit numerical models in-context.☆14Updated 2 years ago
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed☆35Updated last year
- A library to create and manage configuration files, especially for machine learning projects.☆78Updated 3 years ago
- 🎢 Creating and sharing simulation environments for embodied and synthetic data research☆190Updated last year
- Documentation for dynamic machine learning systems.☆29Updated 8 months ago
- Collaborative inference of latent diffusion via hivemind☆12Updated 2 years ago
- Interpretability tools for recurrent convolutional networks (DRC) that play Sokoban☆13Updated 3 months ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆30Updated 8 months ago
- Shakespeare transformer fine-tuned to generate positive sentiment samples using RLHF☆9Updated 2 years ago
- Building the cognitive-core to solve ARC-AGI-2☆21Updated 4 months ago
- Embedding Recycling for Language models☆38Updated last year
- [ICML 2024] Official code release accompanying the paper "diff History for Neural Language Agents" (Piterbarg, Pinto, Fergus)☆20Updated 9 months ago
- ☆41Updated 9 months ago
- Automatically take good care of your preemptible TPUs☆36Updated 2 years ago