prajjwal1 / rl_paradigm
☆15Updated last year
Alternatives and similar repositories for rl_paradigm:
Users that are interested in rl_paradigm are comparing it to the libraries listed below
- Implements the Messenger environment and EMMA model.☆23Updated last year
- Generalised UDRL☆37Updated 2 years ago
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT)☆27Updated 2 years ago
- ☆13Updated 3 months ago
- Simplistic Pytorch Implementation of the Dreamer-RL☆21Updated 2 years ago
- ☆29Updated 3 years ago
- Sandbox environment for generalizable agent research☆24Updated 2 years ago
- Variational Reinforcement Learning☆16Updated 6 months ago
- [ICLR 2022 Spotlight] Multi-Stage Episodic Control for Strategic Exploration in Text Games☆14Updated 2 years ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Updated 2 years ago
- ☆28Updated 2 months ago
- Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu☆102Updated 2 years ago
- Official code for the paper "Context-Aware Language Modeling for Goal-Oriented Dialogue Systems"☆34Updated 2 years ago
- PyTorch Package For Quasimetric Learning☆41Updated 3 months ago
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"☆54Updated 7 months ago
- Contextual Bandits Action Elimination DQN☆20Updated 6 years ago
- Code repository complementing the ICLR 2021 paper "Unsupervised Object Keypoint Learning using Local Spatial Predictability" (https://arx…☆9Updated last month
- This is the pytorch implementation of the UAI2023 paper "A Trajectory is Worth Three Sentences: Multimodal Transformer for Offline Reinf…☆11Updated last year
- ☆53Updated 3 months ago
- Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers" (NeurIPS 2021)☆48Updated last year
- ☆19Updated 3 years ago
- Implementation of Proximal Policy Optimization in Jax+Flax☆17Updated last year
- ☆26Updated last year
- Implementation of CASCADE in Learning General World Models in a Handful of Reward-Free Deployments (NeurIPS 22).☆29Updated 2 years ago
- ☆40Updated 3 years ago
- GPT implementation in Flax☆18Updated 3 years ago
- Author's PyTorch implementation of SR-DICE for marginalized importance sampling☆15Updated 3 years ago
- ☆20Updated 2 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆20Updated 4 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 5 years ago