Pi-Star-Lab / csce642-deepRL
Assignments of CSCE-642: Deep Reinforcement Learning offered at Texas A&M University.
☆10Updated 5 months ago
Alternatives and similar repositories for csce642-deepRL
Users who are interested in csce642-deepRL are comparing it to the repositories listed below.
- Neuronal Circuit Policies☆41Updated 3 years ago
- High dimensional black-box optimizer using Latent Action Monte Carlo Tree Search algorithm☆29Updated 3 years ago
- Code for "Meta Learning Backpropagation And Improving It" @ NeurIPS 2021 https://arxiv.org/abs/2012.14905☆33Updated 4 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆23Updated 5 years ago
- Fast reinforcement learning 💨☆28Updated 6 months ago
- Made for a reading group at the Center for Safe AGI.☆12Updated 3 years ago
- QuaRL is an open-source framework for systematically studying the effect of applying quantization to reinforcement learning algorithms.☆79Updated 2 years ago
- Minimal RLHF implementation built on top of minGPT.☆32Updated last year
- Personal solutions to the Triton Puzzles☆20Updated last year
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆43Updated 3 years ago
- Collection of Papers and Trials on Deep Learning to aid EE design☆45Updated 5 years ago
- mHC-lite: You Don’t Need 20 Sinkhorn-Knopp Iterations☆64Updated 3 weeks ago
- A PyTorch Implementation of Neural Turing Machine☆13Updated 5 years ago
- Distributed DRL by Ray and TensorFlow Tutorial.☆10Updated 6 years ago
- A C++ pytorch implementation of MuZero☆40Updated last year
- Code for the NeurIPS 2021 paper "Deep Bandits Show-Off: Simple and Efficient Exploration with Deep Networks"☆14Updated 3 years ago
- A Survey Analyzing Generalization in Deep Reinforcement Learning☆35Updated last year
- ☆30Updated 3 years ago
- Efficiently send large arrays across machines☆17Updated last year
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods.☆32Updated 8 months ago
- Parallelizing non-linear sequential models over the sequence length☆56Updated 7 months ago
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT)☆29Updated 3 years ago
- 🧶 Minimal PyTorch Soft Actor Critic (SAC) implementation☆38Updated 3 years ago
- A PyTorch implementation of DeepMind's MuZero agent☆36Updated 2 years ago
- MACTA: A Multi-agent Reinforcement Learning Approach for Cache Timing Attacks and Detection☆46Updated 2 years ago
- ☆35Updated last year
- Causal Analysis of Agent Behavior for AI Safety☆19Updated 2 years ago
- ☆40Updated 2 years ago
- Pytorch implementation of the Deep Deterministic Policy Gradients for Continuous Control☆26Updated 3 years ago
- A PyTorch implementation of SEED, originally created by Google Research for TensorFlow 2.☆15Updated 5 years ago