Pi-Star-Lab / csce642-deepRLLinks
Assignments of CSCE-642: Deep Reinforcement Learning offered at Texas A&M University.
☆10Updated 2 months ago
Alternatives and similar repositories for csce642-deepRL
Users that are interested in csce642-deepRL are comparing it to the libraries listed below
Sorting:
- High dimensional black-box optimizer using Latent Action Monte Carlo Tree Search algorithm☆29Updated 3 years ago
- Official repository of Sparse ISO-FLOP Transformations for Maximizing Training Efficiency☆25Updated last year
- A PyTorch Implementation of Neural Turing Machine☆13Updated 5 years ago
- Code for "Meta Learning Backpropagation And Improving It" @ NeurIPS 2021 https://arxiv.org/abs/2012.14905☆33Updated 3 years ago
- Personal solutions to the Triton Puzzles☆20Updated last year
- Fast reinforcement learning 💨☆28Updated 4 months ago
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods.☆32Updated 5 months ago
- train with kittens!☆63Updated last year
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆22Updated 4 years ago
- Neuronal Circuit Policies☆40Updated 3 years ago
- ☆52Updated last year
- Code for the paper: https://arxiv.org/pdf/2309.06979.pdf☆21Updated last year
- NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference☆68Updated 11 months ago
- Scaling scaling laws with board games.☆53Updated 2 years ago
- ☆39Updated last year
- Simple and efficient pytorch-native transformer training and inference (batched)☆78Updated last year
- an environment based on XLA for deep learning compiler optimization research.☆23Updated 2 years ago
- 🐭 A tiny single-file implementation of Group Relative Policy Optimization (GRPO) as introduced by the DeepSeekMath paper☆38Updated 4 months ago
- Make triton easier☆48Updated last year
- Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch.☆14Updated last year
- LLM training parallelisms (DP, FSDP, TP, PP) in pure C☆26Updated 3 months ago
- ☆38Updated 2 years ago
- Experimental paper writing linter.☆35Updated last year
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆111Updated last month
- Reinforcement Learning Assembly☆92Updated 4 years ago
- ☆16Updated 3 years ago
- Code associated to papers on superposition (in ML interpretability)☆33Updated 3 years ago
- Efficiently send large arrays across machines☆17Updated last year
- Building your own autograd mechanism based on PyTorch tensor only (not Variable, can be seen as numpy array)☆22Updated last year
- Ancestral Gumbel-Top-k Sampling☆25Updated 5 years ago