Pi-Star-Lab / csce642-deepRLLinks
Assignments of CSCE-642: Deep Reinforcement Learning offered at Texas A&M University.
☆10Updated 5 months ago
Alternatives and similar repositories for csce642-deepRL
Users that are interested in csce642-deepRL are comparing it to the libraries listed below
Sorting:
- Neuronal Circuit Policies☆41Updated 3 years ago
- Repository containing lectures from 2023 Machine Learning course☆11Updated 2 years ago
- Personal solutions to the Triton Puzzles☆20Updated last year
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆23Updated 5 years ago
- LLM training parallelisms (DP, FSDP, TP, PP) in pure C☆26Updated last week
- Pytorch routines for (Ker)nel (Mac)hines☆10Updated 3 months ago
- TaskMet Task-driven Metric Learning for Model Learning☆20Updated last year
- Building your own autograd mechanism based on PyTorch tensor only (not Variable, can be seen as numpy array)☆24Updated 2 years ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆128Updated 3 months ago
- ☆12Updated 2 years ago
- Experimental paper writing linter.☆35Updated last year
- ☆35Updated last year
- Learn online intrinsic rewards from LLM feedback☆45Updated last year
- Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories☆42Updated 2 years ago
- FlexAttention w/ FlashAttention3 Support☆27Updated last year
- mHC-lite: You Don’t Need 20 Sinkhorn-Knopp Iterations☆64Updated 3 weeks ago
- Code for the paper: https://arxiv.org/pdf/2309.06979.pdf☆21Updated last year
- Simple and efficient pytorch-native transformer training and inference (batched)☆79Updated last year
- Code for Solving Black-Box Optimization Challenge via Learning Search Space Partition for Local Bayesian Optimization.☆21Updated 4 years ago
- A web based platform for collecting human actions in reinforcement learning environments☆31Updated 4 months ago
- Understanding RL vision Distill article☆25Updated 2 years ago
- Minimal RLHF implementation built on top of minGPT.☆32Updated last year
- High dimensional black-box optimizer using Latent Action Monte Carlo Tree Search algorithm☆29Updated 3 years ago
- Efficiently send large arrays across machines☆17Updated last year
- Make triton easier☆50Updated last year
- Code and data for paper "(How) do Language Models Track State?"☆21Updated 10 months ago
- Implementation of UltraMem, improved Product Key Memory design, from Bytedance AI labs☆28Updated 3 months ago
- Code for "Meta Learning Backpropagation And Improving It" @ NeurIPS 2021 https://arxiv.org/abs/2012.14905☆33Updated 4 years ago
- ☆23Updated last year
- Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch.☆14Updated last year