Pi-Star-Lab / csce642-deepRL
Assignments of CSCE-642: Deep Reinforcement Learning offered at Texas A&M University.
☆10 Updated 3 weeks ago
Alternatives and similar repositories for csce642-deepRL
Users who are interested in csce642-deepRL are comparing it to the repositories listed below.
- High dimensional black-box optimizer using Latent Action Monte Carlo Tree Search algorithm ☆29 Updated 3 years ago
- A PyTorch Implementation of Neural Turing Machine ☆13 Updated 5 years ago
- PyTorch implementation of OpenAI's Procgen ppo-baseline, built from scratch. ☆14 Updated last year
- Repository containing lectures from 2023 Machine Learning course ☆11 Updated 2 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a… ☆22 Updated 4 years ago
- LLM training parallelisms (DP, FSDP, TP, PP) in pure C ☆25 Updated 2 months ago
- Neuronal Circuit Policies ☆40 Updated 3 years ago
- Code for "Meta Learning Backpropagation And Improving It" @ NeurIPS 2021 https://arxiv.org/abs/2012.14905 ☆33 Updated 3 years ago
- Personal solutions to the Triton Puzzles ☆20 Updated last year
- Efficiently send large arrays across machines ☆16 Updated last year
- Repository of machine learning benchmarks ☆42 Updated last week
- An environment for learning formal mathematical reasoning from scratch ☆72 Updated last year
- Reinforcement Learning Assembly ☆92 Updated 4 years ago
- NeurIPS 2024 tutorial on LLM Inference ☆48 Updated 9 months ago
- Learn online intrinsic rewards from LLM feedback ☆43 Updated 9 months ago
- Fast reinforcement learning 💨 ☆26 Updated 2 months ago
- A Survey Analyzing Generalization in Deep Reinforcement Learning ☆34 Updated 10 months ago
- Understanding RL vision Distill article ☆24 Updated 2 years ago
- Nonparametric Score Estimators, ICML 2020 ☆36 Updated 4 years ago
- Code for Solving Black-Box Optimization Challenge via Learning Search Space Partition for Local Bayesian Optimization. ☆21 Updated 4 years ago
- Code associated to papers on superposition (in ML interpretability) ☆33 Updated 3 years ago
- Materials for the course Principles of AI: LLMs at UPenn (Stat 9911, Spring 2025). LLM architectures, training paradigms (pre- and post-t… ☆41 Updated 3 months ago
- Minimal RLHF implementation built on top of minGPT. ☆30 Updated last year
- Code for the NeurIPS 2021 paper "Deep Bandits Show-Off: Simple and Efficient Exploration with Deep Networks" ☆14 Updated 3 years ago
- NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference ☆68 Updated 9 months ago
- TaskMet: Task-driven Metric Learning for Model Learning ☆19 Updated last year