xycoord / deep-rl-course
This course aims to teach RL from the basics up to advanced algorithms such as PPO.
☆32Updated 2 weeks ago
Alternatives and similar repositories for deep-rl-course
Users who are interested in deep-rl-course are comparing it to the libraries listed below.
- ☆46Updated 10 months ago
- Repository of notes, code and notebooks in Python for the book "Reinforcement Learning: An Introduction" by Richard S. Sutton and Andrew …☆36Updated 5 months ago
- ☆29Updated last year
- ☆120Updated 2 months ago
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆49Updated last year
- Building GPT ...☆18Updated last year
- RAGs: Simple implementations of Retrieval Augmented Generation (RAG) Systems☆141Updated last year
- Deep Learning for Computer Vision☆60Updated last year
- NYU Artificial Intelligence Spring 2024☆61Updated last year
- PyTorch implementations of algorithms from "Reinforcement Learning: An Introduction by Sutton and Barto", along with various RL research …☆205Updated 5 months ago
- Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch 😊☆137Updated last week
- "LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!"☆141Updated last month
- Distributed training (multi-node) of a Transformer model☆93Updated last year
- Just some stuff for interview questions, books, annotated papers, notes, cheat sheets, etc. related to ML, AI, Deep Learning and Data Sc…☆123Updated 5 months ago
- Notebooks for fine-tuning PaliGemma☆117Updated 9 months ago
- ☆46Updated 8 months ago
- ☆42Updated last year
- ☆52Updated last year
- Collection of autoregressive model implementations☆85Updated 3 weeks ago
- A set of scripts and notebooks on LLM fine-tuning and dataset creation☆116Updated last year
- Repository of implementations of classic and SOTA RL algorithms from scratch in PyTorch☆221Updated last month
- A curated list of awesome mobile machine learning resources.☆150Updated 6 years ago
- A practical guide to diffusion models, implemented from scratch.☆245Updated last month
- RL significantly improves the reasoning capability of Qwen2.5-1.5B-Instruct☆31Updated 11 months ago
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆196Updated last year
- ML/DL Math and Method notes☆66Updated 2 years ago
- ☆42Updated last year
- General multi-task deep RL Agent☆185Updated last year
- ☆157Updated last year
- This repository contains a simple Llama 3 implementation in pure JAX.☆71Updated 11 months ago