alessiodm / drl-zhLinks
Deep Reinforcement Learning: Zero to Hero!
☆2,223Updated last month
Alternatives and similar repositories for drl-zh
Users that are interested in drl-zh are comparing it to the libraries listed below
Sorting:
- Achieve the llama3 inference step-by-step, grasp the core concepts, master the process derivation, implement the code.☆612Updated 9 months ago
- Leetcode for Pytorch☆1,686Updated 4 months ago
- ☆1,451Updated 9 months ago
- ☆589Updated 2 months ago
- Hackers' Guide to Language Models☆1,861Updated 11 months ago
- A concise, beginner-friendly introduction to the core ideas of linear algebra.☆1,879Updated 2 months ago
- A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full…☆626Updated 8 months ago
- A JAX research toolkit for building, editing, and visualizing neural networks.☆1,830Updated 5 months ago
- R.L. methods and techniques.☆199Updated last year
- Grandmaster-Level Chess Without Search☆596Updated 10 months ago
- NAND is a logic simulator suite made entirely from NAND gates☆581Updated 2 weeks ago
- ☆513Updated last year
- Easily train AlphaZero-like agents on any environment you want!☆432Updated last year
- Solve puzzles. Improve your pytorch.☆3,814Updated last year
- Fine-tune LLM agents with online reinforcement learning☆1,245Updated last year
- From the Tensor to Stable Diffusion, a rough outline for a 1 week course.☆1,073Updated last month
- Llama 2 Everywhere (L2E)☆1,521Updated 3 months ago
- From the Transistor to the Web Browser, a rough outline for a 12 week course☆236Updated last year
- Text compression for generating keyboard expansions☆1,421Updated 2 years ago
- Learn WebAssembly by writing small programs!☆1,637Updated last year
- ☆249Updated last year
- Things you can do with the token embeddings of an LLM☆1,449Updated last month
- A Python library to inspect and modify the internal structure of a PDF file☆1,012Updated 3 months ago
- ☆186Updated 11 months ago
- Code behind Arxiv Papers☆536Updated last year
- LLM Analytics☆696Updated last year
- ☆1,273Updated 2 years ago
- the statistics handbook open source repository☆271Updated 2 months ago
- Heart Rate Variability Training with the Polar H10 Monitor☆627Updated last year
- A reimplementation of Stable Diffusion 3.5 in pure PyTorch☆688Updated 5 months ago