alessiodm / drl-zh

Deep Reinforcement Learning: Zero to Hero!

☆2,118

Alternatives and similar repositories for drl-zh

Users who are interested in drl-zh are comparing it to the libraries listed below.

therealoliver / Deepdive-llama3-from-scratch
Achieve the llama3 inference step-by-step, grasp the core concepts, master the process derivation, implement the code.
☆597 Updated 4 months ago
Exorust / TorchLeet
Leetcode for Pytorch
☆1,030 Updated last week
Lesabotsy / bootcamp
☆587 Updated last month
Om-Alve / smolGPT
☆1,416 Updated 5 months ago
lsc4719 / MyViewOfLinuxSystems
☆518 Updated last year
dleemiller / WordLlama
Things you can do with the token embeddings of an LLM
☆1,445 Updated 3 months ago
yousef-rafat / miniDiffusion
A reimplementation of Stable Diffusion 3.5 in pure PyTorch
☆637 Updated last month
PaulPauls / llama3_interpretability_sae
A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full…
☆620 Updated 3 months ago
stas00 / the-art-of-debugging
The Art of Debugging
☆903 Updated 11 months ago
PsyChip / machina
OpenCV+YOLO+LLAVA powered video surveillance system
☆763 Updated this week
ivanbelenky / RL
R.L. methods and techniques.
☆199 Updated 8 months ago
thiswillbeyourgithub / AnkiAIUtils
AI-powered tools to enhance Anki flashcards with explanations, mnemonics, illustrations, and adaptive learning for medical school and bey…
☆763 Updated 5 months ago
eschluntz / compress
Text compression for generating keyboard expansions
☆1,416 Updated last year
andrewn6 / fromthetransistor
From the Transistor to the Web Browser, a rough outline for a 12 week course
☆230 Updated last year
srush / Tensor-Puzzles
Solve puzzles. Improve your pytorch.
☆3,647 Updated last year
samuel-vitorino / lm.rs
Minimal LLM inference in Rust
☆1,006 Updated 8 months ago
ArhanChaudhary / NAND
NAND is a logic simulator suite made entirely from NAND gates
☆573 Updated 2 months ago
google-deepmind / searchless_chess
Grandmaster-Level Chess Without Search
☆582 Updated 6 months ago
kieranabrennan / every-breath-you-take
Heart Rate Variability Training with the Polar H10 Monitor
☆608 Updated 9 months ago
desgeeko / pdfsyntax
A Python library to inspect and modify the internal structure of a PDF file
☆996 Updated last week
s-casci / tinyzero
Easily train AlphaZero-like agents on any environment you want!
☆430 Updated last year
google-deepmind / penzai
A JAX research toolkit for building, editing, and visualizing neural networks.
☆1,804 Updated 3 weeks ago
trholding / llama2.c
Llama 2 Everywhere (L2E)
☆1,519 Updated 6 months ago
mlecauchois / micrograd-cuda
☆248 Updated last year
EurekaLabsAI / ngram
The n-gram Language Model
☆1,433 Updated 11 months ago
hackclub / RAM-a-thon
Just a detailed in-depth, and comprehensive explanation of how computers operate internally, focusing on RAM and CPU aspects, respectivel…
☆337 Updated 9 months ago
Robertleoj / pen_plotter_robot
☆183 Updated 7 months ago
cfahlgren1 / natural-sql
A series of top performing Text to SQL LLMs
☆866 Updated last year
raghavan / PdfGptIndexer
RAG based tool for indexing and searching PDF text data using OpenAI API and FAISS (Facebook AI Similarity Search) index, designed for ra…
☆677 Updated 6 months ago
jla524 / fromthetensor
From the Tensor to Stable Diffusion, a rough outline for a 1 week course.
☆1,067 Updated 3 months ago