Deep Reinforcement Learning: Zero to Hero!
☆2,262Oct 27, 2025Updated 4 months ago
Alternatives and similar repositories for drl-zh
Users that are interested in drl-zh are comparing it to the libraries listed below
Sorting:
- llama3 implementation one matrix multiplication at a time☆15,242May 23, 2024Updated last year
- Machine Learning Engineering Open Book☆17,286Feb 21, 2026Updated 2 weeks ago
- Things you can do with the token embeddings of an LLM☆1,453Dec 1, 2025Updated 3 months ago
- A JAX research toolkit for building, editing, and visualizing neural networks.☆1,870Jun 22, 2025Updated 8 months ago
- Achieve the llama3 inference step-by-step, grasp the core concepts, master the process derivation, implement the code.☆625Feb 24, 2025Updated last year
- Python programs, usually short, of considerable difficulty, to perfect particular skills.☆24,284Updated this week
- Email security is a key part of internet communication. But what are SPF, DKIM, and DMARC, and how do they work? This guide will explain …☆1,244Jun 21, 2024Updated last year
- A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full…☆630Mar 23, 2025Updated 11 months ago
- LLM training in simple, raw C/CUDA☆29,054Jun 26, 2025Updated 8 months ago
- R.L. methods and techniques.☆199Feb 28, 2026Updated last week
- Minimal LLM inference in Rust☆1,032Oct 24, 2024Updated last year
- Interact with your SQL database, Natural Language to SQL using LLMs☆3,619Jul 24, 2024Updated last year
- The Startup CTO's Handbook, a book covering leadership, management and technical topics for leaders of software engineering teams☆14,007Jul 30, 2025Updated 7 months ago
- LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a ch…☆5,964Dec 11, 2025Updated 2 months ago
- Fine-tune LLM agents with online reinforcement learning☆1,248Mar 19, 2024Updated last year
- interactive visualization of 5 popular gradient descent methods with step-by-step illustration and hyperparameter tuning UI☆1,376Aug 4, 2024Updated last year
- You like pytorch? You like micrograd? You love tinygrad! ❤️☆31,523Updated this week
- DiscoGrad - automatically differentiate across conditional branches in C++ programs☆209Sep 12, 2024Updated last year
- This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."☆14,853Feb 22, 2026Updated 2 weeks ago
- Open source business application platform for fast development☆1,017Feb 19, 2025Updated last year
- Fully private LLM chatbot that runs entirely with a browser with no server needed. Supports Mistral and LLama 3.☆2,680Jun 5, 2024Updated last year
- Raspberry Pi Voice Assistant☆815Dec 22, 2024Updated last year
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆54,071Nov 12, 2025Updated 3 months ago
- Advanced Python Mastery (course by @dabeaz)☆13,113Dec 22, 2025Updated 2 months ago
- Python framework for building efficient data pipelines. It promotes modularity and collaboration, enabling the creation of complex pipeli…☆652Mar 1, 2026Updated last week
- llama3.np is a pure NumPy implementation for Llama 3 model.☆993Apr 27, 2025Updated 10 months ago
- A BERT that you can train on a (gaming) laptop.☆209Sep 8, 2023Updated 2 years ago
- An open source approach to locally record and enable searching everything you view on your Mac.☆2,470May 30, 2024Updated last year
- Distribute and run LLMs with a single file.☆23,776Updated this week
- Loki: Open-source solution designed to automate the process of verifying factuality☆1,134Oct 3, 2024Updated last year
- ☆3,383Sep 21, 2024Updated last year
- 🪓 Run Background Tasks at Scale☆6,664Updated this week
- Grandmaster-Level Chess Without Search☆618Jan 10, 2025Updated last year
- Solve puzzles. Learn CUDA.☆11,980Sep 1, 2024Updated last year
- Official codebase for the paper "Beyond A* Better Planning with Transformers via Search Dynamics Bootstrapping".☆375Jun 11, 2024Updated last year
- Open-source E-ink monitor. Mirror of https://gitlab.com/zephray/glider☆2,131Updated this week
- CoreNet: A library for training deep neural networks☆7,011Oct 9, 2025Updated 5 months ago
- A repository of Maker Skill Trees and templates to make your own.☆3,327Feb 20, 2026Updated 2 weeks ago
- It is my belief that you, the postgraduate students and job-seekers for whom the book is primarily meant will benefit from reading it; ho…☆4,811Aug 22, 2025Updated 6 months ago