alessiodm / drl-zhView external linksLinks
Deep Reinforcement Learning: Zero to Hero!
☆2,260Oct 27, 2025Updated 3 months ago
Alternatives and similar repositories for drl-zh
Users that are interested in drl-zh are comparing it to the libraries listed below
Sorting:
- llama3 implementation one matrix multiplication at a time☆15,239May 23, 2024Updated last year
- Machine Learning Engineering Open Book☆16,675Updated this week
- Things you can do with the token embeddings of an LLM☆1,454Dec 1, 2025Updated 2 months ago
- A JAX research toolkit for building, editing, and visualizing neural networks.☆1,863Jun 22, 2025Updated 7 months ago
- Achieve the llama3 inference step-by-step, grasp the core concepts, master the process derivation, implement the code.☆624Feb 24, 2025Updated 11 months ago
- Python programs, usually short, of considerable difficulty, to perfect particular skills.☆24,272Updated this week
- Email security is a key part of internet communication. But what are SPF, DKIM, and DMARC, and how do they work? This guide will explain …☆1,242Jun 21, 2024Updated last year
- A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full…☆628Mar 23, 2025Updated 10 months ago
- LLM training in simple, raw C/CUDA☆28,814Jun 26, 2025Updated 7 months ago
- R.L. methods and techniques.☆199Jan 23, 2026Updated 3 weeks ago
- Minimal LLM inference in Rust☆1,030Oct 24, 2024Updated last year
- Interact with your SQL database, Natural Language to SQL using LLMs☆3,606Jul 24, 2024Updated last year
- The Startup CTO's Handbook, a book covering leadership, management and technical topics for leaders of software engineering teams☆13,942Jul 30, 2025Updated 6 months ago
- LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a ch…☆5,959Dec 11, 2025Updated 2 months ago
- Fine-tune LLM agents with online reinforcement learning☆1,247Mar 19, 2024Updated last year
- You like pytorch? You like micrograd? You love tinygrad! ❤️☆31,345Updated this week
- interactive visualization of 5 popular gradient descent methods with step-by-step illustration and hyperparameter tuning UI☆1,371Aug 4, 2024Updated last year
- DiscoGrad - automatically differentiate across conditional branches in C++ programs☆209Sep 12, 2024Updated last year
- This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."☆14,599Jan 31, 2026Updated 2 weeks ago
- Open source business application platform for fast development☆1,012Feb 19, 2025Updated 11 months ago
- Fully private LLM chatbot that runs entirely with a browser with no server needed. Supports Mistral and LLama 3.☆2,679Jun 5, 2024Updated last year
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆52,955Nov 12, 2025Updated 3 months ago
- Raspberry Pi Voice Assistant☆813Dec 22, 2024Updated last year
- Advanced Python Mastery (course by @dabeaz)☆13,038Dec 22, 2025Updated last month
- Python framework for building efficient data pipelines. It promotes modularity and collaboration, enabling the creation of complex pipeli…☆651Feb 1, 2026Updated 2 weeks ago
- llama3.np is a pure NumPy implementation for Llama 3 model.☆991Apr 27, 2025Updated 9 months ago
- A BERT that you can train on a (gaming) laptop.☆209Sep 8, 2023Updated 2 years ago
- An open source approach to locally record and enable searching everything you view on your Mac.☆2,469May 30, 2024Updated last year
- Distribute and run LLMs with a single file.☆23,704Updated this week
- Loki: Open-source solution designed to automate the process of verifying factuality☆1,131Oct 3, 2024Updated last year
- 🪓 Run Background Tasks at Scale☆6,521Updated this week
- ☆3,376Sep 21, 2024Updated last year
- Grandmaster-Level Chess Without Search☆606Jan 10, 2025Updated last year
- Solve puzzles. Learn CUDA.☆11,942Sep 1, 2024Updated last year
- Official codebase for the paper "Beyond A* Better Planning with Transformers via Search Dynamics Bootstrapping".☆375Jun 11, 2024Updated last year
- Open-source E-ink monitor. Mirror of https://gitlab.com/zephray/glider☆2,118Jan 26, 2026Updated 3 weeks ago
- A repository of Maker Skill Trees and templates to make your own.☆3,306Feb 4, 2026Updated last week
- CoreNet: A library for training deep neural networks☆7,013Oct 9, 2025Updated 4 months ago
- It is my belief that you, the postgraduate students and job-seekers for whom the book is primarily meant will benefit from reading it; ho…☆4,794Aug 22, 2025Updated 5 months ago