alessiodm / drl-zhLinks
Deep Reinforcement Learning: Zero to Hero!
☆2,146Updated this week
Alternatives and similar repositories for drl-zh
Users that are interested in drl-zh are comparing it to the libraries listed below
Sorting:
- Achieve the llama3 inference step-by-step, grasp the core concepts, master the process derivation, implement the code.☆607Updated 6 months ago
- ☆1,423Updated 6 months ago
- ☆588Updated 2 months ago
- Leetcode for Pytorch☆1,520Updated last month
- ☆516Updated last year
- Fine-tune LLM agents with online reinforcement learning☆1,216Updated last year
- Text compression for generating keyboard expansions☆1,418Updated last year
- Grandmaster-Level Chess Without Search☆588Updated 7 months ago
- grep for words with similar meaning to the query☆1,177Updated last year
- There is hardly any theory which is more elementary than linear algebra, in spite of the fact that generations of professors and textbook…☆1,173Updated this week
- AI-powered tools to enhance Anki flashcards with explanations, mnemonics, illustrations, and adaptive learning for medical school and bey…☆782Updated 6 months ago
- A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full…☆621Updated 5 months ago
- Easily train AlphaZero-like agents on any environment you want!☆431Updated last year
- ☆249Updated last year
- R.L. methods and techniques.☆199Updated 9 months ago
- Llama 2 Everywhere (L2E)☆1,523Updated last week
- A reimplementation of Stable Diffusion 3.5 in pure PyTorch☆667Updated 2 months ago
- Things you can do with the token embeddings of an LLM☆1,445Updated 5 months ago
- The Art of Debugging☆929Updated last week
- A Python library to inspect and modify the internal structure of a PDF file☆1,008Updated 2 weeks ago
- Solve Puzzles. Learn Metal 🤘☆581Updated 11 months ago
- From the Tensor to Stable Diffusion, a rough outline for a 1 week course.☆1,071Updated this week
- From the Transistor to the Web Browser, a rough outline for a 12 week course☆231Updated last year
- A JAX research toolkit for building, editing, and visualizing neural networks.☆1,812Updated 2 months ago
- Repository and hands-on workshop on how to develop applications with local LLMs☆401Updated last year
- A deep dive into embeddings starting from fundamentals☆1,029Updated 9 months ago
- Talk to any ArXiv paper using ChatGPT☆531Updated last year
- 📚 Download the full collection of Paul Graham essays in EPUB, PDF & Markdown for easy reading.☆885Updated last month
- NAND is a logic simulator suite made entirely from NAND gates☆578Updated last month
- OpenCV+YOLO+LLAVA powered video surveillance system☆771Updated last week