alessiodm / drl-zh
Deep Reinforcement Learning: Zero to Hero!
☆2,057 Updated 8 months ago
Alternatives and similar repositories for drl-zh
Users who are interested in drl-zh are comparing it to the libraries listed below.
- Implement llama3 inference step by step: grasp the core concepts, master the derivation, and write the code. ☆573 Updated 2 months ago
- ☆588 Updated 2 months ago
- ☆516 Updated last year
- ☆1,353 Updated 3 months ago
- Easily train AlphaZero-like agents on any environment you want! ☆430 Updated last year
- Talk to any ArXiv paper using ChatGPT ☆524 Updated last year
- AI-powered Jupyter Notebook: use local AI to generate and edit code cells, automatically fix errors, and chat with your data ☆1,097 Updated 4 months ago
- Things you can do with the token embeddings of an LLM ☆1,441 Updated last month
- A nanoGPT pipeline packed in a spreadsheet ☆2,113 Updated 11 months ago
- Fine-tune LLM agents with online reinforcement learning ☆1,169 Updated last year
- Grandmaster-Level Chess Without Search ☆574 Updated 4 months ago
- R.L. methods and techniques. ☆185 Updated 6 months ago
- Generate Cool-Looking Mazes and Animations Illustrating the A* Pathfinding Algorithm ☆177 Updated 2 months ago
- ☆179 Updated 5 months ago
- Text compression for generating keyboard expansions ☆1,415 Updated last year
- NAND is a logic simulator suite made entirely from NAND gates ☆561 Updated 3 weeks ago
- A deep dive into embeddings starting from fundamentals ☆1,015 Updated 6 months ago
- Hackers' Guide to Language Models ☆1,836 Updated 5 months ago
- From the Tensor to Stable Diffusion, a rough outline for a 1 week course. ☆1,060 Updated last month
- Run and explore Llama models locally with minimal dependencies on CPU ☆189 Updated 7 months ago
- A BERT that you can train on a (gaming) laptop. ☆208 Updated last year
- Machine Learning Engineering Open Book ☆13,689 Updated last week
- A JAX research toolkit for building, editing, and visualizing neural networks. ☆1,779 Updated 3 weeks ago
- grep for words with similar meaning to the query ☆1,159 Updated 8 months ago
- OpenCV+YOLO+LLAVA powered video surveillance system ☆759 Updated 2 months ago
- A digital twin of the city of Colombo, Sri Lanka, implemented in Cities: Skylines, based on real data. Nearly 1:1 in terms of geography a… ☆314 Updated 8 months ago
- Learn WebAssembly by writing small programs! ☆1,643 Updated last year
- A series of top performing Text to SQL LLMs ☆871 Updated last year
- A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full… ☆610 Updated last month
- Solve Puzzles. Learn Metal 🤘 ☆549 Updated 7 months ago