waylandzhang/learn-reinforcement-learning

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/waylandzhang/learn-reinforcement-learning)

waylandzhang / learn-reinforcement-learning

《Reinforcement Learning》读书学习与视频分享笔记

☆79

Alternatives and similar repositories for learn-reinforcement-learning

Users that are interested in learn-reinforcement-learning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

waylandzhang / Transformer-from-scratch
View on GitHub
☆540Apr 29, 2024Updated 2 years ago
waylandzhang / llm-transformer-book
View on GitHub
《Transformer系列视频的手稿整理》
☆105Jan 4, 2026Updated 6 months ago
waylandzhang / embedding_from_scratch
View on GitHub
训练自己的中文 Embedding 模型
☆30Jan 6, 2025Updated last year
LUMIA-Group / APL
View on GitHub
The official implementation of the paper "Asymmetric Polynomial Loss for Multi-Label Classification"(ICASSP 2023)
☆20Apr 5, 2023Updated 3 years ago
Kocoro-lab / ai-agent-book
View on GitHub
《From Concept to Production: Framework-Agnostic AI Agent Architecture Patterns》
☆305Apr 9, 2026Updated 3 months ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
Zessay / sohu_2019
View on GitHub
2019搜狐第三届内容识别挑战赛rank10
☆11Oct 17, 2019Updated 6 years ago
DTennant / dual-rank-ncd
View on GitHub
Novel Visual Category Discovery with Dual Ranking Statistics and Mutual Knowledge Distillation. Bingchen Zhao and Kai Han. (NeurIPS 2021)
☆12Aug 20, 2023Updated 2 years ago
waylandzhang / ai-quant-book
View on GitHub
《AI Quant Trading - From Zero to One》
☆422Feb 14, 2026Updated 5 months ago
wxhdf / MRM
View on GitHub
A Multi-Resolution Mutual Learning Network for Multi-Label ECG Classification
☆13Mar 14, 2025Updated last year
mlbio-epfl / falcon
View on GitHub
[ICML 2024] Fine-Grained Classes and How to Find Them
☆14Jun 21, 2024Updated 2 years ago
TencentAILabHealthcare / IIB-MIL
View on GitHub
☆11Jul 21, 2023Updated 3 years ago
jeanne-wang / octree_network_for_3d_semantic_reconstruction
View on GitHub
This repository maintains the code for my master thesis "learn semantic 3d reconstruction on octree"
☆13May 8, 2019Updated 7 years ago
dujh22 / AiMed
View on GitHub
AiMed面向中文医学的人工智能大语言模型期望实现有效处理医学知识问答、医学论文阅读、医学文献检索等任务和在医学科研中的应用。
☆13Feb 8, 2025Updated last year
CodeDuoGun / deepseek_lora
View on GitHub
基于deepseek、qwen3大模型，lora sft 医疗行业数据
☆15Apr 10, 2026Updated 3 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
vluz / QRCodeGenSD
View on GitHub
A gradio app that uses a stable diffusion model to generate novel QR codes
☆12Jun 21, 2023Updated 3 years ago
waylandzhang / DeepSeek-RL-Qwen-0.5B-GRPO-gsm8k
View on GitHub
☆86Feb 3, 2025Updated last year
ekansh09 / LRH-Net
View on GitHub
Official Implementation of LRH-Net: A Multi-Level Knowledge Distillation Approach for Low-Resource Heart Network
☆20Nov 11, 2023Updated 2 years ago
Nexuslkl / Swin_MIL
View on GitHub
☆14Jun 2, 2023Updated 3 years ago
ModelBunker / StarNet-PyTorch
View on GitHub
StarNet: Targeted Computation for Object Detection in Point Clouds
☆14Jan 28, 2020Updated 6 years ago
VILAN-Lab / MLGCN-DP
View on GitHub
☆14Dec 10, 2024Updated last year
DLR-RM / UMF
View on GitHub
☆23Apr 29, 2025Updated last year
TommyHuang821 / DeepLearning-MachineLearning
View on GitHub
☆16Dec 27, 2017Updated 8 years ago
ttnghia / Banana
View on GitHub
My (deprecated) personal C++ codebase.
☆12Jan 10, 2019Updated 7 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
FatimaZulfiqar / multi-class-brain-tumor-classification
View on GitHub
This repository contains the code of the paper Multi-class classification of brain tumor types from MR Images using EfficientNets
☆14Aug 22, 2025Updated 11 months ago
JerryKingQAQ / AEEG-PI-CL
View on GitHub
[TIM 2024] The official implementation for the paper "Affective EEG-based Person Identification with Continual Learning"
☆15Aug 12, 2024Updated last year
Yazooliu / agent_from_0t1
View on GitHub
手把手带你从0到1实现大模型agent
☆124Jun 28, 2024Updated 2 years ago
mmahdavian / STPOTR
View on GitHub
Human Pose and Hip Trajectory Prediction Using Transformers
☆16Oct 11, 2023Updated 2 years ago
etzinis / optimal_condition_training
View on GitHub
Code and data recipes for the paper: Optimal Condition Training for Target Source Separation by Efthymios Tzinis, Gordon Wichern, Paris S…
☆14Feb 15, 2023Updated 3 years ago
danikiyasseh / CLOPS
View on GitHub
[Nature Communications 2021] Continual learning of AI models on ECG data with CLOPS
☆21Oct 19, 2022Updated 3 years ago
karasHou / DataBase-Simulation
View on GitHub
一个用java实现的简单数据库模拟系统。
☆16Jan 19, 2018Updated 8 years ago
yuankeyi / 2019-SOHU-Contest
View on GitHub
2019年4月8日，第三届搜狐校园内容识别算法大赛。
☆26May 14, 2019Updated 7 years ago
ZhengzeZhou / slime
View on GitHub
S-LIME: Stabilized-LIME for model explanation (KDD 2021). pip install stabilized-lime
☆16Mar 7, 2026Updated 4 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
btaille / contener
View on GitHub
Code for "Contextualized Embeddings in Named-Entity Recognition", ECIR 2020
☆13Jul 25, 2024Updated 2 years ago
gmum / Kernel_SA-AbMILP
View on GitHub
☆18Feb 13, 2023Updated 3 years ago
LLouice / Sohu2019
View on GitHub
2019搜狐校园算法大赛
☆27May 22, 2019Updated 7 years ago
Sharpiless / play-Pacman-with-gesture-recognition-by-resnet18
View on GitHub
手把手教你用PaddleX训练手势识别模型，并将该模型用于吃豆豆游戏
☆16Mar 7, 2021Updated 5 years ago
engyasin / EKF-MonoSLAM_for_3D-reconstruction
View on GitHub
Using MonoSLAM as starting step for 3D-reconstruction
☆11Aug 23, 2020Updated 5 years ago
AllenAnthony / MiniSQL
View on GitHub
这是我自己用C++写的一个小型数据库，其中包括语义分析，存储管理，检索优化（主要用到了B+树，B+树也是自己写的）
☆16Mar 5, 2017Updated 9 years ago
Kamaleswaran-Lab / Clin-JEPA
View on GitHub
☆18Jun 15, 2026Updated last month