junnannie/RL

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/junnannie/RL)

junnannie / RL

上海交通大学《动手学强化学习》课程笔记，完成了所有算法实现，包括但不限于 Actor-Critic、PPO、DDPG、DQN等

☆47

Alternatives and similar repositories for RL

Users that are interested in RL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

jasongief / TGS-Agent
View on GitHub
[2026 AAAI] Think Before You Segment: An Object-aware Reasoning Agent for Referring Audio-Visual Segmentation
☆20Nov 8, 2025Updated 8 months ago
simongu20070911 / quantitative-pricing-agents
View on GitHub
☆26Dec 2, 2025Updated 7 months ago
Max1musy / ModbusSharp
View on GitHub
Modbus TCP, Modbus UDP, Modbus Ascii and Modbus RTU client/server library for .NET implementations
☆12May 29, 2025Updated last year
Zmy6 / UNeLF
View on GitHub
☆14Oct 27, 2025Updated 8 months ago
Yifei-Zuo / FlashLLA
View on GitHub
Official repository Flash Local Linear Attention
☆37May 28, 2026Updated last month
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
Hyizhou1 / trx
View on GitHub
tg机器人 trx兑换、能量租赁、trx闪兑自动回能量，-完整功能 https://t.me/hongsx
☆52Mar 17, 2026Updated 3 months ago
cranbs / Awesome-Flare-Removal
View on GitHub
Collection of recent flare removal / glare removal works, including datasets, papers and codes.
☆39Updated this week
lixiaoyu2000 / HAT
View on GitHub
Official Repo For AAAI 2026 Accepted Paper "Rethinking the Spatio-Temporal Alignment of End-to-End 3D Perception"
☆32Mar 25, 2026Updated 3 months ago
libdriver / aht30
View on GitHub
AHT30 full-featured driver library for general-purpose MCU and Linux.
☆14Oct 25, 2025Updated 8 months ago
ZyLi99 / BirneStore
View on GitHub
☆13Sep 14, 2022Updated 3 years ago
WuJH2001 / LocalDyGS
View on GitHub
[ICCV’2025] LocalDyGS : Multi-view Global Dynamic Scene Modeling through Adaptive Local Feature Decoupling
☆123May 3, 2026Updated 2 months ago
hs-knowledge-base / hs-knowledge-base
View on GitHub
一个集文档、代码实践于一体的技术知识库平台。包含文档、代码编辑、管理后台等5个应用的monorepo项目。采用Next.js、NestJS等现代技术栈，为开发者提供学习和实践平台。
☆17Jul 21, 2025Updated 11 months ago
wangzhaode / tokenizer.cpp
View on GitHub
A lightweight, production-ready C++ library for LLM tokenization, fully compatible with HuggingFace tokenizer.json.
☆31Jan 4, 2026Updated 6 months ago
xiaotaiyangcmm / DSPO
View on GitHub
☆138Jun 24, 2025Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
SSCT-Lab / NLPLego
View on GitHub
☆23Nov 27, 2024Updated last year
sablin39 / tilelang-cuda-skills
View on GitHub
Skills for writing tilelang and debugging with CUDA toolkits.
☆129May 20, 2026Updated last month
ZJU-DAILY / ST2Vec
View on GitHub
Source code for Spatio-Temporal Trajectory Similarity Learning in Road Networks. KDD 2022.
☆72Nov 6, 2022Updated 3 years ago
alphagammamle / John-Rust-1987-Python
View on GitHub
Recreation of the "Optimal Replacement of GMC Bus Engines" paper by J. Rust, describing a single-agent dynamic optimization model.
☆28Apr 2, 2017Updated 9 years ago
facebookresearch / VLM3
View on GitHub
Official implementation of paper "VLM³: Vision Language Models Are Native 3D Learners".
☆381Jun 1, 2026Updated last month
evelyyyyynnnnn / 2.0-Healthcare-Ai-Systems
View on GitHub
Machine learning and decision intelligence models designed to improve healthcare safety through clinical risk prediction and medical inte…
☆194Mar 19, 2026Updated 3 months ago
carsonpo / quadmul
View on GitHub
a fast and customizable CUDA int4 tensor core gemm
☆15Aug 2, 2024Updated last year
ChuanMeng / text-ranking-in-deep-research
View on GitHub
Official repository for the SIGIR 2026 paper "Revisiting Text Ranking in Deep Research"
☆203Apr 8, 2026Updated 3 months ago
louiszengCN / lidar_camera_auto_calibration
View on GitHub
A method to automatically calibrate lidar and camera
☆21Jun 11, 2024Updated 2 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
caokaifa / MPC
View on GitHub
☆31Jun 19, 2022Updated 4 years ago
xinli2 / Reading-Files
View on GitHub
☆18Oct 30, 2021Updated 4 years ago
FrankSuperG / CPG-SPMT
View on GitHub
CPG-SPMT: Control-oriented Parameter-Grouped Single Particle Model with Thermal effects
☆79Apr 22, 2026Updated 2 months ago
Tyrant-sudo / BEM_Surrogate_Propeller
View on GitHub
the Propeller calculation and optimization by a Surrogate model
☆45May 31, 2024Updated 2 years ago
Orange-3DV-Team / MoZoo
View on GitHub
☆130May 15, 2026Updated last month
MediaX-SJTU / FineVQ
View on GitHub
Official implement of FineVQ: Fine-Grained User Generated Content Video Quality (CVPR2025 Highlight)
☆23Jul 8, 2025Updated last year
DaoyuanLi2816 / labelbank
View on GitHub
Retrieve + rerank over a closed label bank: LLM bi-encoders with self-mined hard negatives and a generative listwise reranker. Generalize…
☆20Updated this week
zihaozeng0021 / DouDiZhuAssistant
View on GitHub
☆98Apr 10, 2026Updated 3 months ago
melonedo / algebraic-layouts
View on GitHub
☆23Aug 20, 2025Updated 10 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
IST-DASLab / gemm-fp8
View on GitHub
High Performance FP8 GEMM Kernels for SM89 and later GPUs.
☆21Jan 24, 2025Updated last year
George-Hotz / deepseek-r1-1.5b-deploy-on-rk3588
View on GitHub
在rk3588平台利用rkllmrt的api实现deepseek-r1-1.5b蒸馏模型的部署
☆16Feb 22, 2025Updated last year
hhyqhh / inno-agent
View on GitHub
☆226Updated this week
junnannie / LLaVA
View on GitHub
基于 Qwen2-0.5B 以及 SigLIP 实现的轻量化多模态风格化问答大模型
☆32Aug 8, 2025Updated 11 months ago
neu-vi / LASER
View on GitHub
[CVPR 2026] Layer-wise Scale Alignment for Training-Free Streaming 4D Reconstruction
☆74Mar 18, 2026Updated 3 months ago
mowanying123 / OKX-tradeview
View on GitHub
A tradeview strategy trading bot for OKX
☆20Sep 13, 2024Updated last year
ZhihaoZhu / cap-vlm
View on GitHub
Perceive, Predict, Verify: Continual Pre-training for Multimodal Agentic Foundation Models
☆82Apr 1, 2026Updated 3 months ago