上海交通大学《动手学强化学习》课程笔记,完成了所有算法实现,包括但不限于 Actor-Critic、PPO、DDPG、DQN等
☆47Mar 19, 2025Updated last year
Alternatives and similar repositories for RL
Users that are interested in RL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [2026 AAAI] Think Before You Segment: An Object-aware Reasoning Agent for Referring Audio-Visual Segmentation☆20Nov 8, 2025Updated 7 months ago
- LaTeX中文模板收集☆32Aug 15, 2018Updated 7 years ago
- ☆26Dec 2, 2025Updated 6 months ago
- Modbus TCP, Modbus UDP, Modbus Ascii and Modbus RTU client/server library for .NET implementations☆12May 29, 2025Updated last year
- SmartCLIP: A training method to improve CLIP with both short and long texts☆42Jun 18, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆14Oct 27, 2025Updated 7 months ago
- Official repository Flash Local Linear Attention☆37May 28, 2026Updated 3 weeks ago
- tg机器人 trx兑换、能量租赁、trx闪兑自动回能量,-完整功能 https://t.me/hongsx☆52Mar 17, 2026Updated 3 months ago
- Collection of recent flare removal / glare removal works, including datasets, papers and codes.☆39Mar 11, 2026Updated 3 months ago
- Official Repo For AAAI 2026 Accepted Paper "Rethinking the Spatio-Temporal Alignment of End-to-End 3D Perception"☆32Mar 25, 2026Updated 2 months ago
- AHT30 full-featured driver library for general-purpose MCU and Linux.☆14Oct 25, 2025Updated 7 months ago
- ☆13Sep 14, 2022Updated 3 years ago
- [ICCV’2025] LocalDyGS : Multi-view Global Dynamic Scene Modeling through Adaptive Local Feature Decoupling☆123May 3, 2026Updated last month
- 一个集文档、代码实践于一体的技术知识库平台。包含文档、代码编辑、管理后台等5个应用的monorepo项目。采用Next.js、NestJS等现代技术栈,为开发者提供学习和实践平台。☆17Jul 21, 2025Updated 10 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A lightweight, production-ready C++ library for LLM tokenization, fully compatible with HuggingFace tokenizer.json.☆31Jan 4, 2026Updated 5 months ago
- Machine learning and decision intelligence models designed to improve healthcare safety through clinical risk prediction and medical inte…☆154Mar 19, 2026Updated 3 months ago
- ☆138Jun 24, 2025Updated 11 months ago
- ☆23Nov 27, 2024Updated last year
- [ICCV 2025] Towards Omnimodal Expressions and Reasoning in Referring Audio-Visual Segmentation☆92Sep 29, 2025Updated 8 months ago
- Skills for writing tilelang and debugging with CUDA toolkits.☆126May 20, 2026Updated last month
- ✔(已完结)最全面的 深度学习 笔记【土堆 Pytorch】【李沐 动手学深度学习】【吴恩达 深度学习】☆29Jan 29, 2024Updated 2 years ago
- Source code for Spatio-Temporal Trajectory Similarity Learning in Road Networks. KDD 2022.☆72Nov 6, 2022Updated 3 years ago
- [CVPR 2025] 🔥 Official impl. of "Audio-Visual Instance Segmentation".☆48Jun 5, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Recreation of the "Optimal Replacement of GMC Bus Engines" paper by J. Rust, describing a single-agent dynamic optimization model.☆28Apr 2, 2017Updated 9 years ago
- Official implementation of paper "VLM³: Vision Language Models Are Native 3D Learners".☆308Jun 1, 2026Updated 2 weeks ago
- Official repository for the SIGIR 2026 paper "Revisiting Text Ranking in Deep Research"☆202Apr 8, 2026Updated 2 months ago
- a fast and customizable CUDA int4 tensor core gemm☆15Aug 2, 2024Updated last year
- A method to automatically calibrate lidar and camera☆21Jun 11, 2024Updated 2 years ago
- CPG-SPMT: Control-oriented Parameter-Grouped Single Particle Model with Thermal effects☆79Apr 22, 2026Updated last month
- ☆18Oct 30, 2021Updated 4 years ago
- the Propeller calculation and optimization by a Surrogate model☆45May 31, 2024Updated 2 years ago
- Official implement of FineVQ: Fine-Grained User Generated Content Video Quality (CVPR2025 Highlight)☆23Jul 8, 2025Updated 11 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆129May 15, 2026Updated last month
- Retrieve + rerank over a closed label bank: LLM bi-encoders with self-mined hard negatives and a generative listwise reranker. Generalize…☆20Jun 11, 2026Updated last week
- ☆97Apr 10, 2026Updated 2 months ago
- [CVPR 2026] Layer-wise Scale Alignment for Training-Free Streaming 4D Reconstruction☆67Mar 18, 2026Updated 3 months ago
- ☆23Aug 20, 2025Updated 10 months ago
- High Performance FP8 GEMM Kernels for SM89 and later GPUs.☆21Jan 24, 2025Updated last year
- 在rk3588平台利用rkllmrt的api实现deepseek-r1-1.5b蒸馏模型的部署☆16Feb 22, 2025Updated last year