《动手学强化学习》练习代码(Pytorch)
☆20Sep 16, 2022Updated 3 years ago
Alternatives and similar repositories for learn-lr
Users that are interested in learn-lr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A novel Encoder-Decoder model based on read-first LSTM for air pollutant prediction☆14Apr 23, 2022Updated 4 years ago
- Official GitHub Repository for TRC:Trust Region Conditional Value at Risk for Safe Reinforcement Learning.☆25Nov 24, 2025Updated 6 months ago
- ☆18Sep 16, 2022Updated 3 years ago
- 广工java课设--带图形界面的即时多人聊天程序☆11May 25, 2022Updated 4 years ago
- 为原始的pytorch-drl4vrp代码添加注释和bug修复☆22May 30, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A fun android game to train your brain with some quick math quizes.☆12May 30, 2019Updated 6 years ago
- ☆19May 9, 2023Updated 3 years ago
- ICML2025 | From Feature Interaction to Feature Generation: A Generative Paradigm of CTR Prediction Models☆37Sep 17, 2025Updated 8 months ago
- Algorithm for multiple-shooting differential dynamic programming (MS-DDP) implemented in MATLAB, with a few robotics examples.☆25Apr 10, 2024Updated 2 years ago
- 18年912真题回忆☆11Dec 24, 2018Updated 7 years ago
- ☆19Nov 19, 2021Updated 4 years ago
- MEC task offloading(change ddpg into SAC)☆22Jun 20, 2022Updated 3 years ago
- 基于springboot的开发网站,可以访问书籍的查询和电影音乐的播放,运用技术包括:编程语言java,数据库mysql,缓存redis,maven,前端html,js,css以及一些爬虫技术(书籍来自读书网,音乐来自网易云,电影是在时光电影网站数据进行爬取)☆14Oct 11, 2020Updated 5 years ago
- “2024年中国软件杯A10赛题”,妙笔 —— 基于大小模型的在线文档富文本编辑器,通过结合AI技术,为用户提供了一个全面、高效的文档编辑平台。☆15Apr 23, 2026Updated last month
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- 动手学强化学习代码☆66Jan 17, 2024Updated 2 years ago
- Console Game Application☆20Jan 5, 2022Updated 4 years ago
- Code for paper "Temporal Interest Network for Click-Through Rate Prediction"☆27Dec 4, 2024Updated last year
- This work considers combine multi-tricks with highway network to achieve traffic flow prediction accurately.☆29May 6, 2025Updated last year
- 简易的TCP/IP聊天室程序☆10Oct 13, 2016Updated 9 years ago
- ☆33May 13, 2021Updated 5 years ago
- Various Control Barrier Functions realized on cartpole.☆25Jun 29, 2024Updated last year
- 带禁手的五子棋(中国科学院大学 杨力祥老师的公选课《C++程序设计》大作业)☆17Feb 24, 2022Updated 4 years ago
- Differential Dynamic Programming Solver☆20Oct 31, 2018Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- UAVGym是一个用python编写的GYM风格的无人机仿真环境,用于强化学习算法的研究。☆76Oct 5, 2023Updated 2 years ago
- ☆27Feb 4, 2021Updated 5 years ago
- Homework 5 of Numerical Optimization in Robotics Class of Shenlan Xueyuan☆32Dec 2, 2022Updated 3 years ago
- PBVI C++ Implementation for solving POMDPs☆27Aug 2, 2021Updated 4 years ago
- 李宏毅2020深度学习作业☆38Feb 15, 2021Updated 5 years ago
- 火车票预售系统——数据库课设☆14Jul 3, 2018Updated 7 years ago
- a learning-based decision-making algorithm for on-ramp merging☆33Jan 24, 2024Updated 2 years ago
- ☆46Oct 19, 2022Updated 3 years ago
- Safe Planning with Diffusion Probabilistic Models☆75Apr 30, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Implementations of the iLQR algorithm☆33Mar 26, 2017Updated 9 years ago
- 一个基于JSP+Servlet的房源出租管理系统,适合毕业设计 和 大作业☆17Feb 13, 2024Updated 2 years ago
- CDC2024_submission_repository☆49Jul 28, 2024Updated last year
- Model-Free Safe Reinforcement Learning through Neural Barrier Certificate☆45May 6, 2024Updated 2 years ago
- I build this Mobile Edge Computation simulating environment all by myself, and use the costomized ddpg reinforcement learning algorithm t…☆34Jul 3, 2023Updated 2 years ago
- seq_2_seq text generation based on transformers☆22Feb 18, 2021Updated 5 years ago
- 本项目将研究和开发一款集智能推送、景区信息、出行订票、出行交通、出行天气、旅行翻译以及目的地客流量的实时监测、人数预测和智能分析于一体的智慧型景区App。☆26Jul 11, 2019Updated 6 years ago