Chapter 13, "Learning to Run", of the book Deep Reinforcement Learning: a code example that solves the NIPS 2017 Learning to Run challenge with a parallelized Soft Actor-Critic (SAC) algorithm.
☆13 · Jul 4, 2021 · Updated 4 years ago
Alternatives and similar repositories for Chapter13-Learning-to-Run
Users that are interested in Chapter13-Learning-to-Run are comparing it to the libraries listed below
- Chapter 15, "AlphaZero", of the book Deep Reinforcement Learning: code example of AlphaZero solving the game of Gomoku. ☆36 · Feb 18, 2020 · Updated 6 years ago
- DQN example code for Chapter 4 ☆44 · Mar 24, 2023 · Updated 2 years ago
- ☆69 · Jan 24, 2024 · Updated 2 years ago
- ☆13 · Apr 29, 2023 · Updated 2 years ago
- Click Me --> ☆32 · Mar 3, 2023 · Updated 3 years ago
- Backup of articles from sbb4891 ☆11 · Sep 11, 2016 · Updated 9 years ago
- Contains implementation of the DoubIL and ResiduIL algorithms from the ICML '22 paper Causal Imitation Learning under Temporally Correlat… ☆11 · Dec 9, 2022 · Updated 3 years ago
- Basic template for using Flan-t5 on Banana's serverless GPU platform. Ready for 1-Click deploy ☆11 · Jan 30, 2023 · Updated 3 years ago
- ☆10 · Mar 8, 2024 · Updated last year
- Convert PaddlePaddle code to TensorLayerX code ☆10 · Feb 10, 2023 · Updated 3 years ago
- Reproduce analyses in Harmony Manuscript ☆11 · Feb 21, 2020 · Updated 6 years ago
- Common support code for user-facing front end systems. ☆12 · Feb 24, 2026 · Updated last week
- ☆11 · Dec 11, 2024 · Updated last year
- Urban Generative Intelligence (UGI): A Foundational Platform for Embodied Agent and Future City ☆12 · Dec 17, 2023 · Updated 2 years ago
- Trial version for prs platform (python project). Please note that the complete experience requires downloading the Unity resource. ☆10 · Jun 26, 2024 · Updated last year
- ☆12 · Jan 14, 2026 · Updated last month
- [AAAI 2026] AutoTool: Efficient Tool Selection for Large Language Model Agents ☆29 · Dec 28, 2025 · Updated 2 months ago
- Tool to bridge Blender animation and physics-based robotic simulation ☆16 · Updated this week
- ☆10 · Dec 29, 2020 · Updated 5 years ago
- Jean Gallier’s Algebra, Topology, Differential Calculus, and Optimization Theory for Computer Science and Machine Learning Chinese versio… ☆11 · Apr 16, 2020 · Updated 5 years ago
- A Prot paper related materials ☆11 · Sep 5, 2022 · Updated 3 years ago
- Reproducibility for the "Harmonization and Annotation of Single-cell Transcriptomics data with Deep Generative Models" paper ☆13 · Jul 15, 2022 · Updated 3 years ago
- ☆12 · Mar 21, 2024 · Updated last year
- Material for the RNAseq course ☆10 · Aug 8, 2019 · Updated 6 years ago
- I personally studied VINS-MONO and commented in Korean. ☆12 · Jan 3, 2024 · Updated 2 years ago
- Implementation of Diffusion Policy ☆13 · Dec 13, 2024 · Updated last year
- Explore and Control with Adversarial Surprise ☆10 · Jul 20, 2021 · Updated 4 years ago
- Small and simple SGF parser for Python ☆10 · Jan 23, 2025 · Updated last year
- Python functions for retrieving data from the MediaWiki/Wikipedia API ☆15 · Feb 19, 2026 · Updated last week
- PyTorch implementation of the paper "Human Mobility Prediction with Causal and Spatial-constrained Multi-task Network" ☆12 · Mar 19, 2024 · Updated last year
- Learning notes accumulated since first starting to use Vim, plus a few personal one-click Linux configuration scripts ☆10 · Aug 26, 2021 · Updated 4 years ago
- Telegram Bot that does everything ☆10 · May 17, 2016 · Updated 9 years ago
- Tracker blocking lists based on the DuckDuckGo Tracker Radar provided in the popular EasyList format and thus suitable for inclusion in e… ☆13 · May 17, 2022 · Updated 3 years ago
- Solves the traffic assignment problem for AV and CV traffic flows using a user equilibrium model based on the MSA method ☆17 · Sep 3, 2022 · Updated 3 years ago
- LLM Application Systems for Education ☆11 · May 16, 2025 · Updated 9 months ago
- A Platform-agnostic Computer Vision Application Library ☆12 · Jul 24, 2025 · Updated 7 months ago
- ☆14 · Aug 3, 2022 · Updated 3 years ago
- Discord bot for using OpenAI's image generator DALL-E ☆12 · May 22, 2024 · Updated last year
- KAM and KFM Estimation via IMU-Camera Fusion ☆12 · May 26, 2024 · Updated last year