Trust Region Policy Optimization with Generalized Advantage Estimator
☆16Nov 15, 2018Updated 7 years ago
Alternatives and similar repositories for TRPO-GAE
Users that are interested in TRPO-GAE are comparing it to the libraries listed below.
- A Python 3.6 program that uses the DQN algorithm to train an agent to avoid obstacles and reach the end of a maze.☆10Apr 15, 2018Updated 8 years ago
- Source code for AdaptSky paper☆11Jan 1, 2023Updated 3 years ago
- A basic Python program that uses Selenium to crawl job listing information.☆13Nov 23, 2024Updated last year
- An AI framework for card games based on the DQN algorithm☆13Jul 25, 2024Updated last year
- ☆15Apr 30, 2025Updated 11 months ago
- A Python-based course selection management system☆12Nov 30, 2017Updated 8 years ago
- Chinese-English translation with a Transformer (demo)☆17Aug 25, 2023Updated 2 years ago
- Reimplementation of ToMNet with some extensions for RL as well☆14Apr 28, 2018Updated 7 years ago
- ☆18Dec 5, 2024Updated last year
- Environments with IC3Net paper☆15Jan 8, 2019Updated 7 years ago
- ☆11May 12, 2021Updated 4 years ago
- Examples of MolScore implementations☆12May 30, 2024Updated last year
- ☆42Oct 31, 2012Updated 13 years ago
- Python implementation of the paper "Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation"☆10Mar 27, 2018Updated 8 years ago
- ☆101Aug 15, 2016Updated 9 years ago
- This repository is for a project about Distributed TDMA for Mobile UWB Network Localization☆15Jun 1, 2021Updated 4 years ago
- Reimplementation of the DDPG algorithm using TensorFlow☆37Oct 17, 2016Updated 9 years ago
- Simple GStreamer test programs for learning purposes.☆13Jul 27, 2013Updated 12 years ago
- Constrained Exploration and Recovery from Experience Shaping☆22Apr 18, 2019Updated 7 years ago
- ROMFS file system firmware parsing and extraction☆12Dec 24, 2023Updated 2 years ago
- ☆17Jan 10, 2024Updated 2 years ago
- Gstreamer, Qt, RTSP server☆15Sep 7, 2018Updated 7 years ago
- Summary of Paper Survey☆15Oct 16, 2019Updated 6 years ago
- Active learning workflow to train and fine-tune molecular property predictors with chemist feedback for goal-oriented molecule generation…☆15Apr 25, 2025Updated 11 months ago
- LeetCode solution notes in Python; for details, visit http://leetcode.xyu.ink/☆11Sep 7, 2021Updated 4 years ago
- FFT Explorations (basic implementation)☆10Aug 8, 2014Updated 11 years ago
- ☆15Updated this week
- Generative Adversarial Network: Optimization in Targeted Design☆16Apr 12, 2022Updated 4 years ago
- Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D…☆18Nov 8, 2024Updated last year
- AC3ESBrowser is a tool for analyzing ac3/eac3 bitstreams☆12Feb 27, 2015Updated 11 years ago
- Least-squares Reverse Time Migration using 1D scalar wave equation. Very simple and for demonstration purposes only.☆11Sep 4, 2017Updated 8 years ago
- Cordova plugin for Foxit PDF SDK to View PDF Files☆22Aug 1, 2025Updated 8 months ago
- SMPLify code for point-cloud-based HMR☆10Jan 11, 2022Updated 4 years ago
- CVPR2022 update everyday!☆11Apr 12, 2022Updated 4 years ago
- Voice Music Separation competing for 6th Huawei Cup in ZJU☆11Jun 2, 2015Updated 10 years ago
- 🎸 Scaffold AI-friendly project structures for Vibe Coding☆53Jan 21, 2026Updated 2 months ago
- Code for "RGFN: Synthesizable Molecular Generation Using GFlowNets" (NeurIPS 2024)☆29Jun 9, 2025Updated 10 months ago
- Option Critic with subgoal discovery by spectral decomposition of the Successor Features Matrix or clustering in Successor features space…☆24Nov 29, 2018Updated 7 years ago
- Demonstration of Jackknife Variational Inference for Variational Autoencoders, related to ICLR 2018 paper.☆22Feb 21, 2018Updated 8 years ago