Trust Region Policy Optimization with Generalized Advantage Estimator
☆16Nov 15, 2018Updated 7 years ago
Alternatives and similar repositories for TRPO-GAE
Users that are interested in TRPO-GAE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 通过python3.6编程,利用DQN算法实现机器学习避开障碍走到迷宫终点。(Through python3.6 programming, I use DQN algorithm to achieve machine learning and avoid obstacles…☆10Apr 15, 2018Updated 8 years ago
- Source code for AdaptSky paper☆11Jan 1, 2023Updated 3 years ago
- A basic program for Python to crawl recruitment position information based on Selenium. Python 基于 Selenium 爬取招聘岗位信息的基础程序☆13Nov 23, 2024Updated last year
- 一款基于DQN算法的牌类游戏AI框架 / An AI framework for card games based on DQN algorithm☆13Jul 25, 2024Updated last year
- 基于Python的选课管理系统☆12Nov 30, 2017Updated 8 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Environments with IC3Net paper☆15Jan 8, 2019Updated 7 years ago
- ☆42Oct 31, 2012Updated 13 years ago
- ☆99Aug 15, 2016Updated 9 years ago
- This reposotory is for a project about Distributed TDMA for Mobile UWB Network Localization☆15Jun 1, 2021Updated 4 years ago
- reimplementation of the ddpg algorithm using tensorflow☆37Oct 17, 2016Updated 9 years ago
- ROMFS文件系统固件解析与提取☆12Dec 24, 2023Updated 2 years ago
- 使用Python的LeetCode解题笔记,详情访问 http://leetcode.xyu.ink/☆11Sep 7, 2021Updated 4 years ago
- FFT Explorations (basic implementation)☆10Aug 8, 2014Updated 11 years ago
- ☆15Apr 14, 2026Updated 3 weeks ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- This is the official implementation of the voxel-based humanoid locomotion in "Gallant: Voxel Grid-based Humanoid Locomotion and Local-na…☆64Apr 24, 2026Updated 2 weeks ago
- Least-squares Reverse Time Migration using 1D scalar wave equation. Very simple and for demonstration purposes only.☆12Sep 4, 2017Updated 8 years ago
- Cordova plugin for Foxit PDF SDK to View PDF Files☆22Aug 1, 2025Updated 9 months ago
- smplify code for point cloud based HMR☆10Jan 11, 2022Updated 4 years ago
- CVPR2022 update everyday!☆11Apr 12, 2022Updated 4 years ago
- Voice Music Separation competing for 6th Huawei Cup in ZJU☆11Jun 2, 2015Updated 10 years ago
- Option Critic with subgoal discovery by spectral decomposition of the Successor Features Matrix or clustering in Successor features space…☆24Nov 29, 2018Updated 7 years ago
- negamax AI algorithm for turn-based games☆13Oct 6, 2019Updated 6 years ago
- Demonstration of Jackknife Variational Inference for Variational Autoencoders, related to ICLR 2018 paper.☆22Feb 21, 2018Updated 8 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- high availability ros master☆17Nov 1, 2019Updated 6 years ago
- ☆10Jan 29, 2019Updated 7 years ago
- Code for "Boosting Semi-supervised Image Segmentation with Global and Local Mutual Information Regularization"☆13Jul 14, 2021Updated 4 years ago
- halcon算 子阈值分割的实现☆13Apr 13, 2018Updated 8 years ago
- Audio Masking Methods☆12Nov 15, 2019Updated 6 years ago
- Here we proposed two novel algorithms, the Direct Subsequence Dynamic Time Warping for nanopore raw signal search (DSDTWnano) and the con…☆10Jul 13, 2020Updated 5 years ago
- A minimal and interpretable Brian2 based DYNAP neuromorphic processor simulator for educational purposes.☆12Jun 23, 2022Updated 3 years ago
- This repository contains the implementation of the PTR algorithm described in the paper: Pre-Training for Robots: Leveraging Diverse Mult…☆32Oct 26, 2022Updated 3 years ago
- Arduino sketch to write to Nano 33 BLE Sense memory using the NVMC☆11Apr 15, 2020Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- 国内首款Java研发的xss跨站脚本漏洞测试平台☆21Dec 16, 2022Updated 3 years ago
- Model-Free Episodic Control☆14Jan 12, 2017Updated 9 years ago
- ☆33Mar 20, 2025Updated last year
- MCMC routines☆12Nov 22, 2022Updated 3 years ago
- ☆13Apr 7, 2026Updated last month
- Code implementation of: "Graying the black box: Understanding DQNs"☆20Feb 23, 2017Updated 9 years ago
- self implementation of DPPO, Distributed Proximal Policy Optimization, by using tensorflow☆12Sep 1, 2017Updated 8 years ago