mike-gimelfarb / deep-successor-features-for-transferView external linksLinks
A reusable framework for successor features for transfer in deep reinforcement learning using keras.
☆48May 11, 2021Updated 4 years ago
Alternatives and similar repositories for deep-successor-features-for-transfer
Users that are interested in deep-successor-features-for-transfer are comparing it to the libraries listed below
Sorting:
- Project on Successor Features in Deep Reinforcement Learning and Transfer Learning☆24Feb 5, 2018Updated 8 years ago
- Deep Successor Representation☆18Mar 6, 2018Updated 7 years ago
- This repository contains implementations of the paper VUSFA☆14Mar 31, 2021Updated 4 years ago
- Bayesian Reward Shaping Framework for Deep Reinforcement Learning☆25Mar 29, 2019Updated 6 years ago
- Probabilistic planning in continuous state-action MDPs in TensorFlow.☆13Jun 21, 2022Updated 3 years ago
- ☆13Mar 14, 2024Updated last year
- Count based exploration with the successor representation for Unity ML's Pyramid☆12Jun 19, 2019Updated 6 years ago
- Domain and problem PDDL parser in C/C++ using Flex & Bison.☆15Jun 18, 2019Updated 6 years ago
- Bayes-Adaptive Monte-Carlo Planning algorithm☆17Mar 5, 2013Updated 12 years ago
- A toolkit for working with RDDL domains in Python3.☆17Nov 7, 2020Updated 5 years ago
- Source code for paper: Efficient deep reinforcement learning via adaptive policy transfer☆15Aug 15, 2022Updated 3 years ago
- Code for the paper "AlwaysSafe: Reinforcement Learning Without Safety Constraint Violations During Training"☆17May 9, 2022Updated 3 years ago
- DBN++ Data Structures and Algorithms in C++ for Dynamic Bayesian Networks☆19Feb 5, 2016Updated 10 years ago
- ☆24Aug 6, 2025Updated 6 months ago
- Reinforcement Learning via Latent State Decoding☆29Jun 12, 2023Updated 2 years ago
- ☆27Oct 25, 2019Updated 6 years ago
- ☆37Apr 22, 2024Updated last year
- ☆35Mar 26, 2025Updated 10 months ago
- A Surrogate Model with Data Augmentation and Deep Transfer Learning for Temperature Field Prediction of Heat Source Layout☆10Nov 25, 2020Updated 5 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization☆32Sep 7, 2021Updated 4 years ago
- ☆30Jun 4, 2022Updated 3 years ago
- 软件测试入门所需的基础知识☆10Jan 18, 2024Updated 2 years ago
- PushWorld: A benchmark for manipulation planning with tools and movable obstacles☆90Jan 13, 2026Updated last month
- A practical step-by-step guide to applying RUDDER☆35Nov 12, 2019Updated 6 years ago
- Code repo for Gradient Temporal-Difference Learning with Regularized Corrections paper.☆37Oct 14, 2020Updated 5 years ago
- ☆359Oct 12, 2022Updated 3 years ago
- Contextual Bandit Spectral Representation Learner☆12Oct 25, 2022Updated 3 years ago
- go + vue + websocket 音乐播放器和聊天室☆12Mar 27, 2022Updated 3 years ago
- Deep Reinforcement Learning based Autonomous Driving Agents☆10Jul 7, 2022Updated 3 years ago
- Vue component to easy select time intervals. Available at npm. 2018☆16Mar 1, 2019Updated 6 years ago
- FE model updating in Python☆11Jul 16, 2021Updated 4 years ago
- Codes for various problems solved using Finite Difference Method and Finite Volume Method.☆12Apr 6, 2016Updated 9 years ago
- Utility to run separate X with discrete nvidia graphics with full performance adapted to work on Debian 9. in a Lenovo Yoga☆11Dec 20, 2018Updated 7 years ago
- ☆10Oct 26, 2022Updated 3 years ago
- Mirror Descent Policy Optimization☆42Oct 31, 2020Updated 5 years ago
- Source code for our journal submission : ELD-Net: An efficient deep learning architecture for accurate saliency detection☆10Nov 27, 2017Updated 8 years ago
- Bayesian PINN codes to solve 2D/3D Navier Stokes for wind fields☆11Dec 12, 2023Updated 2 years ago
- ☆13Jan 12, 2021Updated 5 years ago
- 基于go-zero框架,websocket 示例☆10Apr 21, 2024Updated last year