TensorFlow implementation of asynchronous advantage actor-critic (A3C)
☆38 · Oct 20, 2021 · Updated 4 years ago
Alternatives and similar repositories for ocd-a3c
Users that are interested in ocd-a3c are comparing it to the libraries listed below.
- Simple Example A3C Reinforcement Learning Algorithm in Tensorflow ☆13 · May 23, 2017 · Updated 8 years ago
- ☆13 · Jun 23, 2017 · Updated 8 years ago
- Tensorflow implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning". ☆25 · Apr 20, 2017 · Updated 9 years ago
- A gym environment for Stuart Armstrong's model of a treacherous turn. ☆18 · Jul 28, 2018 · Updated 7 years ago
- Implementation of Diversity Is All You Need (DIAYN) on top of Stable Baselines 3. ☆13 · Jul 11, 2022 · Updated 3 years ago
- Keras implementation of guide actor-critic for continuous control ☆11 · Mar 12, 2018 · Updated 8 years ago
- Models and training scripts for the English, German and Russian MAGEC systems described in R. Grundkiewicz, M. Junczys-Dowmunt: Minimally… ☆12 · Jul 7, 2021 · Updated 4 years ago
- A PyTorch rewrite of the motion-imitation project from google-research ☆10 · Sep 30, 2022 · Updated 3 years ago
- Go through the list of accepted papers for ICLR in terminal and add them to your reading list. ☆13 · Jan 30, 2021 · Updated 5 years ago
- PyTorch implementation of Munchausen Reinforcement Learning based on DQN and SAC. Handles discrete and continuous action spaces ☆15 · Oct 3, 2021 · Updated 4 years ago
- ☆13 · May 29, 2018 · Updated 7 years ago
- Deep Reinforcement Learning Algorithms Implementation in PyTorch ☆27 · Feb 11, 2025 · Updated last year
- Demo for the subjective interface ☆14 · Mar 4, 2018 · Updated 8 years ago
- PyTorch Implementation of the Maximum a Posteriori Policy Optimisation ☆84 · Nov 19, 2022 · Updated 3 years ago
- A Python implementation of a graph-based parser for Abstract Meaning Representation (AMR) ☆11 · Feb 2, 2018 · Updated 8 years ago
- Port of pybullet envs to gymnasium ☆18 · Mar 4, 2025 · Updated last year
- OpenAI Gym environment for DART robotics simulator. ☆22 · Apr 17, 2018 · Updated 8 years ago
- [AAMAS 2023] Code for the paper "Automatic Noise Filtering with Dynamic Sparse Training in Deep Reinforcement Learning" ☆12 · Feb 22, 2024 · Updated 2 years ago
- Implementation for ACER in tensorflow and sonnet by deepmind ☆11 · Aug 28, 2017 · Updated 8 years ago
- Source code for "A deep dive into reinforcement learning" ☆13 · Dec 17, 2019 · Updated 6 years ago
- ☆12 · Dec 2, 2020 · Updated 5 years ago
- Implementation of the two-step-task as described in "Prefrontal cortex as a meta-reinforcement learning system" and "Learning to Reinforc… ☆60 · Mar 28, 2019 · Updated 7 years ago
- Matrix exponential in cuda for pytorch and tensorflow ☆17 · Nov 26, 2018 · Updated 7 years ago
- Code for the Black-DROPS algorithm: "Black-Box Data-efficient Policy Search for Robotics", IROS 2017/ICRA 2018 ☆66 · Nov 17, 2021 · Updated 4 years ago
- Imagination Augmented Agents in TensorFlow ☆20 · Oct 21, 2018 · Updated 7 years ago
- Part-of-speech tagger implemented using a feedforward network in TensorFlow ☆14 · Jan 15, 2018 · Updated 8 years ago
- my public website ☆12 · Aug 17, 2024 · Updated last year
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT) ☆30 · Sep 16, 2022 · Updated 3 years ago
- A2C is a special case of PPO! ☆22 · May 20, 2022 · Updated 3 years ago
- Paper notes for my PhD on Machine Learning (mostly focused on Reinforcement Learning) ☆17 · Jul 22, 2019 · Updated 6 years ago
- Simple Tensorflow implementation of "MirrorGAN: Learning Text-to-image Generation by Redescription" (CVPR 2019) ☆15 · Mar 23, 2020 · Updated 6 years ago
- Code for the paper "3D Human Pose Estimation with Siamese Equivariant Embedding" ☆19 · Sep 26, 2018 · Updated 7 years ago
- Easy TensorFlow logging for quick prototypes ☆110 · Oct 20, 2021 · Updated 4 years ago
- Snake Robot Reinforcement Learning Environment ☆20 · Jun 8, 2021 · Updated 4 years ago
- some tutorials for blog: simonjisu.github.io ☆23 · Mar 25, 2021 · Updated 5 years ago
- Pytorch2Jax is a small Python library that provides functions that wraps PyTorch models into Jax functions and Flax modules. ☆21 · Feb 20, 2023 · Updated 3 years ago
- A simple image classification test using Core ML and Inception V3 model in Objective-C ☆22 · Nov 24, 2017 · Updated 8 years ago
- Library for model based RL in robotics ☆37 · Sep 10, 2018 · Updated 7 years ago
- [NeurIPS'24] "NeuralFuse: Learning to Recover the Accuracy of Access-Limited Neural Network Inference in Low-Voltage Regimes" by Hao-Lun … ☆10 · Sep 18, 2025 · Updated 7 months ago