hercky / ACER_tfView external linksLinks
Implementation for ACER in tensorflow and sonnet by deepmind
☆11Aug 28, 2017Updated 8 years ago
Alternatives and similar repositories for ACER_tf
Users that are interested in ACER_tf are comparing it to the libraries listed below
Sorting:
- machine learning project using DeepMind's PySc2☆12Aug 29, 2017Updated 8 years ago
- reinforcement learning. policy gradient. PCL☆37Apr 25, 2017Updated 8 years ago
- Variation of "Asynchronous Methods for Deep Reinforcement Learning" with multiple processes generating experience for agent (Keras + Thea…☆44Feb 27, 2018Updated 7 years ago
- Keras implementation of DQN on ViZDoom environment☆54Oct 16, 2016Updated 9 years ago
- Actor-critic with experience replay☆256Oct 9, 2022Updated 3 years ago
- Tensorflow Implementation for "Noisy network for exploration"☆31Jul 17, 2017Updated 8 years ago
- Tensorflow implementation of the map reading algorithm described in ‘Teaching a Machine to Read Maps with Deep Reinforcement Learning’☆32Nov 14, 2017Updated 8 years ago
- Combining deep learning and reinforcement learning.☆81Oct 14, 2021Updated 4 years ago
- ☆32Apr 27, 2017Updated 8 years ago
- An implementation of the A3C deep reinforcement learning method using a LSTM layer. Created with Tensorflow.☆29Oct 18, 2017Updated 8 years ago
- Implementing expectimax, alpha-beta pruning, and minimax algorithms in a game of Pacman☆11Jan 17, 2014Updated 12 years ago
- Stochastic Markov Games☆12Oct 5, 2017Updated 8 years ago
- A Pytorch Implementation of Multi Agent Soft Actor Critic☆43Jan 29, 2019Updated 7 years ago
- ☆13Nov 20, 2023Updated 2 years ago
- QWOP AI using Q-learning☆12Jul 13, 2016Updated 9 years ago
- Experiment utility code, specifically designed for use with Compute Canada.☆11Jan 27, 2025Updated last year
- A2C, ACKTR and A2T implementations for ViZDoom☆10Dec 18, 2017Updated 8 years ago
- Maddpg_flight code☆11Jul 4, 2018Updated 7 years ago
- BitmapScaler with different scaling algorhytms based on jxl-coder from awxkee☆11Jan 8, 2024Updated 2 years ago
- WIP — OpenSwiftUI is an OpenSource implementation of Apple's SwiftUI DSL.☆10Feb 28, 2020Updated 5 years ago
- ☆12May 14, 2024Updated last year
- An implementation of DreamerV2 written in JAX, with support for running multiple random seeds of an experiment on a single GPU.☆18Jan 16, 2023Updated 3 years ago
- AI CUP 2024 RAG☆13Nov 19, 2024Updated last year
- Simulation system for path planning evaluation☆14Dec 13, 2025Updated 2 months ago
- Exploration Strategies for Deep Reinforcement Learning☆39Oct 31, 2018Updated 7 years ago
- reinfore learning tool box, contains trpo, a3c algorithm for continous action space☆42Jan 27, 2018Updated 8 years ago
- A simple cache that can hold anything, including Swift items☆13Jan 31, 2017Updated 9 years ago
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format.☆14Dec 21, 2021Updated 4 years ago
- ☆17Oct 10, 2025Updated 4 months ago
- A TypeSpec Emitter creating Typescript from Models and generating a structured routes object for HTTP APIs.☆14Jan 30, 2026Updated 2 weeks ago
- ☆15May 24, 2021Updated 4 years ago
- Hierarchical state machine framework in Swift.☆11Nov 2, 2022Updated 3 years ago
- JAX implementation of the Mistral 7b v0.1 model☆13Mar 27, 2024Updated last year
- Like the Vulkan C API but with quality of life improvements of C++☆12Feb 24, 2024Updated last year
- Showcase of diffusion models☆14Feb 20, 2023Updated 2 years ago
- Monte Carlo Tree Search (MCTS) ,realize using python☆12Mar 10, 2016Updated 9 years ago
- A program that times various techniques for performing a moving median filter (sometimes called rolling median, or streaming median)☆11Feb 13, 2016Updated 10 years ago
- Quadruped Robot controller design and simulation on Webots☆12Apr 28, 2020Updated 5 years ago
- [AAMAS 2023] Code for the paper "Automatic Noise Filtering with Dynamic Sparse Training in Deep Reinforcement Learning"☆12Feb 22, 2024Updated last year