UCT tree search + supervised learning for Atari games
☆12Feb 14, 2017Updated 9 years ago
Alternatives and similar repositories for uct_atari
Users that are interested in uct_atari are comparing it to the libraries listed below.
- Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019.☆10Jan 10, 2019Updated 7 years ago
- Python MUD/MUX/MUSH/MU* development system☆26Oct 30, 2015Updated 10 years ago
- Reinforcement learning library for PyTorch.☆11Jun 15, 2018Updated 7 years ago
- Implementation of Variational Intrinsic Control in tensorflow☆11Apr 5, 2017Updated 9 years ago
- Implement BinaryNet of CNN with chainer☆11May 5, 2016Updated 10 years ago
- ☆10Sep 20, 2018Updated 7 years ago
- The architecture used to train the level generator in the game Relay.☆12Apr 8, 2017Updated 9 years ago
- ☆34Jan 14, 2021Updated 5 years ago
- WIP implementation of Probabilistic Differential Dynamic Programming in PyTorch☆16Jul 25, 2024Updated last year
- APAR: LLMs Can Do Auto-Parallel Auto-Regressive Decoding☆14Jul 22, 2024Updated last year
- IR Remote Controller for WS2812(or analog) LED Strip☆10Nov 28, 2022Updated 3 years ago
- Hindsight policy gradients☆46Jan 31, 2020Updated 6 years ago
- PyLIS (Life in Silico with PyBullet)☆18May 5, 2022Updated 4 years ago
- Receiver operating characteristic curve (ROC) computation code in C++☆11Jul 17, 2017Updated 8 years ago
- Libp2p bindings for Python☆12Mar 21, 2026Updated last month
- Collaborative inference of latent diffusion via hivemind☆12May 29, 2023Updated 2 years ago
- A python implementation of PROCLUS: PROjected CLUStering algorithm.☆10Jan 12, 2015Updated 11 years ago
- Principal Feature Visualization for convolutional neural networks☆11Jan 28, 2021Updated 5 years ago
- Recommendation engine and its algorithms in Python and R.☆12Oct 26, 2018Updated 7 years ago
- ArXiv'18 implementation of amortized maximum likelihood (AML) for high-quality, weakly-supervised shape completion.☆11Nov 30, 2018Updated 7 years ago
- Duel_DDQN (Dueling Network Architectures + Double DQN) using Keras☆31Jun 26, 2016Updated 9 years ago
- ☆12Sep 27, 2023Updated 2 years ago
- An implementation of the Sequence to Sequence model using the Lasagne library (WIP)☆12Aug 11, 2016Updated 9 years ago
- [not maintained anymore] [for study purpose] A simple PyTorch implementation for "Global Vectors for Word Representation".☆17Nov 7, 2019Updated 6 years ago
- Minimal Transformer base in JAX. A single backbone for language modelling, diffusion, classification, etc...☆16May 28, 2025Updated 11 months ago
- ☆17Sep 10, 2024Updated last year
- ☆18Jan 3, 2022Updated 4 years ago
- A collection of reading material for the Workshop on "Structure & Priors in Reinforcement Learning" (SPiRL) at ICLR 2019.☆13May 5, 2021Updated 5 years ago
- Hands-On TensorBoard for PyTorch Developers, Published by Packt☆11Dec 15, 2025Updated 5 months ago
- A badge for joining a Telegram chat room or channel.☆15Jan 9, 2016Updated 10 years ago
- Contains simple MPC implementation with neural network learned dynamics.☆17Feb 16, 2018Updated 8 years ago
- OpenAI Gym Environment for 2048☆17Dec 4, 2022Updated 3 years ago
- The server portion of the Neural Chat project to deploy chatbots on web. This code is accompanied by another repository that includes the…☆37Jun 10, 2021Updated 4 years ago
- Blog post: how to do deterministic policy gradient with gumbel softmax and why you should do it.☆12Jun 20, 2017Updated 8 years ago
- Implementation of the Self Paced Reinforcement Learning Experiments☆19Sep 27, 2023Updated 2 years ago
- PyTorch implementation of SAC-Q Reinforcement Learning Algorithm (tested on OpenAI Gym environments)☆38Feb 13, 2021Updated 5 years ago
- Voice to vector [Russian]☆15Feb 5, 2017Updated 9 years ago
- MuJoCo models for Unitree Robots☆12Nov 24, 2021Updated 4 years ago
- A Python 3 Bandit Visualization Package☆11Oct 16, 2017Updated 8 years ago