lightaime / cs294
Berkeley Deep Reinforcement Learning cs294 solution
☆13Updated 7 years ago
Alternatives and similar repositories for cs294:
Users that are interested in cs294 are comparing it to the libraries listed below
- ☆8Updated 8 years ago
- simple reinforcement learning example for the minecraft☆9Updated 6 years ago
- ☆19Updated 5 years ago
- Logging utility for ML experiments☆16Updated 2 years ago
- Linear Algebra for Machine Learning Book Exercises☆13Updated 5 years ago
- ☆13Updated 4 years ago
- Distributed Tensorflow Implementation of Asynchronous Methods for Deep Reinforcement Learning☆31Updated 7 years ago
- reinfore learning tool box, contains trpo, a3c algorithm for continous action space☆43Updated 6 years ago
- A first bare bones paralleled implementation of Go Explore as described by the Uber Engineering blog post☆45Updated 5 years ago
- Code Released for NeurIPS 2018 paper: Synthesized Policies for Transfer and Adaptation across Tasks and Environments☆16Updated 5 years ago
- A skeleton pytorch codebase commonly used across my projects☆30Updated 5 years ago
- DS-GA 1003: Machine Learning Course Webpage☆16Updated 2 years ago
- Towards Visual Explanations for Convolutional Neural Networks via Input Resampling☆14Updated 7 years ago
- Reinforcement learning in 3D.☆21Updated 7 years ago
- Pytorch extension for Singular Value Decompostion (SVD) with LAPACK gesvd backend☆27Updated 4 years ago
- A Keras inspired training utility for PyTorch☆38Updated 6 years ago
- Programs with word vectors, RNN, NLP stuff, etc☆18Updated 7 years ago
- Implement Natural Language Object Retrieval in tensorflow☆36Updated 8 years ago
- Exercises for the semi-supervised summer school https://semisupervised-learning.compute.dtu.dk.☆9Updated 8 years ago
- Code for "Probabilistic Neural Programmed Networks for Scene Generation.", Deng et al, NIPS 2018☆40Updated 5 months ago
- PyTorch implementation of the Value Iteration Networks (VIN) (NIPS '16 best paper)☆80Updated 7 years ago
- Implementation of modular composition network from https://arxiv.org/pdf/1711.11289.pdf☆25Updated 7 years ago
- ☆10Updated 5 years ago
- Implementation of AlphaZero in PyTorch.☆10Updated 5 years ago
- Collaborative Deep Reinforcement Learning☆32Updated 7 years ago
- Conference notes for AAAI 2019☆15Updated 5 years ago
- The good practice in the VQA system such as pos-tag attention, structed triplet learning and triplet attention is very general and can be…☆20Updated 6 years ago