homework for CS234 2017
☆150May 4, 2018Updated 8 years ago
Alternatives and similar repositories for CS234
Users that are interested in CS234 are comparing it to the libraries listed below.
- homework for CS294 Fall 2017☆166Feb 2, 2018Updated 8 years ago
- Environments with IC3Net paper☆15Jan 8, 2019Updated 7 years ago
- Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"☆20Oct 6, 2021Updated 4 years ago
- A new paper list for multi-agent reinforcement learning (actively maintained)☆24Mar 27, 2020Updated 6 years ago
- Implementation of PatchAIL in the ICLR 2023 paper <Visual Imitation with Patch Rewards>☆14Feb 15, 2023Updated 3 years ago
- Analytical methods for efficient inference of integrate-and-fire circuit models from single-trial spike trains☆11Oct 30, 2019Updated 6 years ago
- a starter-kit for jaynes, the cloud-agnostic launch library☆17Apr 1, 2026Updated last month
- [2017] Solving homeworks for Berkeley Deep Reinforcement Learning Course☆33Sep 11, 2017Updated 8 years ago
- FEN Code☆41Nov 4, 2019Updated 6 years ago
- Official repository for paper "Conservative Offline Distributional Reinforcement Learning" (NeurIPS 2021)☆22Aug 1, 2021Updated 4 years ago
- ☆19Mar 21, 2020Updated 6 years ago
- Pytorch Implementation of AAMAS 2021 paper <Energy-Based Imitation Learning>☆11Oct 8, 2021Updated 4 years ago
- DeepSynth: Automata Synthesis for Automatic Task Segmentation in Deep Reinforcement Learning☆24Apr 17, 2021Updated 5 years ago
- My Solutions of Assignments of CS234: Reinforcement Learning Winter 2019☆170Mar 24, 2023Updated 3 years ago
- Implementation of 'A Neural Compositional Paradigm for Image Captioning' by B. Dai, S.Fidler, D. Lin☆12Mar 15, 2019Updated 7 years ago
- ☆10Jun 29, 2021Updated 4 years ago
- DQN to play Atari Pong☆113Jan 15, 2019Updated 7 years ago
- OpenRedukti is a C++ library for Interest Rate Swaps and Fras, supports bootstrapping of Interest Rate Curves, computing NPV and sensitiv…☆10Jul 28, 2023Updated 2 years ago
- ☆10Feb 21, 2025Updated last year
- Code for recreating the results of our RSS 2020 paper, 'Learning Memory-Based Control for Human-Scale Bipedal Locomotion.'☆10Aug 18, 2022Updated 3 years ago
- ☆11Apr 7, 2019Updated 7 years ago
- Assignment Solutions to CS234: Reinforcement learning course☆36Aug 24, 2018Updated 7 years ago
- Maximum Entropy toolbox for MATLAB☆16Mar 28, 2019Updated 7 years ago
- ☆11Dec 18, 2015Updated 10 years ago
- yet another reinforcement learning package☆12May 24, 2022Updated 3 years ago
- Bomberman deep reinforcement learning challenge in PyTorch☆27Jan 3, 2019Updated 7 years ago
- Programming Assignments and Lectures for UC Berkeley's CS 294: Deep Reinforcement Learning☆58Jul 15, 2018Updated 7 years ago
- Using GNN and DQN to find a better branching heuristic for a CDCL Solver☆54Oct 20, 2020Updated 5 years ago
- Deep Reinforcement Learning with pytorch & visdom☆805Jul 16, 2020Updated 5 years ago
- A python notebook showing how to visualize laplace transforms☆11Feb 13, 2019Updated 7 years ago
- Basic C++ Hidden Markov Model functionality implementation with state prediction estimation☆16Apr 29, 2013Updated 13 years ago
- A Multi-agent Learning Framework☆62May 10, 2021Updated 4 years ago
- Cheatsheet for CS189/289 at UC Berkeley☆16May 14, 2015Updated 10 years ago
- ☆12May 22, 2016Updated 9 years ago
- Clustering documents based on LSH☆14Apr 20, 2016Updated 10 years ago
- https://arxiv.org/abs/2102.12594☆14Oct 3, 2023Updated 2 years ago
- I am implementing a lot of reinforcement learning and imitation learning algorithms since I'm sick of reading about them but not really u…☆53Feb 16, 2020Updated 6 years ago
- ☆25Dec 18, 2015Updated 10 years ago
- convert the deep-residual-network(50, 101, 152) from caffe to mxnet☆11Aug 26, 2016Updated 9 years ago