My solutions to Berkeley's CS294 (Deep Reinforcement Learning) Homework
☆36Jun 27, 2018Updated 7 years ago
Alternatives and similar repositories for CS294_homework
Users that are interested in CS294_homework are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- My solution to assignments in UC Berkeley CS294-112: Deep Reinforcement Learning☆94Mar 1, 2019Updated 7 years ago
- Assignments for CS294-112 Deep Reinforcement Learning in UC Berkeley in Fall 2018☆16Nov 15, 2018Updated 7 years ago
- Deep RL Algorithms implemented for UC Berkeley's CS 294-112: Deep Reinforcement Learning☆142Aug 22, 2017Updated 8 years ago
- Lecture notes of cs294-2017Fall☆10Feb 28, 2018Updated 8 years ago
- Assignments for CS294-112.☆1,653Mar 24, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆14Jul 14, 2018Updated 7 years ago
- A deep reinforcement learning approach to search engine ranking (PyTorch). Final Project for UC Berkeley's CS 285: Deep Reinforcement Lea…☆27May 5, 2024Updated last year
- Assignments for CS294-112.☆10Nov 16, 2018Updated 7 years ago
- ☆11Mar 6, 2021Updated 5 years ago
- ☆10Oct 19, 2020Updated 5 years ago
- ☆10Apr 18, 2017Updated 8 years ago
- Assignments for Berkeley CS 294: Deep Reinforcement Learning (Fall 2017)☆42Oct 20, 2017Updated 8 years ago
- Pytorch implementation of Randomized Ensembled Double Q-learning (REDQ)☆21Mar 12, 2021Updated 5 years ago
- This work allows to train and test 3 different types of LSTM systems for trajectory prediction. The generation of the datasets used for t…☆11Oct 3, 2019Updated 6 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- My solutions to CS285 2019 Fall of UC Berkeley☆14Nov 21, 2022Updated 3 years ago
- ☆13May 15, 2025Updated 10 months ago
- Solutions for CS294-112 Fall2018 assignments in Pytorch☆20Oct 13, 2018Updated 7 years ago
- (NeurIPS 2018) Hardware Conditioned Policies for Multi-Robot Transfer Learning☆20Apr 8, 2019Updated 7 years ago
- Repository for SIGIR'18 paper: "Ranking for Relevance and Display Preferences in Complex Presentation Layouts"☆16Aug 28, 2018Updated 7 years ago
- ☆46Sep 23, 2020Updated 5 years ago
- ☆19Aug 8, 2024Updated last year
- CADRE: Contextual Attention-based Drug REsponse☆12Nov 23, 2020Updated 5 years ago
- Code for "Learning Deep Features in Instrumental Variable Regression" (https://arxiv.org/abs/2010.07154)☆16Sep 16, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Gesture learning and execution via DMPs☆15Oct 29, 2018Updated 7 years ago
- A game search and evaluation parameter tuner using optuna framework☆14Jan 20, 2023Updated 3 years ago
- The social-LSTM code for complete trajectory prediction (20 frames). In this repository, the normalized trajectory and non-normalized tra…☆12Apr 16, 2023Updated 2 years ago
- Repo for Working with Open Data (Spring 2014 edition), a course at the School of Information, UC Berkeley☆34Dec 10, 2015Updated 10 years ago
- Solutions to the assignments of the CPSC 540 Machine Learning course (2013) taught by Nando de Freitas☆24Jan 4, 2014Updated 12 years ago
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago
- ☆13May 8, 2020Updated 5 years ago
- implement the classic reinforcement learning algorithm DQN to play supermariobrother☆15Dec 18, 2017Updated 8 years ago
- Distributed optimization framework with parameter server☆23Jun 14, 2015Updated 10 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A PyTorch implementation of Rainbow DQN agent☆170Apr 23, 2018Updated 7 years ago
- R/txshift: Efficient Estimation of the Causal Effects of Stochastic Interventions, with Corrections for Outcome-Dependent Sampling☆13Sep 21, 2024Updated last year
- AI library developed in functional scala☆14Dec 28, 2015Updated 10 years ago
- Correspondence Matrices are Underrated☆18Nov 2, 2020Updated 5 years ago
- A video-based multi-feature flame detection system.☆21Jun 7, 2014Updated 11 years ago
- My solutions to UC Berkeley CS285 (originally CS294-112, deeprlcourse) Fall 2019 assignments☆118Nov 21, 2022Updated 3 years ago
- ☆26Dec 1, 2020Updated 5 years ago