My solutions to Berkeley's CS294 (Deep Reinforcement Learning) Homework
☆36Jun 27, 2018Updated 7 years ago
Alternatives and similar repositories for CS294_homework
Users who are interested in CS294_homework are comparing it to the repositories listed below.
- My solutions to the UC Berkeley Deep Reinforcement Learning course offered in 2017 Fall, taught by Sergey Levine. Course URL: rll.berkele…☆13Mar 29, 2018Updated 8 years ago
- homework for CS294 Fall 2017☆166Feb 2, 2018Updated 8 years ago
- Assignments for CS294-112 Deep Reinforcement Learning in UC Berkeley in Fall 2018☆16Nov 15, 2018Updated 7 years ago
- Deep RL Algorithms implemented for UC Berkeley's CS 294-112: Deep Reinforcement Learning☆142Aug 22, 2017Updated 8 years ago
- Lecture notes of cs294-2017Fall☆10Feb 28, 2018Updated 8 years ago
- Assignments for CS294-112.☆1,654Mar 24, 2023Updated 3 years ago
- Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.☆11Jun 12, 2019Updated 6 years ago
- ☆14Jul 14, 2018Updated 7 years ago
- A deep reinforcement learning approach to search engine ranking (PyTorch). Final Project for UC Berkeley's CS 285: Deep Reinforcement Lea…☆27May 5, 2024Updated 2 years ago
- Assignments for CS294-112.☆10Nov 16, 2018Updated 7 years ago
- ☆11Mar 6, 2021Updated 5 years ago
- ☆10Oct 19, 2020Updated 5 years ago
- A reinforcement learning ready gazebo environment with Universal Robot 10. Similar to gym API.☆13May 4, 2017Updated 9 years ago
- ☆10Apr 18, 2017Updated 9 years ago
- Assignments for Berkeley CS 294: Deep Reinforcement Learning (Fall 2017)☆42Oct 20, 2017Updated 8 years ago
- This work allows training and testing 3 different types of LSTM systems for trajectory prediction. The generation of the datasets used for t…☆11Oct 3, 2019Updated 6 years ago
- My solutions to CS285 2019 Fall of UC Berkeley☆14Nov 21, 2022Updated 3 years ago
- ☆13May 15, 2025Updated last year
- Solutions for CS294-112 Fall2018 assignments in Pytorch☆20Oct 13, 2018Updated 7 years ago
- (NeurIPS 2018) Hardware Conditioned Policies for Multi-Robot Transfer Learning☆20Apr 8, 2019Updated 7 years ago
- Repository for SIGIR'18 paper: "Ranking for Relevance and Display Preferences in Complex Presentation Layouts"☆16Aug 28, 2018Updated 7 years ago
- ☆48Sep 23, 2020Updated 5 years ago
- This project explores a deep reinforcement learning technique to train an agent to play atari pong game from OpenAI Gym. OpenAI Gym is a …☆13Feb 18, 2018Updated 8 years ago
- Corresponding topologically different 3D shapes☆13Mar 3, 2017Updated 9 years ago
- TPLinker: Single-stage Joint Extraction of Entities and Relations Through Token Pair Linking☆19Apr 15, 2021Updated 5 years ago
- Code for "Learning Deep Features in Instrumental Variable Regression" (https://arxiv.org/abs/2010.07154)☆16Sep 16, 2024Updated last year
- an R package to perform synchronization analysis on motion energy time-series☆16Dec 13, 2024Updated last year
- recognition task with license plate dataset (26 letters A-Z and 10 digits 0-9). Each license plate has 5,6,7 or 8 characters. Dataset inc…☆14Sep 5, 2019Updated 6 years ago
- A PyTorch implementation of REINFORCE Learning To Rank on OHSUMED, MQ, etc. datasets. Basic idea also appears in SIGIR'17 Reinforcement Le…☆18Dec 8, 2017Updated 8 years ago
- Matrix Factorization based Movie Recommender System for group of users.☆14May 4, 2017Updated 9 years ago
- Repo for Working with Open Data (Spring 2014 edition), a course at the School of Information, UC Berkeley☆34Dec 10, 2015Updated 10 years ago
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago
- Implements the classic reinforcement learning algorithm DQN to play Super Mario Bros.☆15Dec 18, 2017Updated 8 years ago
- Distributed optimization framework with parameter server☆23Jun 14, 2015Updated 10 years ago
- A PyTorch implementation of Rainbow DQN agent☆170Apr 23, 2018Updated 8 years ago
- R/txshift: Efficient Estimation of the Causal Effects of Stochastic Interventions, with Corrections for Outcome-Dependent Sampling☆14Sep 21, 2024Updated last year
- AI library developed in functional scala☆14Dec 28, 2015Updated 10 years ago
- Correspondence Matrices are Underrated☆18Nov 2, 2020Updated 5 years ago
- Satellite package for LBP-TOP based face anti-spoofing☆10Oct 2, 2014Updated 11 years ago