TensorFlow implementation of Andrej Karpathy's blog post on reinforcement learning: http://karpathy.github.io/2016/05/31/rl/
☆31 Oct 4, 2020 Updated 5 years ago
Alternatives and similar repositories for policy-gradient-pong
Users that are interested in policy-gradient-pong are comparing it to the libraries listed below.
- Trains an agent with (stochastic) Policy Gradients (actor-critic) on Pong. Uses OpenAI Gym. ☆18 Jan 10, 2025 Updated last year
- MIMICIII Sepsis Survival Analysis ☆14 Mar 27, 2017 Updated 9 years ago
- Backprop with Low-Precision Activations ☆11 Oct 28, 2019 Updated 6 years ago
- ☆12 Oct 7, 2017 Updated 8 years ago
- PyTorch Implementation of the paper - 'Generative Adversarial Text to Image Synthesis' from ICML 2016 https://arxiv.org/abs/1605.05396 ☆10 May 23, 2021 Updated 4 years ago
- Code for NeurIPS 2021 paper "Curriculum Offline Imitation Learning" ☆18 Oct 21, 2022 Updated 3 years ago
- my experimental repository of Mask R-CNN based on lightweight network ☆10 Apr 20, 2019 Updated 6 years ago
- Survey on machine learning. ☆14 Nov 28, 2020 Updated 5 years ago
- Oracle backend for dplyr (R package) ☆14 Mar 23, 2016 Updated 10 years ago
- Learning to reinforcement learn and treating sepsis on the side ☆15 Dec 9, 2017 Updated 8 years ago
- 11-785 Group Project: YouShen Poetry generation ☆10 Dec 23, 2020 Updated 5 years ago
- ☆12 Nov 19, 2016 Updated 9 years ago
- This repository contains scripts for comparing OSM and authoritative road network datasets. ☆14 Apr 19, 2017 Updated 8 years ago
- A pipe friendly way to interact with an OMOP Common Data Model ☆14 Apr 2, 2026 Updated last week
- Model-Free Episodic Control ☆14 Jan 12, 2017 Updated 9 years ago
- Stan-code for Markov-switching vector autoregressive models ☆21 Oct 4, 2020 Updated 5 years ago
- CS234 Sepsis Simulator For RL ☆18 Dec 8, 2022 Updated 3 years ago
- A tool for experimenting with evolutionary optimization methods for machine learning algorithms, by distributing the workload over a larg… ☆14 Dec 19, 2018 Updated 7 years ago
- A PyTorch implementation of Human-Level Control through Deep Reinforcement Learning ☆24 Jun 6, 2017 Updated 8 years ago
- PyTorch implementation of DeepLab v2 (ResNet) + COCO-Stuff 10k/164k ☆15 Nov 7, 2018 Updated 7 years ago
- Automatically exported from code.google.com/p/pyrbf ☆11 May 4, 2015 Updated 10 years ago
- TopoRhino ☆11 Feb 4, 2020 Updated 6 years ago
- Deep Q Network implemented in TensorFlow ☆25 Mar 9, 2018 Updated 8 years ago
- Open AI Gym Environment For MIMIC Dataset Sepsis Patient ☆24 Dec 8, 2022 Updated 3 years ago
- The mechanoChemIGA code is an isogeometric analysis based code used to solve the partial differential equations describing solid mechanic… ☆14 Oct 15, 2020 Updated 5 years ago
- Sequential Monte Carlo sampler for PyMC2 models. ☆13 Apr 4, 2018 Updated 8 years ago
- coding examples to Intro to RL ☆13 Apr 30, 2018 Updated 7 years ago
- ☆23 Jan 28, 2018 Updated 8 years ago
- ☆15 Jan 11, 2019 Updated 7 years ago
- Train a quadcopter to fly with a deep reinforcement learning algorithm - DDPG ☆12 Jul 19, 2018 Updated 7 years ago
- litellm helper ☆31 Updated this week
- The code used, and a docker image to run it, of the paper `Exploiting locality and physical invariants to design effective Deep Reinforce… ☆13 Dec 10, 2019 Updated 6 years ago
- Inference on marginal distributions using gradient-based optimization ☆13 Mar 27, 2017 Updated 9 years ago
- Experimentation with Streamlit for personal LLM tool ☆15 Jun 19, 2023 Updated 2 years ago
- Implementation of lid driven cavity solver based on SIMPLE algorithm ☆16 Jan 11, 2019 Updated 7 years ago
- A PyTorch implementation of Dilated RNN ☆11 Dec 31, 2017 Updated 8 years ago
- An R package providing access to the OpenAI Gym API ☆21 Jul 1, 2017 Updated 8 years ago
- A Python library for parsing OSM streams. ☆15 May 8, 2021 Updated 4 years ago
- Windows version of StatTag ☆24 Dec 19, 2024 Updated last year