tensorflow implementation of Andrej Karpathy's blog about reinforcement learning. http://karpathy.github.io/2016/05/31/rl/
☆31Oct 4, 2020Updated 5 years ago
Alternatives and similar repositories for policy-gradient-pong
Users that are interested in policy-gradient-pong are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- AsynchroNous Disk-based Representation of MassivE DAta: An R package aimed at replacing ff for storing large data objects.☆11Nov 21, 2025Updated 5 months ago
- Trains an agent with (stochastic) Policy Gradients(actor-critic) on Pong. Uses OpenAI Gym.☆18Jan 10, 2025Updated last year
- MIMICIII Sepsis Survival Analysis☆14Mar 27, 2017Updated 9 years ago
- Variational autoencoder implementation using Tensorflow and Python☆10Dec 6, 2016Updated 9 years ago
- ☆12Oct 7, 2017Updated 8 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- State of the Art Language models and Classifier for Odia, which is spoken in the Indian state of Odisha☆14Aug 7, 2020Updated 5 years ago
- Denoising method based on Deep Image Prior and Neural Image Assessment☆10Mar 10, 2020Updated 6 years ago
- Code for NeurIPS 2021 paper "Curriculum Offline Imitation Learning"☆18Oct 21, 2022Updated 3 years ago
- Survey on machine learning.☆14Nov 28, 2020Updated 5 years ago
- Gem that offers access to data of Abbott's FreeStyle Libre over USB and from the official export☆11May 13, 2017Updated 8 years ago
- Reinforcement learning for sepsis☆17Feb 14, 2020Updated 6 years ago
- Cache your API calls with a single line of code. No mocks, no fixtures. Just faster, cleaner code.☆25Updated this week
- 11-785 Group Project: YouShen Poetry generation☆10Dec 23, 2020Updated 5 years ago
- A PyTorch implementation of Human-Level Control through Deep Reinforcement Learning☆24Jun 6, 2017Updated 8 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A simple python wrapper for using the Caddy API☆26Updated this week
- TopoRhino☆12Feb 4, 2020Updated 6 years ago
- Prune your sklearn models☆19Oct 28, 2024Updated last year
- Open AI Gym Environment For MIMIC Dataset Sepsis Patient☆24Dec 8, 2022Updated 3 years ago
- Sequential Monte Carlo sampler for PyMC2 models.☆13Apr 4, 2018Updated 8 years ago
- Deep learning model for sepsis prediction using high-frequency data☆18May 5, 2019Updated 6 years ago
- coding examples to Intro to RL☆13Apr 30, 2018Updated 8 years ago
- Train a quadcopter to fly with a deep reinforcement learning algorithm - DDPG☆12Jul 19, 2018Updated 7 years ago
- The code used, and a docker image to run it, of the paper `Exploiting locality and physical invariants to design effective Deep Reinforce…☆13Dec 10, 2019Updated 6 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Experimentation with Streamlit for personal LLM tool☆15Jun 19, 2023Updated 2 years ago
- Implementation of lid driven cavity solver based on SIMPLE algorithm☆16Jan 11, 2019Updated 7 years ago
- A PyTorch implement of Dilated RNN☆11Dec 31, 2017Updated 8 years ago
- An R package providing access to the OpenAI Gym API☆21Jul 1, 2017Updated 8 years ago
- Pytorch implementation of YOLO v1 from scratch☆13May 21, 2024Updated last year
- A Python library for parsing OSM streams.☆15May 8, 2021Updated 4 years ago
- ☆14Aug 18, 2023Updated 2 years ago
- Python tools for solving data-constrained finite element problems☆14Nov 9, 2021Updated 4 years ago
- Code for "Semi-Supervised Models via Data Augmentation for Classifying Interactive Affective Responses"☆15Jun 26, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Comparison between Sarsa and Q-Learning algorithms on risk handling☆17Jul 10, 2017Updated 8 years ago
- ☆14Nov 13, 2017Updated 8 years ago
- Rough codebase for exploring initialization strategies for new word embeddings in pretrained LMs☆19Dec 10, 2021Updated 4 years ago
- ☆11Apr 13, 2025Updated last year
- ☆18Nov 16, 2020Updated 5 years ago
- Sample to Bitcoin Address Generation☆15Nov 14, 2018Updated 7 years ago
- NumerBay (https://numerbay.ai) - The Numerai Community Marketplace for anything Numerai.☆18Apr 12, 2026Updated 3 weeks ago