Trains an agent with (stochastic) Policy Gradients(actor-critic) on Pong. Uses OpenAI Gym.
☆18Jan 10, 2025Updated last year
Alternatives and similar repositories for pong_actor-critic
Users that are interested in pong_actor-critic are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A web app for annotating Freesound loops, and the tools to analyse the dataset created.☆20Jul 6, 2023Updated 2 years ago
- Published by Packt☆11Jan 18, 2021Updated 5 years ago
- Code for abstracting, evaluating, and visualizing Markov Decision Processes.☆10Jan 12, 2017Updated 9 years ago
- FFT Explorations (basic implementation)☆10Aug 8, 2014Updated 11 years ago
- Sequential Monte Carlo sampler for PyMC2 models.☆13Apr 4, 2018Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆14Dec 10, 2017Updated 8 years ago
- ROBEL: Robotics Benchmarks for Learning with low-cost robots (dev fork)☆13Jul 30, 2020Updated 5 years ago
- [DEPRECATED] Advantage Actor Critic model in PyTorch inspired by OpenAI baselines TensorFlow implementation☆52Feb 4, 2020Updated 6 years ago
- ☆12Dec 8, 2016Updated 9 years ago
- A Python library for parsing OSM streams.☆15May 8, 2021Updated 5 years ago
- Code for "Predictive-Corrective Networks for Action Detection"☆16Nov 29, 2017Updated 8 years ago
- ☆14Aug 18, 2023Updated 2 years ago
- Implementations on OpenAI's Gym☆10Nov 21, 2017Updated 8 years ago
- Temporal Difference Learning based Backgammon game using Neural Network based model☆11Mar 13, 2018Updated 8 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- A CUDA implementation of the ZeroOut tensorflow custom op, just for fun☆11Feb 1, 2017Updated 9 years ago
- Audio Masking Methods☆12Nov 15, 2019Updated 6 years ago
- In this repository I'll be programming the cool exercises of the Book Reinforcement-Learning: An introduction by Sutton☆14Apr 15, 2018Updated 8 years ago
- ☆13May 15, 2025Updated last year
- self implementation of DPPO, Distributed Proximal Policy Optimization, by using tensorflow☆12Sep 1, 2017Updated 8 years ago
- Repo for PyData 2018 tuorial☆12Oct 18, 2018Updated 7 years ago
- Predictive State Recurrent Neural Networks☆18May 18, 2020Updated 6 years ago
- Mancs: A multi-task attentional network with curriculum sampling for person re-identification☆13Aug 5, 2019Updated 6 years ago
- Trust Region Policy Optimization with Generalized Advantage Estimator☆16Nov 15, 2018Updated 7 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Vue app for https://github.com/bearpelican/musicautobot☆17Dec 10, 2022Updated 3 years ago
- Using Multiple GPU with tensorflow☆13Dec 28, 2018Updated 7 years ago
- Odds and Ends and Things I've implemented.☆78Jan 31, 2019Updated 7 years ago
- ☆25Mar 7, 2026Updated 2 months ago
- Reproducible Data Science in Python (SciPy 2019 Tutorial)☆13Feb 2, 2023Updated 3 years ago
- ☆26May 20, 2026Updated last week
- Files from the published Alpha Star paper by DeepMind☆18Nov 14, 2019Updated 6 years ago
- Using Deep Learning to predict audio quality.☆18Jan 31, 2020Updated 6 years ago
- Pytorch Implementation of MusicVAE☆16May 4, 2019Updated 7 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Companion code to the GraphQL vs REST video☆17Feb 12, 2022Updated 4 years ago
- Discriminative Unsupervised Feature Learning with Convolutional Neural Networks☆19Feb 9, 2022Updated 4 years ago
- ☆19Feb 1, 2024Updated 2 years ago
- A fire-tested template for production grade python libraries and packages.☆18Jul 15, 2025Updated 10 months ago
- A dataset for chord coloring and voicing☆20Nov 2, 2020Updated 5 years ago
- Undergraduate course in Pattern Recognition and Imaging.☆18Oct 4, 2022Updated 3 years ago
- Harmonizes a melody with the likeliest sequence of chords using dynamic Bayesian networks☆15Jul 7, 2018Updated 7 years ago