Proximal Policy Optimization (PPO) algorithm for Contra
☆144Oct 6, 2023Updated 2 years ago
Alternatives and similar repositories for Contra-PPO-pytorch
Users that are interested in Contra-PPO-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Proximal Policy Optimization (PPO) algorithm for Super Mario Bros☆1,287Jul 24, 2021Updated 4 years ago
- Demo of predict and train YOLOv8 with custom data☆18Feb 1, 2023Updated 3 years ago
- Deployment of ML model using flask☆22Jan 8, 2019Updated 7 years ago
- Vietnamese GPT-J API service deployed with Docker & Helm chart☆10Dec 11, 2022Updated 3 years ago
- ☆15Aug 16, 2018Updated 7 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- implementation of "Evolution Strategies as a Scalable Alternative to Reinforcement Learning" OpenAI paper☆20Apr 18, 2021Updated 5 years ago
- Code for the paper Alpha Zero in Continuous Action Space (A0C) (https://arxiv.org/pdf/1805.09613.pdf)☆15Jan 19, 2021Updated 5 years ago
- Code and project page for D-REX algorithm from the paper "Better-than-Demonstrator Imitation Learning via Automatically-Ranked Demonstrat…☆51Dec 8, 2022Updated 3 years ago
- Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis☆32Jan 9, 2019Updated 7 years ago
- Curiosity-driven Exploration by Self-supervised Prediction for Street Fighter III Third Strike☆164Nov 11, 2019Updated 6 years ago
- Pattern language for abstract single-player games and puzzles, and Unity player☆14Dec 7, 2022Updated 3 years ago
- Versions of hybrid pso algorithms for engineering optimization☆10Dec 21, 2017Updated 8 years ago
- PyTorch implementation of Sample Efficient Actor-Critic with Experience Replay(ACER)☆16Oct 7, 2020Updated 5 years ago
- PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN☆46Oct 4, 2020Updated 5 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Automatic code generator for training Reinforcement Learning policies☆11Jan 3, 2021Updated 5 years ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆50Dec 8, 2022Updated 3 years ago
- my docker files !☆11Jun 29, 2020Updated 5 years ago
- paper <<Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation>> python implementation☆10Mar 27, 2018Updated 8 years ago
- runs the emulator that communicates with the weplay io backend☆25Feb 22, 2017Updated 9 years ago
- Vision-Based Navigation for Auto-Docking☆13Apr 21, 2021Updated 5 years ago
- Benchmark dataset for maritime multi-sensor, multi-target tracking☆17Apr 19, 2022Updated 4 years ago
- YOLOv5 in DOTA with CSL_label.(Oriented Object Detection)(Rotation Detection)(Rotated BBox)基于YOLOv5的旋转目标检测☆10Mar 27, 2021Updated 5 years ago
- Netflix DNS proxy written in Go☆29Jun 7, 2014Updated 11 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Asynchronous Advantage Actor-Critic (A3C) algorithm for Super Mario Bros☆1,109Apr 28, 2024Updated 2 years ago
- Deep Q-learning for playing tetris game☆532Apr 3, 2023Updated 3 years ago
- Example repository for running a federated Nomad / Consul / Vault cluster in AWS and GCP☆18Apr 4, 2018Updated 8 years ago
- The goal of this design is to use the PYNQ-Z2 development board to design a general convolution neural network accelerator. And through r…☆11Sep 30, 2020Updated 5 years ago
- an $85 arduino thermocycler☆16Jul 24, 2012Updated 13 years ago
- ☆12May 26, 2022Updated 4 years ago
- ☆10Jan 22, 2023Updated 3 years ago
- BfA / 8.0 update for SpeakinSpell☆10Aug 17, 2018Updated 7 years ago
- Эгея — движок блога, созданный Ильей Бирманом☆10Mar 16, 2026Updated 2 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A collection of Fault Diagnosis python codes☆10Mar 13, 2022Updated 4 years ago
- Various reinforcement learning algorithms written in Jax + Flax☆26Jun 24, 2023Updated 2 years ago
- CNN implementation in C++ categorizing the MNIST data set☆17Sep 24, 2019Updated 6 years ago
- JigLib rigid body physics engine☆12Feb 10, 2022Updated 4 years ago
- Creates temporary Jupyter Notebook servers using Docker containers.☆14Mar 27, 2016Updated 10 years ago
- The codebase and datasets for the IJCAI 2021 paper "The Surprising Power of Graph Neural Networks with Random Node Initialization".☆22Jun 3, 2021Updated 4 years ago
- Secret ballot voting Smart Contract with Quorum☆15Dec 28, 2017Updated 8 years ago