This repo contains a PPO implementation in PyTorch for LunarLander-v2
☆11Jun 26, 2020Updated 5 years ago
Alternatives and similar repositories for PPO_PyTorch
Users that are interested in PPO_PyTorch are comparing it to the libraries listed below
- PyTorch implementation of different Deep RL algorithms for the LunarLander-v2 environment in OpenAI Gym☆11May 20, 2018Updated 7 years ago
- Implementation of reinforcement learning algorithms for the OpenAI Gym environment LunarLander-v2☆24Mar 8, 2021Updated 4 years ago
- ☆10Jun 21, 2021Updated 4 years ago
- Drawing on ratsnlp, KOGPT2, and the recipegpt GitHub repository, we built a model that generates a recipe from a dish name and ingredient names!!☆11Dec 28, 2021Updated 4 years ago
- PyData Boston 2013 talks: "Intro to scikit-learn" & "Realtime Predictive Analytics: Using scikit-learn and RabbitMQ"☆11Jan 5, 2014Updated 12 years ago
- ☆10Oct 26, 2022Updated 3 years ago
- OpenAI LunarLander-v2 DeepRL-based solutions (DQN, DuelingDQN, D3QN)☆43Aug 11, 2021Updated 4 years ago
- A C++ Package for Solving Multiple-Phase Optimal Control Problem Using Adaptive Radau Pseudospectral Methods☆10Aug 31, 2020Updated 5 years ago
- ☆14Mar 10, 2021Updated 4 years ago
- ☆10Aug 17, 2018Updated 7 years ago
- ☆14Jan 14, 2025Updated last year
- pytorch faster r-cnn☆11Dec 21, 2020Updated 5 years ago
- Pang-Yo Lab (팡요랩) materials☆11May 31, 2019Updated 6 years ago
- OpenAI Gym's LunarLander-v2 Implementation☆43Apr 27, 2024Updated last year
- Source Code for 'Practical Blockchains and Cryptocurrencies' by Karan Singh Garewal☆13Sep 5, 2020Updated 5 years ago
- RapidJson Plugin for UnrealEngine 4☆11Nov 22, 2019Updated 6 years ago
- Improving upon state of the art cooperative deep reinforcement learning in StarCraft II☆13May 16, 2019Updated 6 years ago
- The Apache Kafka Connector connects to the IE Databus and Apache Kafka.☆10Nov 25, 2025Updated 3 months ago
- ☆12Sep 20, 2021Updated 4 years ago
- ☆10Mar 14, 2022Updated 3 years ago
- Public examples for FORCES NLP☆12Jun 20, 2017Updated 8 years ago
- PlaNet: Learning Latent Dynamics for Planning from Pixels☆10Feb 13, 2020Updated 6 years ago
- ☆13Mar 9, 2024Updated last year
- NTHU CS6135 VLSI Physical Design Automation☆12Mar 12, 2022Updated 3 years ago
- ☆18Oct 3, 2024Updated last year
- Data camp notes in jupyter notebook☆14Aug 2, 2018Updated 7 years ago
- Build FMUs using modern C++☆19Dec 3, 2025Updated 2 months ago
- Code for Policy Consolidation for Continual Reinforcement Learning☆10May 12, 2019Updated 6 years ago
- frame interpolation for CLIP guided videos☆15Aug 18, 2022Updated 3 years ago
- Reinforcement Learning for Rocket Lander Environment☆10Jun 6, 2018Updated 7 years ago
- We present a new comparative study of the paper from Hansson and Boyd Robust Optimal Control of Linear Discrete-Time Systems using Primal…☆13May 5, 2020Updated 5 years ago
- ☆12Mar 23, 2021Updated 4 years ago
- 4 bits quantization of LLaMa using GPTQ☆12Jun 2, 2023Updated 2 years ago
- [NeurIPS, 2020 - Reproducibility Challenge]: [RE] Towards Interpretable Reinforcement Learning Using Attention Augmented Agents☆13Apr 26, 2021Updated 4 years ago
- Machine Learning (Imagenet) User Interface Demo application using Streamlit☆17Mar 25, 2023Updated 2 years ago
- This repository provides a framework to serve LLM(Large Language Model) based applications such as Chatbot.☆18Apr 20, 2023Updated 2 years ago
- Conversion script adapting vicuna dataset into alpaca format for use with oobabooga's trainer☆13Jun 21, 2023Updated 2 years ago
- A2C training of Relational Deep Reinforcement Learning Architecture☆13Jun 22, 2022Updated 3 years ago
- ☆14Oct 23, 2018Updated 7 years ago