mandrakedrink / PPO-pytorchView external linksLinks
This repository contains the source code pytorch realization of PPO for solving openai gym enviroments.
☆21Oct 9, 2020Updated 5 years ago
Alternatives and similar repositories for PPO-pytorch
Users that are interested in PPO-pytorch are comparing it to the libraries listed below
Sorting:
- Model-Based RL Demo for Pendulum-v0☆13Jun 16, 2020Updated 5 years ago
- CFR implementation of a poker bot.☆12Feb 17, 2023Updated 2 years ago
- Tutorial: Writing R and Python Packages with Multithreaded C++ Code using BLAS, AVX2/AVX512, OpenMP, C++11 Threads and Cuda GPU accelerat…☆13Nov 27, 2022Updated 3 years ago
- A collection of different PyTorch wrappers for training neural networks and reinforcement algorithms☆13Dec 15, 2022Updated 3 years ago
- Some microbenchmarks and design docs before commencement☆12Feb 1, 2021Updated 5 years ago
- A QA system based on k8s-specific knowledge build on ChatGLM2-6B, serving by Ray.☆10Sep 14, 2023Updated 2 years ago
- A Texas Holdem poker framework written in C++ 20.☆11Apr 23, 2023Updated 2 years ago
- 🌿快速生成文件夹目录结构,支持定义目录层级,支持生成到 markdown 文件。☆13Oct 19, 2022Updated 3 years ago
- 🚀全流程自己训练一个VLA 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!☆26Oct 16, 2025Updated 4 months ago
- Implementation of our paper "Scaling Back-Translation with Domain Text Generation for Sign Language Gloss Translation". Accepted in EACL …☆11May 22, 2023Updated 2 years ago
- 3rd placed submission to the NeurIPS MineRL competition 2019☆10Mar 24, 2023Updated 2 years ago
- ☆18Apr 13, 2024Updated last year
- Reinforcement Learning Methods with PyTorch☆38Jan 16, 2020Updated 6 years ago
- 一个用 ChatGPT 生成命令行的小玩具☆10Mar 7, 2023Updated 2 years ago
- Catkinized version of the latest version of PCL (http://pointclouds.org/)☆14Apr 9, 2020Updated 5 years ago
- ToolEENet: Tool Affordance 6D Pose Estimation☆11Jun 29, 2024Updated last year
- Gym wrapper for pysc2☆10Sep 16, 2022Updated 3 years ago
- Swarm learning algorithm☆11Jun 2, 2021Updated 4 years ago
- Code for Findings of ACL 2023 paper "Improving Zero-shot Multilingual Neural Machine Translation by Leveraging Cross-lingual Consistency …☆10Jul 18, 2023Updated 2 years ago
- MLflow App Using React, Hooks, RabbitMQ, FastAPI Server, Celery, Microservices☆11Sep 25, 2022Updated 3 years ago
- Advanced_Data_Integration_Project☆11Jul 31, 2018Updated 7 years ago
- Official implementation of: "PlaySlot: Learning Inverse Latent Dynamics for Controllable Object-Centric Video Prediction and Planning" by…☆16Jun 2, 2025Updated 8 months ago
- Small extensions of the Bellman-Ford routines in NetworkX, primarily for convenience☆13May 7, 2018Updated 7 years ago
- The first large scale formally verified reasoning dataset for Verilog☆19May 16, 2025Updated 9 months ago
- Poker hand evaluation for Go☆12Feb 7, 2014Updated 12 years ago
- Neural machine translation with Recurrent Deterministic Policy Gradient☆10Aug 18, 2016Updated 9 years ago
- ☆11Jan 28, 2022Updated 4 years ago
- Reinforcement learning training project for a SLG game☆13Dec 21, 2017Updated 8 years ago
- Simulink implementations of sliding mode and LQR controller for rotary inverted pendulum☆12May 20, 2018Updated 7 years ago
- ☆16Jul 13, 2022Updated 3 years ago
- ☆11Apr 13, 2023Updated 2 years ago
- Implementation of elo rating for large competitions☆10Nov 25, 2016Updated 9 years ago
- A Simple Game Using Unity ML-Agents☆10Nov 20, 2020Updated 5 years ago
- Developing a ROS-Package of a linear inverted pendulum with N-Links along with a controller and create Tutorials for the same.☆10Apr 28, 2020Updated 5 years ago
- Python (pip) package for fitting mixtures of Student's t-distributions using either maximum likelihood (EM) or Bayesian methodology (vari…☆11Sep 23, 2025Updated 4 months ago
- ☆13May 21, 2024Updated last year
- Experiments in applying interpretability techniques to learned reward functions.☆10Dec 11, 2020Updated 5 years ago
- A text-based game where language models learn to lie and to detect lies.☆12Oct 4, 2023Updated 2 years ago
- Codebase for multilingual neural machine translation☆13Nov 24, 2022Updated 3 years ago