Proximal Policy Optimization with TensorFlow and OpenAI Gym
☆18Mar 31, 2018Updated 7 years ago
Alternatives and similar repositories for ppo
Users that are interested in ppo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Tensorflow implementation of proximal policy optimization (PPO) algorithm☆13Feb 28, 2018Updated 8 years ago
- Implementation of proximal policy optimization(PPO) with tensorflow☆35Feb 10, 2018Updated 8 years ago
- Alber Deep Learning☆12Sep 25, 2017Updated 8 years ago
- Official Implementation for the ICLR2023 paper "Fuzzy Alignments in Directed Acyclic Graph for Non-autoregressive Machine Translation"☆14Mar 1, 2023Updated 3 years ago
- This is the code repo of our Pattern Recognition journal on IPR protection of Image Captioning Models☆11Aug 29, 2023Updated 2 years ago
- Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"☆20Oct 6, 2021Updated 4 years ago
- [ICLR 2025] Implementation of "Node Identifiers: Compact, Discrete Representations for Efficient Graph Learning"☆17Jun 6, 2025Updated 9 months ago
- Trust Region Policy Optimization with TensorFlow and OpenAI Gym☆362Jun 2, 2020Updated 5 years ago
- Originally written by mamaich☆27Feb 6, 2014Updated 12 years ago
- To convert a 2D image into 3D image and make it move.☆10Mar 3, 2019Updated 7 years ago
- LaTeX template for dissertation proposals in Peking University Shenzhen.☆15Feb 23, 2022Updated 4 years ago
- Examples of published reinforcement learning algorithms in recent literature implemented in TensorFlow☆103Aug 3, 2020Updated 5 years ago
- Painless distributed training with torch☆12Updated this week
- Reference implementation of the paper "Efficient and Scalable Graph Generation through Iterative Local Expansion"☆16Aug 27, 2025Updated 6 months ago
- Smart RTOS☆10Apr 18, 2015Updated 10 years ago
- official implementation of RoSAS: Deep Semi-supervised Anomaly Detection with Contamination-resilient Continuous Supervision☆11Jul 18, 2023Updated 2 years ago
- ☆13Sep 10, 2025Updated 6 months ago
- Displays upcoming shuttle launch information on a 20x4 RasPi LCD. All data is pulled from SpaceFlightNow.com.☆12Aug 1, 2016Updated 9 years ago
- Official codebase for "Score-based Diffusion Models in Function Space"☆18Jan 23, 2025Updated last year
- Assignments for CS294-112 Deep Reinforcement Learning in UC Berkeley in Fall 2018☆16Nov 15, 2018Updated 7 years ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Jul 22, 2023Updated 2 years ago
- Non-autoregressive Translation by Learning Target Categorical Codes☆11Jul 11, 2021Updated 4 years ago
- Precision Knowledge Editing (PKE): A novel method to reduce toxicity in LLMs while preserving performance, with robust evaluations and ha…☆11Nov 26, 2024Updated last year
- Sign EBOOT.PBP files for PSP☆29Dec 23, 2020Updated 5 years ago
- [NeurIPS 2024] Image Understanding Makes for A Good Tokenizer for Image Generation☆22Dec 17, 2024Updated last year
- Attend - to what matters.☆17Feb 22, 2025Updated last year
- VCR-Graphormer: A Mini-batch Graph Transformer via Virtual Connections, ICLR 2024☆14May 9, 2024Updated last year
- Self-implemented code for Model-Based Meta-Reinforcement Learning☆17Apr 28, 2019Updated 6 years ago
- An official implementation of "Rethinking Graph Backdoor Attacks: A Distribution-Preserving Perspective" (KDD 2024)☆12Sep 16, 2024Updated last year
- 🎓 Elegantly manage your GUC academic life☆15Jul 5, 2025Updated 8 months ago
- baikal.ai's pre-trained BERT models: descriptions and sample codes☆12Jun 24, 2021Updated 4 years ago
- Support material for nucl.ai Conference 2016 workshops and open laboratories.☆20Jul 20, 2016Updated 9 years ago
- Scalable and privacy-enhanced graph generative models for benchmark graph neural networks☆17Nov 1, 2023Updated 2 years ago
- Implementation of Variational Hierarchical User-based Conversation Model☆10Jul 2, 2021Updated 4 years ago
- The code of COMMA: Modeling Relationship among Motivations, Emotions and Actions in Language-based Human Activities. https://aclanthology…☆12Oct 12, 2022Updated 3 years ago
- An official codebase for "NormLens: Reading Books is Great, But Not if You Are Driving! Visually Grounded Reasoning about Defeasible Comm…☆10May 9, 2024Updated last year
- The repository implements the paper "Learning Graph Quantized Tokenizers for Transformers".☆30Apr 2, 2025Updated 11 months ago
- A save editor for QSP games☆25Jan 29, 2021Updated 5 years ago
- Code repository for the paper on "Predicting the Performance of Black-Box LLMs through Self-Queries".☆12Jan 9, 2025Updated last year