Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch.
☆14May 17, 2024Updated last year
Alternatives and similar repositories for train-procgen-pytorch
Users that are interested in train-procgen-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆20Jan 21, 2023Updated 3 years ago
- Humans consulting HCH☆10Sep 23, 2017Updated 8 years ago
- A framework for implementing equivariant DL☆10May 25, 2021Updated 4 years ago
- @ngrok/mantle ui component library | https://develop.mantle.ngrok.com☆13Apr 9, 2026Updated last week
- Reward Learning by Simulating the Past☆46May 9, 2019Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch.☆31Sep 10, 2020Updated 5 years ago
- A library for training crosscoders☆16May 28, 2025Updated 10 months ago
- Implementation of POMDP algorithms on the tiger example, as described in Littman, Cassandra and Kaelbling (1994).☆17Aug 8, 2017Updated 8 years ago
- ☆28Jul 28, 2022Updated 3 years ago
- ☆18Dec 10, 2025Updated 4 months ago
- ☆13Jun 30, 2020Updated 5 years ago
- Runtime library and schema compiler for the Avro serialization format☆21Dec 13, 2021Updated 4 years ago
- A powerful keybind library and daemon for Linux.☆11Jul 24, 2022Updated 3 years ago
- Brutaltester compatible referee for coders strike back☆12Nov 27, 2018Updated 7 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Scala Native 3 bindings for SFML library☆15Jul 9, 2023Updated 2 years ago
- REBUS: A Robust Evaluation Benchmark of Understanding Symbols☆13Aug 13, 2024Updated last year
- Jax implementation of VIT-VQGAN☆10Jan 25, 2024Updated 2 years ago
- Tools for optimizing steering vectors in LLMs.☆21Apr 10, 2025Updated last year
- The AI Arena: A framework for distributed multi-agent reinforcement learning☆14Aug 5, 2022Updated 3 years ago
- Official code for the paper: "Metadata Archaeology"☆19May 10, 2023Updated 2 years ago
- ☆15Mar 31, 2026Updated 2 weeks ago
- ☆17Jul 9, 2025Updated 9 months ago
- This is the source code for solving the Traveling Salesman Problems (TSP) using Monte Carlo tree search (MCTS).☆35Sep 25, 2019Updated 6 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Source codes for "Preference-grounded Token-level Guidance for Language Model Fine-tuning" (NeurIPS 2023).☆17Jan 8, 2025Updated last year
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆23Dec 22, 2020Updated 5 years ago
- Redwood Research's transformer interpretability tools☆15Apr 15, 2022Updated 4 years ago
- Subject of the hackathon 42☆12Nov 9, 2022Updated 3 years ago
- Materials for "Multi-property Steering of Large Language Models with Dynamic Activation Composition"☆14Nov 22, 2024Updated last year
- Adversarial examples to the new ConvNeXt architecture☆20Jan 12, 2022Updated 4 years ago
- Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function☆13Updated this week
- ☆15Aug 9, 2021Updated 4 years ago
- ☆12Oct 24, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆20Jan 14, 2022Updated 4 years ago
- Evaluating LLMs performance in PR reviews as an indicator for their capability in creating PRs.☆13Apr 10, 2024Updated 2 years ago
- A gym interface for AI safety gridworlds created in pycolab.☆18May 12, 2022Updated 3 years ago
- ☆40Jul 4, 2025Updated 9 months ago
- flexible meta-learning in jax☆16Oct 19, 2023Updated 2 years ago
- A simple 2D ball collision engine.☆12Jun 15, 2023Updated 2 years ago
- ☆22Mar 28, 2025Updated last year