frcs / 4C16-2021
☆11Nov 15, 2021Updated 4 years ago
Alternatives and similar repositories for 4C16-2021
Users that are interested in 4C16-2021 are comparing it to the libraries listed below
- A neural network library written in jax☆13Feb 3, 2025Updated last year
- Centralized cooperative reinforcement learning☆13Jan 8, 2023Updated 3 years ago
- This is a simple automated license plate detector developed in C++ via OpenCV.☆11Sep 26, 2020Updated 5 years ago
- Meta in-context learning for protein fitness prediction☆16Feb 7, 2025Updated last year
- Accompanying repo for the paper - High-speed Autonomous Racing using Trajectory-aided Deep Reinforcement Learning☆17Jan 17, 2024Updated 2 years ago
- Clear single-file JAX implementations of common RL algorithms☆16Sep 5, 2021Updated 4 years ago
- Collection of resources on plasticity loss in deep reinforcement learning☆23Nov 12, 2024Updated last year
- Adaptation of titans-pytorch to llama models on HF☆25Mar 6, 2025Updated 11 months ago
- Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult"☆30Dec 15, 2025Updated 2 months ago
- Parallel hyperparameter tuning with JAX☆39Jul 21, 2025Updated 6 months ago
- Value-Decomposition Networks For Cooperative Multi-Agent Learning☆25Apr 14, 2021Updated 4 years ago
- ☆23Jun 8, 2021Updated 4 years ago
- Our team's submission to the Datathon held at University of Waterloo, May 12 2018.☆22Jun 28, 2018Updated 7 years ago
- PyTorch Implementation of Ape-X (Distributed prioritized experience replay) architecture with DQN learner☆28Sep 5, 2020Updated 5 years ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆37Dec 3, 2023Updated 2 years ago
- PyTorch implementation of QR-DQN: Distributional Reinforcement Learning with Quantile Regression☆29Aug 16, 2020Updated 5 years ago
- Fast and procedurally generated side-scroller-game-like graphical environments (formerly Procgen)☆33Jul 7, 2023Updated 2 years ago
- Library for efficient training and application of Machine Learning Interatomic Potentials (MLIP)☆81Jan 7, 2026Updated last month
- Training framework with a goal to explore the frontier of sample efficiency of small language models☆97Jan 25, 2026Updated 3 weeks ago
- Efficiently Composable Data Augmentation on the GPU with Jax☆42May 16, 2025Updated 9 months ago
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"☆44Jun 14, 2021Updated 4 years ago
- Simple verification experiments codes for multi-agent RL using OpenAI MPE environment☆34Jun 22, 2022Updated 3 years ago
- The Starcraft Multi-Agent challenge lite☆47Sep 13, 2024Updated last year
- Air Hockey Challenge☆42Sep 18, 2025Updated 4 months ago
- DEPRECATED - please visit https://github.com/vwxyzjn/ppo-implementation-details☆46Apr 14, 2022Updated 3 years ago
- Maximum Causal Entropy Inverse Reinforcement Learning☆48Nov 24, 2018Updated 7 years ago
- 🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX☆61Oct 23, 2023Updated 2 years ago
- Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some e…☆54Nov 10, 2025Updated 3 months ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆59Aug 4, 2022Updated 3 years ago
- A project that provides help for using DeepMind's mctx on gym-style environments.☆64Nov 14, 2024Updated last year
- Jax/Flax rewrite of Karpathy's nanoGPT☆63Feb 15, 2023Updated 3 years ago
- Code for "SimbaV2: Hyperspherical Normalization for Scalable Deep Reinforcement Learning"☆91Nov 4, 2025Updated 3 months ago
- Discrete soft Q-learning (SQL) and soft Q imitation learning (SQIL) implementation in PyTorch, simple!☆58Oct 18, 2022Updated 3 years ago
- A simple PyTorch implementation of Population Based Training of Neural Networks.☆64Mar 14, 2019Updated 6 years ago
- Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment.☆77Mar 25, 2022Updated 3 years ago
- A low-latency C++ generator inspired by Fix8, creating encoder, decoder, and message classes from a custom YAML schema for the Binary Ord…☆107Jun 8, 2025Updated 8 months ago
- Revisiting Rainbow☆75Jun 9, 2021Updated 4 years ago
- A projection-based framework for gradient-free and parallel learning☆110Jun 20, 2025Updated 7 months ago
- QuaRL is an open-source framework for systematically studying the effect of applying quantization to reinforcement learning algorithms.☆79Mar 24, 2023Updated 2 years ago