Reinforcement Learning paper summaries, notebooks, and articles.
★26 · Apr 16, 2020 · Updated 5 years ago
Alternatives and similar repositories for rl-insights
Users that are interested in rl-insights are comparing it to the libraries listed below.
- ★52 · Aug 6, 2020 · Updated 5 years ago
- N-Layered FeUdal Networks based on FeUdal Networks adapted to suit PySC2 observations · ★19 · Sep 17, 2019 · Updated 6 years ago
- A PyTorch rewrite of the motion-imitation project from google-research · ★10 · Sep 30, 2022 · Updated 3 years ago
- A simple wrapper to analyse and visualise reinforcement learning agents' behaviour in the environment. · ★14 · Jan 8, 2022 · Updated 4 years ago
- A2C is a special case of PPO! · ★22 · May 20, 2022 · Updated 3 years ago
- Generic reinforcement learning codebase in TensorFlow · ★95 · Oct 12, 2021 · Updated 4 years ago
- Chrome extension to remove the "People also search for" element · ★12 · Apr 16, 2022 · Updated 3 years ago
- Reproducing Random Numbers in Matlab and Python / NumPy · ★11 · Dec 6, 2015 · Updated 10 years ago
- Gym implementation of connector to Deepmind lab · ★12 · Mar 26, 2019 · Updated 7 years ago
- A GPU-accelerated fork of stable-baselines. Delivering reliable implementations of reinforcement learning algorithms. · ★25 · Mar 10, 2021 · Updated 5 years ago
- self-studying the Sutton & Barto the hard way · ★205 · Nov 27, 2021 · Updated 4 years ago
- Pytorch code for ens_adv_train · ★17 · Jun 7, 2019 · Updated 6 years ago
- Lecture: Data Compression in Computational Science and Quantum Computing · ★13 · Jan 18, 2023 · Updated 3 years ago
- Decoupled Neural Interfaces Using Synthetic Gradients - under development · ★11 · Jun 27, 2025 · Updated 9 months ago
- ★117 · Apr 28, 2023 · Updated 2 years ago
- Code for the paper "Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under Data Augmentation" · ★19 · Jul 11, 2023 · Updated 2 years ago
- ★10 · Aug 30, 2017 · Updated 8 years ago
- Log to W&B from Julia · ★12 · Jun 13, 2022 · Updated 3 years ago
- Code to accompany our paper "The combination of Hebbian and predictive plasticity learns invariant object representations in deep sensory… · ★31 · Jan 14, 2025 · Updated last year
- ★10 · Aug 17, 2022 · Updated 3 years ago
- ★13 · Jul 22, 2021 · Updated 4 years ago
- ROS wrapper for pedestrian prediction. · ★12 · Feb 25, 2019 · Updated 7 years ago
- On the effectiveness of adversarial training against common corruptions [UAI 2022] · ★30 · May 16, 2022 · Updated 3 years ago
- A collection of utilities for machine learning experiments. · ★11 · Jan 8, 2026 · Updated 3 months ago
- Input device (mouse/keyboard) connector app for Windows · ★13 · Sep 7, 2021 · Updated 4 years ago
- Code for paper "Continual and Multi-Task Architecture Search (ACL 2019)" · ★41 · Jul 8, 2019 · Updated 6 years ago
- A neuromechanical model of adult Drosophila melanogaster. · ★58 · Oct 11, 2024 · Updated last year
- Toy environment set for multi-agent reinforcement learning and more · ★39 · Nov 26, 2024 · Updated last year
- Soft Actor-Critic with advanced features · ★51 · Mar 2, 2026 · Updated last month
- Asymmetric methods for partially observable reinforcement learning · ★10 · Jun 9, 2025 · Updated 10 months ago
- Tensorflow implementation of Synthetic Gradient for RNN (LSTM) · ★39 · Jan 30, 2018 · Updated 8 years ago
- ★11 · Feb 17, 2026 · Updated last month
- Official repository for "Stylized Adversarial Training" (TPAMI 2022) · ★11 · Dec 30, 2022 · Updated 3 years ago
- This is an official implementation of our CVPR 2020 paper "Non-Local Neural Networks With Grouped Bilinear Attentional Transforms". · ★12 · Jan 30, 2021 · Updated 5 years ago
- ★36 · Aug 10, 2018 · Updated 7 years ago
- Multiagent gridworld for the TEAM project based on gym-minigrid · ★12 · Nov 27, 2019 · Updated 6 years ago
- A comprehensive overview of Data Distillation and Condensation (DDC). DDC is a data-centric task where a representative (i.e., small but … · ★13 · Dec 1, 2022 · Updated 3 years ago
- A standalone release of DeepMind Lab's maze generator with Python bindings. · ★67 · Oct 3, 2023 · Updated 2 years ago
- R code needed to reproduce Relationship between Reddit Comment Score and Comment Length for 1.66 Billion Comments visualization · ★17 · Jul 8, 2015 · Updated 10 years ago