distillpub/post--understanding-rl-vision

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/distillpub/post--understanding-rl-vision)

distillpub / post--understanding-rl-vision

Understanding RL vision Distill article

☆25

Alternatives and similar repositories for post--understanding-rl-vision

Users that are interested in post--understanding-rl-vision are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

jetnew / SlimeRL
View on GitHub
Code repository for the research project "You Play Ball, I Play Ball: Bayesian Multi-Agent Reinforcement Learning for Slime Volleyball", …
☆17Nov 15, 2020Updated 5 years ago
dspub99 / betazero
View on GitHub
Tabula Rasa Tic-Tac-Toe
☆10Jan 3, 2019Updated 7 years ago
cyoon1729 / distributedRL
View on GitHub
A framework for easy prototyping of distributed reinforcement learning algorithms
☆97Dec 8, 2020Updated 5 years ago
saiboxx / offline-reinforcement-learning
View on GitHub
Exploring algorithms in the domain of offline reinforcement learning (REM, Ensemble-DQN, DQN, ...)
☆17Jul 7, 2020Updated 6 years ago
tmoer / a0c
View on GitHub
Code for the paper Alpha Zero in Continuous Action Space (A0C) (https://arxiv.org/pdf/1805.09613.pdf)
☆15Jan 19, 2021Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
AnirudhDagar / MessagePassing_for_GNNs
View on GitHub
Experiments with Message Passing GNNs in C++ and PyTorch.
☆26Jul 25, 2024Updated 2 years ago
nickfrosst / neural_additive_models
View on GitHub
stand alone Neural Additive Models, forked from google-reasearch for easy import to colab
☆29Sep 29, 2020Updated 5 years ago
jcoreyes / evolvingrl
View on GitHub
Supplementary Data for Evolving Reinforcement Learning Algorithms
☆47Mar 15, 2021Updated 5 years ago
james-simon / eigenlearning
View on GitHub
codebase for "A Theory of the Inductive Bias and Generalization of Kernel Regression and Wide Neural Networks"
☆52May 2, 2023Updated 3 years ago
wataruhashimoto52 / svgd_tf
View on GitHub
Implementation of Stein Variational Gradient Descent with TensorFlow 2.0
☆12Sep 11, 2019Updated 6 years ago
marcbrittain / Prioritized-Sequence-Experience-Replay
View on GitHub
Prioritized Sequence Experience Replay
☆10Aug 16, 2021Updated 4 years ago
maximilianigl / rl-iter
View on GitHub
Repository for Iterated Relearning: The Impact of Non-stationarity on Generalisation in Deep Reinforcement Learning
☆11Jun 8, 2020Updated 6 years ago
toshikwa / rljax
View on GitHub
A collection of RL algorithms written in JAX.
☆106Jul 5, 2022Updated 4 years ago
n2cholas / progan-flax
View on GitHub
Flax (JAX) implementation of Progressive Growing of GANs for Improved Quality, Stability, and Variation
☆12May 24, 2021Updated 5 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
heinrichjh / nfsp-leduc
View on GitHub
Neural Fictitious Self-Play in Leduc Holdem
☆11Jul 4, 2018Updated 8 years ago
wkwan / procgen
View on GitHub
My Submission for the OpenAI/NeurIPS ProcGen Competition
☆11Nov 12, 2020Updated 5 years ago
lili-chen / SEER
View on GitHub
Official codebase for Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings.
☆21Mar 5, 2021Updated 5 years ago
ShangtongZhang / ShangtongZhang.github.io
View on GitHub
My Homepage
☆10Jun 26, 2026Updated last month
attentionagent / attentionagent.github.io
View on GitHub
Web version of “Neuroevolution of Self-Interpretable Agents” (https://arxiv.org/abs/2003.08165)
☆22Jan 12, 2022Updated 4 years ago
NZ99 / transformer_in_transformer_flax
View on GitHub
☆21Mar 14, 2021Updated 5 years ago
Rowing0914 / TF_RL
View on GitHub
Eagerly Experimentable!!!
☆26Jan 16, 2021Updated 5 years ago
HumanCompatibleAI / seals
View on GitHub
Benchmark environments for reward modelling and imitation learning algorithms.
☆47Sep 19, 2023Updated 2 years ago
LukasStruppek / Exploiting-Cultural-Biases-via-Homoglyphs
View on GitHub
[Journal of Artificial Intelligence Research] Source code for our paper "Exploiting Cultural Biases via Homoglyphs in Text-to-Image Synth…
☆12Jan 8, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
dsbrown1331 / CoRL2019-DREX
View on GitHub
Code and project page for D-REX algorithm from the paper "Better-than-Demonstrator Imitation Learning via Automatically-Ranked Demonstrat…
☆51Dec 8, 2022Updated 3 years ago
yidiq7 / MLGeometry
View on GitHub
Machine learning Calabi-Yau metrics
☆25Jan 13, 2026Updated 6 months ago
google-research / dice_rl
View on GitHub
☆114Jul 3, 2026Updated 3 weeks ago
russellmendonca / GMPS
View on GitHub
Guided-Meta Policy Search
☆39Jan 19, 2023Updated 3 years ago
nikolamilosevic86 / FinAnalyzer
View on GitHub
Tool for technical analysis of financial data about companies indexed on the stockmarket using machine learning
☆12Sep 6, 2017Updated 8 years ago
DRL-CASIA / Deep-Reinforcement-Learning
View on GitHub
☆18Jan 4, 2021Updated 5 years ago
jparkerholder / PB2
View on GitHub
Code for the Population-Based Bandits Algorithm, presented at NeurIPS 2020.
☆20Apr 13, 2021Updated 5 years ago
alexlee-gk / slac
View on GitHub
Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model
☆154Oct 26, 2020Updated 5 years ago
henry-prior / multimodal-rl
View on GitHub
Solving reinforcement learning tasks which require language and vision
☆33Apr 4, 2023Updated 3 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
BY571 / SCoRe
View on GitHub
SCoRe: Training Language Models to Self-Correct via Reinforcement Learning
☆16May 14, 2026Updated 2 months ago
zj10 / PGA
View on GitHub
A TensorFlow implementation of perceptual generative autoencoder (PGA).
☆22Nov 2, 2020Updated 5 years ago
StepNeverStop / RLwithUnity
View on GitHub
Reinforcement Leanring Algorithms Trained with Unity
☆13Apr 26, 2019Updated 7 years ago
cpapadimitriou / Click-Through-Rate-prediction
View on GitHub
☆11Jun 15, 2019Updated 7 years ago
toshikwa / slac.pytorch
View on GitHub
PyTorch implementation of Stochastic Latent Actor-Critic(SLAC).
☆94Jul 25, 2024Updated 2 years ago
jimliu741523 / headjackai-sdk
View on GitHub
☆17Sep 23, 2022Updated 3 years ago
openai / train-procgen
View on GitHub
Code for the paper "Leveraging Procedural Generation to Benchmark Reinforcement Learning"
☆182Apr 2, 2023Updated 3 years ago