alxthm/rl-cheatsheet

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/alxthm/rl-cheatsheet)

alxthm / rl-cheatsheet

A summary of important concepts and algorithms in RL

☆45

Alternatives and similar repositories for rl-cheatsheet

Users that are interested in rl-cheatsheet are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Luodian / GenBench
View on GitHub
Benchmarking and Analyzing Generative Data for Visual Recognition
☆26Jul 25, 2023Updated 2 years ago
desy-ml / rl-vs-bo
View on GitHub
Code for the paper "Learning to Do or Learning While Doing: Reinforcement Learning and Bayesian Optimisation for Online Continuous Tuning…
☆14Nov 15, 2023Updated 2 years ago
Computer-Vision-in-the-Wild / DataDownload
View on GitHub
☆27Aug 28, 2023Updated 2 years ago
hitgavin / simmechanics
View on GitHub
Matlab Multibody module for machine simulation
☆15Mar 2, 2022Updated 4 years ago
TerminologyHub / termhub-in-5-minutes
View on GitHub
Developer project for getting basic API integrations working in under 5 minutes
☆11May 22, 2026Updated 2 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
dimitreOliveira / cryptogans
View on GitHub
TensorFlow implementation of a DCGAN to generate CryptoPunks + Gradio and Streamlit apps
☆11Oct 21, 2023Updated 2 years ago
rmst / rlrd
View on GitHub
PyTorch implementation of our paper Reinforcement Learning with Random Delays (ICLR 2020)
☆43May 25, 2022Updated 4 years ago
peterdavidfagan / mujoco_robot_environments
View on GitHub
Prototyping mujoco simulation environments.
☆11Feb 20, 2025Updated last year
TraceElephant / TraceElephant
View on GitHub
Repo of "Seeing the Whole Elephant: A Benchmark for Failure Attribution in LLM-based Multi-Agent Systems" (ACL 2026)
☆16Apr 27, 2026Updated 2 months ago
kennymckormick / ARAS-Dataset
View on GitHub
☆11Nov 5, 2024Updated last year
nzw0301 / pb-contrastive
View on GitHub
#UAI2020 Codes for PAC-Bayesian Contrastive Unsupervised Representation Learning
☆14May 23, 2022Updated 4 years ago
Guiraffo / ProVANT-Simulator
View on GitHub
☆15Jun 19, 2025Updated last year
bguedj / pyrotor
View on GitHub
☆14Jun 7, 2023Updated 3 years ago
msangnier / qreg
View on GitHub
Data sparse and non-parametric quantile regression
☆10Jun 10, 2021Updated 5 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
mayubo2333 / fewshot_ED
View on GitHub
ACL'2023: Few-shot Event Detection: An Empirical Study and a Unified View
☆11Mar 13, 2024Updated 2 years ago
qpsolvers / free_for_all_qpbenchmark
View on GitHub
Community-built test set to benchmark QP solvers
☆16May 7, 2025Updated last year
IST-DASLab / sparse-imagenet-transfer
View on GitHub
Code for reproducing the results in "How Well do Sparse Imagenet Models Transfer?", presented at CVPR 2022
☆10Jun 3, 2022Updated 4 years ago
KaiyangZhou / on-device-dg
View on GitHub
On-Device Domain Generalization
☆47Nov 9, 2022Updated 3 years ago
alexandercbooth / nblint
View on GitHub
A simple CLI tool to lint to Jupyter notebooks
☆16Feb 2, 2017Updated 9 years ago
rtaori / data_feedback
View on GitHub
Code for the paper "Data Feedback Loops: Model-driven Amplification of Dataset Biases"
☆18Sep 9, 2022Updated 3 years ago
kylehkhsu / tripod
View on GitHub
☆12Apr 19, 2024Updated 2 years ago
thowell / rs
View on GitHub
A simple JAX-based implementation of random search for locomotion tasks using MuJoCo XLA (MJX).
☆13Jul 18, 2024Updated 2 years ago
KBaur / FiltFilt
View on GitHub
Digital filter implementation
☆20May 1, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
erictzeng / ssa-segmentation-release
View on GitHub
☆12Sep 29, 2019Updated 6 years ago
StephanLorenzen / MajorityVoteBounds
View on GitHub
A framework for majority vote classifiers allowing for computation of PAC Bayesian risk bounds.
☆13Feb 9, 2023Updated 3 years ago
kiaia / GIRAFFE
View on GitHub
Extending context length of visual language models
☆12Dec 18, 2024Updated last year
takahashihiroshi / vae-iop
View on GitHub
pytorch implementation for "Variational Autoencoder with Implicit Optimal Priors".
☆11Oct 12, 2020Updated 5 years ago
ashish01 / CollinsTagger
View on GitHub
Implementation of Collin's perceptron for structured prediction
☆16Mar 10, 2025Updated last year
Espere-1119-Song / Video-MMLU
View on GitHub
A Massive Multi-Discipline Lecture Understanding Benchmark
☆34Apr 20, 2026Updated 3 months ago
numericalEFT / GreenFunc.jl
View on GitHub
Toolbox to study quantum many-body problem at the treelevel
☆15Dec 31, 2025Updated 6 months ago
UKPLab / iclr2024-model-merging
View on GitHub
This is the repository for "Model Merging by Uncertainty-Based Gradient Matching", ICLR 2024.
☆31May 15, 2024Updated 2 years ago
xiezheng-cs / DTQ
View on GitHub
PyTorch implementation of "Deep Transferring Quantization" (ECCV2020)
☆18Jun 22, 2022Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Luodian / nano-hevc
View on GitHub
A minimal, educational HEVC (H.265) encoder written in Python.
☆53Feb 23, 2026Updated 5 months ago
Gepetto / h1v2-Isaac
View on GitHub
☆15Updated this week
cran / GOplot
View on GitHub
This is a read-only mirror of the CRAN R package repository. GOplot — Visualization of Functional Analysis Data. Homepage: https://gith…
☆15Mar 30, 2016Updated 10 years ago
google-research / imagenet-mistakes
View on GitHub
☆18May 25, 2022Updated 4 years ago
Presentador / presentador.app
View on GitHub
The opinionated presentation app.
☆20May 16, 2021Updated 5 years ago
MAmmoTH-VL / MAmmoTH-VL
View on GitHub
(ACL 2025) MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale
☆50Jun 4, 2025Updated last year
xvjiarui / IMProv
View on GitHub
IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks
☆57Sep 26, 2024Updated last year