A summary of important concepts and algorithms in RL
☆44Apr 1, 2022Updated 4 years ago
Alternatives and similar repositories for rl-cheatsheet
Users that are interested in rl-cheatsheet are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Developer project for getting basic API integrations working in under 5 minutes☆11Jan 30, 2026Updated 2 months ago
- Implementation of AlphaZero in PyTorch.☆10Apr 19, 2019Updated 6 years ago
- Prototyping mujoco simulation environments.☆11Feb 20, 2025Updated last year
- This repository provides a set of reinforcement learning tasks for Booster robots using Isaac Lab.☆35Apr 2, 2026Updated last week
- ☆11Nov 5, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Community-built test set to benchmark QP solvers☆15May 7, 2025Updated 11 months ago
- #UAI2020 Codes for PAC-Bayesian Contrastive Unsupervised Representation Learning☆14May 23, 2022Updated 3 years ago
- ☆14Jun 7, 2023Updated 2 years ago
- This repository contains the Adverbs in Recipes (AIR) dataset and the code published at the CVPR 23 paper: "Learning Action Changes by Me…☆13May 25, 2023Updated 2 years ago
- LLM_Assisted_Preference_Prediction☆16Apr 13, 2025Updated 11 months ago
- ACL'2023: Few-shot Event Detection: An Empirical Study and a Unified View☆11Mar 13, 2024Updated 2 years ago
- On-Device Domain Generalization☆46Nov 9, 2022Updated 3 years ago
- A simple CLI tool to lint to Jupyter notebooks☆16Feb 2, 2017Updated 9 years ago
- A simple JAX-based implementation of random search for locomotion tasks using MuJoCo XLA (MJX).☆13Jul 18, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆26Jun 14, 2022Updated 3 years ago
- ☆12Apr 19, 2024Updated last year
- ☆10Apr 2, 2024Updated 2 years ago
- A framework for majority vote classifiers allowing for computation of PAC Bayesian risk bounds.☆13Feb 9, 2023Updated 3 years ago
- Syphus: Automatic Instruction-Response Generation Pipeline☆14Dec 14, 2023Updated 2 years ago
- [NeurIPS2023] Official implementation of the paper "Large Language Models are Visual Reasoning Coordinators"☆106Nov 9, 2023Updated 2 years ago
- A Massive Multi-Discipline Lecture Understanding Benchmark☆34Nov 1, 2025Updated 5 months ago
- Extending context length of visual language models☆12Dec 18, 2024Updated last year
- RL training for quadruped robot(mit minicheetah) various gaits in different velocity based on MPC controller.☆22Jul 11, 2022Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- PyTorch implementation of "Deep Transferring Quantization" (ECCV2020)☆18Jun 22, 2022Updated 3 years ago
- This is the repository for "Model Merging by Uncertainty-Based Gradient Matching", ICLR 2024.☆31May 15, 2024Updated last year
- ☆21May 7, 2024Updated last year
- ☆34Dec 8, 2022Updated 3 years ago
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks☆57Sep 26, 2024Updated last year
- An implementation of Short Horizon Actor Critic writen in Jax. Core algorithm written in the style of Brax, with several bits taken from …☆22Nov 4, 2024Updated last year
- This is a read-only mirror of the CRAN R package repository. GOplot — Visualization of Functional Analysis Data. Homepage: https://gith…☆15Mar 30, 2016Updated 10 years ago
- The opinionated presentation app.☆20May 16, 2021Updated 4 years ago
- Reinforcement Learning quadruped locomotion for a Unitree Go2 robot in Mujoco XLA (MJX)☆21Jan 13, 2026Updated 2 months ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- ☆52Updated this week
- Learning Representations that Support Robust Transfer of Predictors☆20Nov 7, 2021Updated 4 years ago
- [ICLR 2025] Code for the PopulationTransformer☆15Oct 16, 2025Updated 5 months ago
- Active inference implementation of dynamic multi-armed bandits☆20Jun 25, 2025Updated 9 months ago
- Corresponding source code for the study "Real-time Synthesis of Imagined Speech Processes from Minimally Invasive Recordings of Neural Ac…☆11Jul 30, 2021Updated 4 years ago
- Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?"☆65Oct 19, 2024Updated last year
- Toolkit for Elevater Benchmark☆77Oct 17, 2023Updated 2 years ago