ambujtewari/stats701-winter2021

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ambujtewari/stats701-winter2021)

ambujtewari / stats701-winter2021

Theory of Reinforcement Learning

☆18

Alternatives and similar repositories for stats701-winter2021

Users that are interested in stats701-winter2021 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

tianxusky / Code-for-Error-Bounds-of-Imitating-Policies-and-Environments
View on GitHub
☆10Oct 15, 2020Updated 5 years ago
ziyadsheeba / qfat
View on GitHub
[NeurIPS 2025, Spotlight] An official implementation of the paper Quantization-Free Autoregressive Action Transformer
☆12Mar 3, 2026Updated 4 months ago
earth2observe / downscaling-tools
View on GitHub
python programs and procedures that facilitate local application of the earth2observe global water resources reanalysis
☆10Nov 21, 2017Updated 8 years ago
LAMDA-RL / OfflineRL-Lib
View on GitHub
Benchmarked implementations of Offline RL Algorithms.
☆77Mar 4, 2025Updated last year
zjuchenwei / vector-line-quantization
View on GitHub
☆13Mar 27, 2019Updated 7 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
dhruvramani / data-driven-robotics
View on GitHub
A boilerplate (dbs, envs, teleop, models, web-apps) for robotic learning experiments & a Pytorch Implementation of "Learning Latent Plans…
☆11Oct 23, 2020Updated 5 years ago
thethaibinh / agile_flight
View on GitHub
Simulation system for path planning evaluation
☆13Dec 13, 2025Updated 7 months ago
GoldAndRabbit / gold-deep-rank
View on GitHub
Deep neural network codes for ctr/cvr prediction task in ranking process implemented by Tensorflow (1.14/2.4.1 version), using tf.estimat…
☆11Apr 21, 2021Updated 5 years ago
shivakanthsujit / reducible-loss
View on GitHub
Codebase for Prioritizing samples in Reinforcement Learning with Reducible Loss
☆12Oct 10, 2022Updated 3 years ago
yevvonlim / kai-presentation
View on GitHub
Claude Code skill for KAI presentation design in HTML
☆16Mar 20, 2026Updated 4 months ago
snu-mllab / DCPG
View on GitHub
Official PyTorch implementation of "Rethinking Value Function Learning for Generalization in Reinforcement Learning" (NeurIPS 2022)
☆15Feb 20, 2023Updated 3 years ago
dennisant / Reach-Avoid-Games
View on GitHub
☆10Dec 6, 2022Updated 3 years ago
NatLabRockies / InSPIRE
View on GitHub
Tutorials, scripts and other modeling aspects of agrivoltaics developed by the InSPIRE team
☆15Updated this week
ZibinDong / cocos
View on GitHub
Official implementation of the paper "Conditioning Matters: Training Diffusion Policies is Faster Than You Think".
☆18May 19, 2025Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
4Catalyzer / cyclegan
View on GitHub
☆13Mar 29, 2019Updated 7 years ago
gemcollector / PIE-G
View on GitHub
This is the repo of NeurIPS 2022 paper: "Pre-Trained Image Encoder for Generalizable Visual Reinforcement Learning"
☆16Sep 21, 2023Updated 2 years ago
openrlbenchmark / openrlbenchmark
View on GitHub
☆268Mar 11, 2026Updated 4 months ago
zisikons / deep-rl
View on GitHub
Deep Learning (FS 2020)
☆17Oct 10, 2022Updated 3 years ago
hercky / ACER_tf
View on GitHub
Implementation for ACER in tensorflow and sonnet by deepmind
☆11Aug 28, 2017Updated 8 years ago
luk036 / ellpy
View on GitHub
ellipsoid method python code
☆12Feb 12, 2024Updated 2 years ago
AurelianTactics / bcq_tensorflow
View on GitHub
☆15May 24, 2021Updated 5 years ago
kywch / brax-trainer
View on GitHub
Brax + Pufferlib + CARBS for gpu-accelerated robotics RL
☆12Jun 12, 2025Updated last year
microsoft / MAMBA
View on GitHub
Imitation learning from multiple experts
☆13Aug 29, 2022Updated 3 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
holarissun / RewardModelingBeyondBradleyTerry
View on GitHub
official implementation of ICLR'2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and…
☆73Apr 2, 2025Updated last year
cair / rl
View on GitHub
☆13Sep 15, 2021Updated 4 years ago
yixiaoer / mistral-jax
View on GitHub
JAX implementation of the Mistral 7b v0.1 model
☆13Mar 27, 2024Updated 2 years ago
apexrl / EBIL-torch
View on GitHub
Pytorch Implementation of AAMAS 2021 paper <Energy-Based Imitation Learning>
☆12Oct 8, 2021Updated 4 years ago
kora-labs / cyclesgym
View on GitHub
CyclesGym: an OpenAI gym interface to the Cycles agricultural simulator
☆18Aug 10, 2022Updated 3 years ago
typoverflow / flow-rl
View on GitHub
Flow RL is a high-performance RL library with flow and diffusion models.
☆42Updated this week
jsw7460 / sb3_jax
View on GitHub
☆13Aug 9, 2022Updated 3 years ago
kristychoi / pixel_exploration
View on GitHub
PyTorch implementation of Count-Based Exploration with Neural Density Models
☆10Mar 22, 2018Updated 8 years ago
tonyling / skb-solver
View on GitHub
sokoban solver
☆10Feb 6, 2014Updated 12 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
billtubbs / gym-CartPole-bt-v0
View on GitHub
A modified version of the cart-pole OpenAI Gym environment for testing different control policies
☆13May 4, 2026Updated 2 months ago
0xWelt / BibTeX-Formatter
View on GitHub
Format your bibtex (.bib) file to help standardize citations for conference and journal submissions
☆14Nov 23, 2025Updated 8 months ago
dojeon-ai / DraftRec
View on GitHub
Code for the paper "DraftRec: Personalized Draft Recommendation for Winning in Multi-Player Online Battle Arena Games" (WWW 2022)
☆18Aug 11, 2023Updated 2 years ago
blei-lab / context-selection-embedding
View on GitHub
Context Selection for Embedding Models
☆26Nov 2, 2017Updated 8 years ago
nakamotoo / Cal-QL
View on GitHub
official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning (NeurIPS 2023)
☆124Jul 31, 2024Updated last year
sail-sg / PatchAIL
View on GitHub
Implementation of PatchAIL in the ICLR 2023 paper <Visual Imitation with Patch Rewards>
☆14Feb 15, 2023Updated 3 years ago
sparisi / pvr_habitat
View on GitHub
Pre-Trained Visual Representations for Control
☆21May 26, 2022Updated 4 years ago