IBM/sau-explore

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/IBM/sau-explore)

IBM / sau-explore

Code for the NeurIPS 2021 paper "Deep Bandits Show-Off: Simple and Efficient Exploration with Deep Networkst"

☆14

Alternatives and similar repositories for sau-explore

Users that are interested in sau-explore are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

JoungheeKim / bootsrapped-dqn
View on GitHub
This is pytorch implmentation project of Bootsrapped DQN
☆13Dec 6, 2020Updated 5 years ago
Desny / traffic_light_rl
View on GitHub
Integrate AutoRL into DQN to implement a single traffic signal control system.
☆16Nov 16, 2023Updated 2 years ago
IandRover / meta_gradient_RL
View on GitHub
Implementation of NeurIPS 2018 paper "Meta-Gradient Reinforcement Learning"
☆21Jul 19, 2022Updated 4 years ago
remosasso / PSDRL
View on GitHub
Code for Posterior Sampling for Deep Reinforcement Learning, ICML 2023
☆28Mar 7, 2024Updated 2 years ago
montrealrobotics / iv_rl
View on GitHub
IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation
☆40Jul 18, 2025Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
kazizzad / BDQN-MxNet-Gluon
View on GitHub
Efficient Exploration through Bayesian Deep Q-Networks
☆38Feb 14, 2018Updated 8 years ago
olivierjeunen / pessimism-recsys-2021
View on GitHub
Source code for our paper "Pessimistic Decision-Making for Recommender Systems" published at ACM TORS, and RecSys 2021.
☆11Dec 15, 2022Updated 3 years ago
pointW / equi_q_corl21
View on GitHub
This repository contains the code of the paper Equivariant Q Learning in Spatial Action Spaces
☆11Nov 4, 2021Updated 4 years ago
MaxenceGiraud / BayesianPCA
View on GitHub
Implementation of Bayesian PCA [Bishop][1999] And Bayesian Kernel PCA
☆13Jan 13, 2021Updated 5 years ago
andrewk1 / pytorch-deep-bayesian-bandits
View on GitHub
PyTorch port and extension of the Deep Bayesian Bandits Library
☆43Sep 4, 2019Updated 6 years ago
zimougao / chargingbot
View on GitHub
Charging robot
☆10May 4, 2020Updated 6 years ago
arasgungore / spacenav-driver
View on GitHub
A C++ ROS package for real-time conversion of 3D motion controller events to ROS messages.
☆12Feb 13, 2024Updated 2 years ago
facebookresearch / dualformer
View on GitHub
implementation of dualformer
☆25Mar 1, 2025Updated last year
sauxpa / neural_exploration
View on GitHub
Study NeuralUCB and regret analysis for contextual bandit with neural decision
☆103Dec 14, 2021Updated 4 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
IBM / topography-searcher
View on GitHub
This Python package is designed for mapping the solution space of machine learning models. An understanding of the organisation of the so…
☆23Sep 18, 2025Updated 10 months ago
sirmisscriesalot / Differentiable-Spatial-Planning-using-Transformers
View on GitHub
In-Progress implementation of the 2021 ICML Paper " Differentiable Spatial Planning using Transformers "
☆17Apr 21, 2022Updated 4 years ago
dahyun-kang / cub-200-2011-part-visualizer
View on GitHub
Visualization tool for CUB-200-2011 part keypoints (Wah et al.).
☆10Sep 17, 2021Updated 4 years ago
ksehic / LassoBench
View on GitHub
Benchmark library for high-dimensional HPO of black-box models based on Weighted Lasso regression
☆15Feb 15, 2026Updated 5 months ago
rees-c / PyREMBO
View on GitHub
Python implementation of REMBO built on GPyTorch.
☆18Jul 11, 2020Updated 6 years ago
uclaml / NeuralUCB
View on GitHub
☆52Jul 4, 2020Updated 6 years ago
trunghieu-tran / Transfer-Learning-in-Reinforcement-Learning
View on GitHub
Transfer Learning in Reinforcement Learning using Stable-Baseline3 | Transfer Reinforcement Learning for Differing Action Spaces via Q-Ne…
☆21Feb 27, 2022Updated 4 years ago
H-9786 / CVIS-DRL
View on GitHub
Cooperative Control of Traffic Signals and Connected Vehicles: A Multi-agent Deep Reinforcement Learning Approach
☆24Jul 15, 2021Updated 5 years ago
zhangir-azerbayev / MetaMath
View on GitHub
☆11Oct 11, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
IBM / SIC
View on GitHub
Source code for paper Mroueh, Sercu, Rigotti, Padhi, dos Santos, "Sobolev Independence Criterion", NeurIPS 2019
☆14Jun 17, 2024Updated 2 years ago
ColinKohler / helping_hands_rl_envs
View on GitHub
☆20Jan 30, 2023Updated 3 years ago
dtak / hip-mdp-public
View on GitHub
Code for training and testing a Hidden Parameter Markov Decision Process, used to facilitate the transfer of learning
☆29Dec 28, 2017Updated 8 years ago
cmmp / pyproclus
View on GitHub
A python implementation of PROCLUS: PROjected CLUStering algorithm.
☆10Jan 12, 2015Updated 11 years ago
MohamedFawzy / recommendation-engine
View on GitHub
Recommendation engine and it's algorithms in python , R .
☆12Oct 26, 2018Updated 7 years ago
davidstutz / aml-improved-shape-completion
View on GitHub
ArXiv'18 implementation of amortized maximum likelihood (AML) for high-quality, weakly-supervised shape completion.
☆11Nov 30, 2018Updated 7 years ago
HaojHuang / Equivariant-Transporter-Net
View on GitHub
Code for Equivariant Transporter Network
☆23Apr 17, 2023Updated 3 years ago
jiechenjiechen / RLCM
View on GitHub
Software library RLCM (recursively low-rank compressed matrices)
☆14Apr 15, 2021Updated 5 years ago
camillol / MacTorcs
View on GitHub
Mac port of Torcs, The Open Racing Car Simulator
☆11Jun 16, 2010Updated 16 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
banditml / banditml
View on GitHub
A lightweight contextual bandit & reinforcement learning library designed to be used in production Python services.
☆71Jun 4, 2021Updated 5 years ago
Oldpan / Deep-Painterly-Harmonization-Pytorch
View on GitHub
Deep-Painterly-Harmonization-Pytorch
☆28Jul 11, 2018Updated 8 years ago
sunyumark / 2019-TCI-OnlinePnP
View on GitHub
The code for "An online plug-and-play algorithm for regularized image reconstruction", IEEE TCI, 2019.
☆10Jan 22, 2020Updated 6 years ago
PacktPublishing / Hands-On-TensorBoard-for-PyTorch-Developers
View on GitHub
Hands-On TensorBoard for PyTorch Developers, Published by Packt
☆11Dec 15, 2025Updated 7 months ago
amazon-science / unified-ept
View on GitHub
A Unified Efficient Pyramid Transformer for Semantic Segmentation, ICCVW 2021
☆31Oct 11, 2021Updated 4 years ago
JonasGeiping / dataaugs
View on GitHub
☆18Oct 12, 2022Updated 3 years ago
Columbia-NLP-Lab / LionAlignment
View on GitHub
☆12Aug 6, 2024Updated last year