Code for the NeurIPS 2021 paper "Deep Bandits Show-Off: Simple and Efficient Exploration with Deep Networkst"
☆14Sep 12, 2022Updated 3 years ago
Alternatives and similar repositories for sau-explore
Users that are interested in sau-explore are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is pytorch implmentation project of Bootsrapped DQN☆13Dec 6, 2020Updated 5 years ago
- Integrate AutoRL into DQN to implement a single traffic signal control system.☆16Nov 16, 2023Updated 2 years ago
- Implementation of NeurIPS 2018 paper "Meta-Gradient Reinforcement Learning"☆21Jul 19, 2022Updated 3 years ago
- Code for Posterior Sampling for Deep Reinforcement Learning, ICML 2023☆28Mar 7, 2024Updated 2 years ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆40Jul 18, 2025Updated 11 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Efficient Exploration through Bayesian Deep Q-Networks☆38Feb 14, 2018Updated 8 years ago
- This Python package is designed for mapping the solution space of machine learning models. An understanding of the organisation of the so…☆23Sep 18, 2025Updated 9 months ago
- Source code for our paper "Pessimistic Decision-Making for Recommender Systems" published at ACM TORS, and RecSys 2021.☆11Dec 15, 2022Updated 3 years ago
- This repository contains the code of the paper Equivariant Q Learning in Spatial Action Spaces☆11Nov 4, 2021Updated 4 years ago
- Implementation of Bayesian PCA [Bishop][1999] And Bayesian Kernel PCA☆13Jan 13, 2021Updated 5 years ago
- PyTorch port and extension of the Deep Bayesian Bandits Library☆43Sep 4, 2019Updated 6 years ago
- Charging robot☆10May 4, 2020Updated 6 years ago
- A C++ ROS package for real-time conversion of 3D motion controller events to ROS messages.☆12Feb 13, 2024Updated 2 years ago
- Audit competition repository for Euro-Dollar (0xa4ccd3b6daa763f729ad59eae75f9cbff7baf2cd)☆28Dec 7, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- implementation of dualformer☆25Mar 1, 2025Updated last year
- Study NeuralUCB and regret analysis for contextual bandit with neural decision☆103Dec 14, 2021Updated 4 years ago
- Liquid Collective protocol smart contracts☆23Updated this week
- In-Progress implementation of the 2021 ICML Paper " Differentiable Spatial Planning using Transformers "☆17Apr 21, 2022Updated 4 years ago
- Benchmark library for high-dimensional HPO of black-box models based on Weighted Lasso regression☆15Feb 15, 2026Updated 4 months ago
- Python implementation of REMBO built on GPyTorch.☆18Jul 11, 2020Updated 5 years ago
- Visualization tool for CUB-200-2011 part keypoints (Wah et al.).☆10Sep 17, 2021Updated 4 years ago
- The Multiverse Parser module provides seamless conversion between different scene description formats, using USD (Universal Scene Descrip…☆34Nov 20, 2025Updated 7 months ago
- Source code for paper Mroueh, Sercu, Rigotti, Padhi, dos Santos, "Sobolev Independence Criterion", NeurIPS 2019☆14Jun 17, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Cooperative Control of Traffic Signals and Connected Vehicles: A Multi-agent Deep Reinforcement Learning Approach☆24Jul 15, 2021Updated 4 years ago
- ☆20Jan 30, 2023Updated 3 years ago
- Transfer Learning in Reinforcement Learning using Stable-Baseline3 | Transfer Reinforcement Learning for Differing Action Spaces via Q-Ne…☆22Feb 27, 2022Updated 4 years ago
- Code for training and testing a Hidden Parameter Markov Decision Process, used to facilitate the transfer of learning☆29Dec 28, 2017Updated 8 years ago
- A python implementation of PROCLUS: PROjected CLUStering algorithm.☆10Jan 12, 2015Updated 11 years ago
- Recommendation engine and it's algorithms in python , R .☆12Oct 26, 2018Updated 7 years ago
- Code for Equivariant Transporter Network☆23Apr 17, 2023Updated 3 years ago
- ArXiv'18 implementation of amortized maximum likelihood (AML) for high-quality, weakly-supervised shape completion.☆11Nov 30, 2018Updated 7 years ago
- Software library RLCM (recursively low-rank compressed matrices)☆14Apr 15, 2021Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Gitd means “Git for CodexField”, or in other words, “Git for Decentralized Storage”. Gitd is a highly extensible git implementation libra…☆68Jun 22, 2026Updated last week
- A lightweight contextual bandit & reinforcement learning library designed to be used in production Python services.☆71Jun 4, 2021Updated 5 years ago
- Mac port of Torcs, The Open Racing Car Simulator☆11Jun 16, 2010Updated 16 years ago
- Deep-Painterly-Harmonization-Pytorch☆28Jul 11, 2018Updated 7 years ago
- The code for "An online plug-and-play algorithm for regularized image reconstruction", IEEE TCI, 2019.☆10Jan 22, 2020Updated 6 years ago
- ☆15Apr 14, 2025Updated last year
- Hands-On TensorBoard for PyTorch Developers, Published by Packt☆11Dec 15, 2025Updated 6 months ago