Code for the NeurIPS 2021 paper "Deep Bandits Show-Off: Simple and Efficient Exploration with Deep Networkst"
☆14Sep 12, 2022Updated 3 years ago
Alternatives and similar repositories for sau-explore
Users that are interested in sau-explore are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for Posterior Sampling for Deep Reinforcement Learning, ICML 2023☆28Mar 7, 2024Updated 2 years ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆40Jul 18, 2025Updated 8 months ago
- PyTorch port and extension of the Deep Bayesian Bandits Library☆43Sep 4, 2019Updated 6 years ago
- A C++ ROS package for real-time conversion of 3D motion controller events to ROS messages.☆12Feb 13, 2024Updated 2 years ago
- implementation of dualformer☆25Mar 1, 2025Updated last year
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Study NeuralUCB and regret analysis for contextual bandit with neural decision☆101Dec 14, 2021Updated 4 years ago
- Benchmark library for high-dimensional HPO of black-box models based on Weighted Lasso regression☆15Feb 15, 2026Updated last month
- ☆20Jan 30, 2023Updated 3 years ago
- Recommendation engine and it's algorithms in python , R .☆12Oct 26, 2018Updated 7 years ago
- Software library RLCM (recursively low-rank compressed matrices)☆14Apr 15, 2021Updated 4 years ago
- A lightweight contextual bandit & reinforcement learning library designed to be used in production Python services.☆71Jun 4, 2021Updated 4 years ago
- Mac port of Torcs, The Open Racing Car Simulator☆11Jun 16, 2010Updated 15 years ago
- Hands-On TensorBoard for PyTorch Developers, Published by Packt☆11Dec 15, 2025Updated 3 months ago
- Experiment for General E(2)-Equivariant Steerable CNNs☆30Feb 26, 2021Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆11Oct 11, 2023Updated 2 years ago
- Create a basic Photoshop script☆12Mar 1, 2021Updated 5 years ago
- Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019.☆10Jan 10, 2019Updated 7 years ago
- A State-Space Model with Rational Transfer Function Representation.☆83May 17, 2024Updated last year
- ☆12Aug 6, 2024Updated last year
- ☆10May 3, 2022Updated 3 years ago
- ☆14Jul 15, 2016Updated 9 years ago
- An implementation of the Jenkins Traub polynomial root finding algorithm☆14Aug 23, 2015Updated 10 years ago
- [AAAI26] Trade-offs in Large Reasoning Models: An Empirical Analysis of Deliberative and Adaptive Reasoning over Foundational Capabilitie…☆10Feb 7, 2026Updated 2 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- uct tree search + supervised lerning for atari games☆12Feb 14, 2017Updated 9 years ago
- Official code for the paper: DRA-GRPO: Exploring Diversity-Aware Reward Adjustment for R1-Zero-Like Training of Large Language Models☆24Jan 6, 2026Updated 3 months ago
- ActiveHARNet: Towards On-Device Deep Bayesian Active Learning for Human Activity Recognition☆16Nov 7, 2020Updated 5 years ago
- An attempt to apply reinforcement learning to graph signal recovery problem☆11Aug 25, 2021Updated 4 years ago
- hmglib - Hierarchical matrices on GPU(s) library☆13Jul 31, 2018Updated 7 years ago
- Benchmark functions for Bayesian optimization☆37Mar 12, 2024Updated 2 years ago
- Code for Expert Supervised Reinforcement Learning☆10Apr 7, 2021Updated 5 years ago
- GPU-accelerated first-order low-rank SDP solver☆13Mar 17, 2025Updated last year
- This is a joint implementation of AdaShift optimizer, LGANs, and MaxGP.☆14Oct 7, 2020Updated 5 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- PyTorch implementation of the estimator proposed in the paper "Estimating Differential Entropy under Gaussian Convolutions"☆13Oct 22, 2020Updated 5 years ago
- We define and estimate smooth unique information of samples with respect to classifier weights and predictions. We compute these quantiti…☆11Mar 9, 2021Updated 5 years ago
- Interpretability dashboard for reinforcement learners☆16Jun 4, 2019Updated 6 years ago
- Porting of fast.ai python notebooks classes for Google Colab platform☆16Jan 16, 2019Updated 7 years ago
- Connect6 (Korean: 육목) for Python.☆11May 15, 2017Updated 8 years ago
- Simulated Annealing for MAX-CUT problems on {+1,-1}-weighted complete graphs☆13Feb 2, 2019Updated 7 years ago
- This repository contains code for the paper "Better Estimation of the KL Divergence Between Language Models"☆19May 30, 2025Updated 10 months ago