Code for the NeurIPS 2021 paper "Deep Bandits Show-Off: Simple and Efficient Exploration with Deep Networks"
☆14Sep 12, 2022Updated 3 years ago
Alternatives and similar repositories for sau-explore
Users that are interested in sau-explore are comparing it to the libraries listed below.
- A PyTorch implementation of Bootstrapped DQN☆13Dec 6, 2020Updated 5 years ago
- Integrate AutoRL into DQN to implement a single traffic signal control system.☆16Nov 16, 2023Updated 2 years ago
- Implementation of NeurIPS 2018 paper "Meta-Gradient Reinforcement Learning"☆22Jul 19, 2022Updated 3 years ago
- This Python package is designed for mapping the solution space of machine learning models. An understanding of the organisation of the so…☆22Sep 18, 2025Updated 8 months ago
- Source code for our paper "Pessimistic Decision-Making for Recommender Systems" published at ACM TORS, and RecSys 2021.☆11Dec 15, 2022Updated 3 years ago
- Implementation of Bayesian PCA [Bishop][1999] And Bayesian Kernel PCA☆13Jan 13, 2021Updated 5 years ago
- PyTorch port and extension of the Deep Bayesian Bandits Library☆43Sep 4, 2019Updated 6 years ago
- Charging robot☆10May 4, 2020Updated 6 years ago
- A C++ ROS package for real-time conversion of 3D motion controller events to ROS messages.☆12Feb 13, 2024Updated 2 years ago
- Study NeuralUCB and regret analysis for contextual bandit with neural decision☆103Dec 14, 2021Updated 4 years ago
- Transfer Learning in Reinforcement Learning using Stable-Baseline3 | Transfer Reinforcement Learning for Differing Action Spaces via Q-Ne…☆22Feb 27, 2022Updated 4 years ago
- A Python implementation of PROCLUS: PROjected CLUStering algorithm.☆10Jan 12, 2015Updated 11 years ago
- Recommendation engine and its algorithms in Python and R.☆12Oct 26, 2018Updated 7 years ago
- ArXiv'18 implementation of amortized maximum likelihood (AML) for high-quality, weakly-supervised shape completion.☆11Nov 30, 2018Updated 7 years ago
- A lightweight contextual bandit & reinforcement learning library designed to be used in production Python services.☆71Jun 4, 2021Updated 4 years ago
- ☆15Apr 14, 2025Updated last year
- ☆18Oct 12, 2022Updated 3 years ago
- ☆11Oct 11, 2023Updated 2 years ago
- Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019.☆10Jan 10, 2019Updated 7 years ago
- Yet Another SDP Solver☆10Dec 19, 2015Updated 10 years ago
- ☆10May 3, 2022Updated 4 years ago
- AscTec quadrotor drivers☆17Aug 22, 2019Updated 6 years ago
- Re-Examining Linear Embeddings for High-dimensional Bayesian Optimization☆43Sep 7, 2021Updated 4 years ago
- An implementation of Deep Q-Learning from Demonstrations (DQfD) for playing Atari 2600 video games☆31Dec 10, 2022Updated 3 years ago
- [AAAI26] Trade-offs in Large Reasoning Models: An Empirical Analysis of Deliberative and Adaptive Reasoning over Foundational Capabilitie…☆10Feb 7, 2026Updated 3 months ago
- An implementation of the Jenkins Traub polynomial root finding algorithm☆14Aug 23, 2015Updated 10 years ago
- Fast interpolative decompositions in Python☆10Jan 4, 2021Updated 5 years ago
- Gomoku AI based on the AlphaZero algorithm☆10Feb 27, 2019Updated 7 years ago
- UCT tree search + supervised learning for Atari games☆12Feb 14, 2017Updated 9 years ago
- Official code for the paper: DRA-GRPO: Exploring Diversity-Aware Reward Adjustment for R1-Zero-Like Training of Large Language Models☆24Jan 6, 2026Updated 4 months ago
- An attempt to apply reinforcement learning to graph signal recovery problem☆11Aug 25, 2021Updated 4 years ago
- Benchmark functions for Bayesian optimization☆37Mar 12, 2024Updated 2 years ago
- Code for Expert Supervised Reinforcement Learning☆10Apr 7, 2021Updated 5 years ago
- ☆15Sep 7, 2022Updated 3 years ago
- This is a joint implementation of AdaShift optimizer, LGANs, and MaxGP.☆14Oct 7, 2020Updated 5 years ago
- Python wrapper for lean-gym☆13Apr 5, 2023Updated 3 years ago
- ☆26Dec 15, 2025Updated 5 months ago
- E2C implementation in PyTorch☆43Jul 5, 2017Updated 8 years ago
- ☆21Jul 25, 2024Updated last year