Code for the NeurIPS 2021 paper "Deep Bandits Show-Off: Simple and Efficient Exploration with Deep Networkst"
☆14Sep 12, 2022Updated 3 years ago
Alternatives and similar repositories for sau-explore
Users that are interested in sau-explore are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is pytorch implmentation project of Bootsrapped DQN☆13Dec 6, 2020Updated 5 years ago
- Integrate AutoRL into DQN to implement a single traffic signal control system.☆16Nov 16, 2023Updated 2 years ago
- Implementation of NeurIPS 2018 paper "Meta-Gradient Reinforcement Learning"☆22Jul 19, 2022Updated 3 years ago
- implementation of dualformer☆25Mar 1, 2025Updated last year
- Code for training and testing a Hidden Parameter Markov Decision Process, used to facilitate the transfer of learning☆29Dec 28, 2017Updated 8 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A python implementation of PROCLUS: PROjected CLUStering algorithm.☆10Jan 12, 2015Updated 11 years ago
- Recommendation engine and it's algorithms in python , R .☆12Oct 26, 2018Updated 7 years ago
- ArXiv'18 implementation of amortized maximum likelihood (AML) for high-quality, weakly-supervised shape completion.☆11Nov 30, 2018Updated 7 years ago
- Software library RLCM (recursively low-rank compressed matrices)☆14Apr 15, 2021Updated 5 years ago
- Mac port of Torcs, The Open Racing Car Simulator☆11Jun 16, 2010Updated 15 years ago
- ☆14Apr 14, 2025Updated last year
- Hands-On TensorBoard for PyTorch Developers, Published by Packt☆11Dec 15, 2025Updated 4 months ago
- ☆18Oct 12, 2022Updated 3 years ago
- ☆11Oct 11, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Create a basic Photoshop script☆12Mar 1, 2021Updated 5 years ago
- A State-Space Model with Rational Transfer Function Representation.☆83May 17, 2024Updated last year
- Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019.☆10Jan 10, 2019Updated 7 years ago
- ☆12Aug 6, 2024Updated last year
- Punch Out Model Synthesis - a program for constraint based tiling generation☆19Feb 1, 2026Updated 3 months ago
- Yet Another SDP Solver☆10Dec 19, 2015Updated 10 years ago
- ☆10May 3, 2022Updated 3 years ago
- AscTec quadrotor drivers☆17Aug 22, 2019Updated 6 years ago
- Re-Examining Linear Embeddings for High-dimensional Bayesian Optimization☆43Sep 7, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆17Jul 6, 2023Updated 2 years ago
- ☆14Jul 15, 2016Updated 9 years ago
- [AAAI26] Trade-offs in Large Reasoning Models: An Empirical Analysis of Deliberative and Adaptive Reasoning over Foundational Capabilitie…☆10Feb 7, 2026Updated 2 months ago
- An implementation of the Jenkins Traub polynomial root finding algorithm☆14Aug 23, 2015Updated 10 years ago
- Fast interpolative decompositions in Python☆10Jan 4, 2021Updated 5 years ago
- Gomoku AI based AlphaZero Algorithm☆10Feb 27, 2019Updated 7 years ago
- Official code for the paper: DRA-GRPO: Exploring Diversity-Aware Reward Adjustment for R1-Zero-Like Training of Large Language Models☆24Jan 6, 2026Updated 3 months ago
- Benchmark functions for Bayesian optimization☆37Mar 12, 2024Updated 2 years ago
- Code for Expert Supervised Reinforcement Learning☆10Apr 7, 2021Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Debate interface, experiments, etc.☆10Mar 12, 2024Updated 2 years ago
- GPU-accelerated first-order low-rank SDP solver☆13Mar 17, 2025Updated last year
- This is a joint implementation of AdaShift optimizer, LGANs, and MaxGP.☆14Oct 7, 2020Updated 5 years ago
- Solving POMDP using Recurrent networks☆93Jun 9, 2020Updated 5 years ago
- ☆25Dec 15, 2025Updated 4 months ago
- Code for "Message Scheduling for Performant, Many-Core Belief Propagation"☆12Oct 25, 2019Updated 6 years ago
- ☆21Jul 25, 2024Updated last year