Implementation of Russo and Van Roy's work on Information-Directed Sampling (2017)
☆21 · Jan 18, 2019 · Updated 7 years ago
Alternatives and similar repositories for Information_Directed_Sampling
Users that are interested in Information_Directed_Sampling are comparing it to the libraries listed below.
- ☆14 · Jun 10, 2022 · Updated 3 years ago
- Official implementation for the paper: "Shallow Updates for Deep Reinforcement Learning" ☆18 · Nov 2, 2017 · Updated 8 years ago
- ☆11 · May 15, 2020 · Updated 5 years ago
- ☆29 · May 27, 2024 · Updated last year
- INTeractive learning via REPresentatIon Discovery ☆36 · Jun 2, 2024 · Updated last year
- Code for "Best arm identification in multi-armed bandits with delayed feedback", AISTATS 2018. ☆20 · Apr 3, 2018 · Updated 8 years ago
- A ready-to-use Jemdoc-based website for research groups and similar organizations. It also contains a dynamic news/RSS-feed system which … ☆11 · May 5, 2022 · Updated 3 years ago
- Reinforcement Learning implementations and research prototyping in TensorFlow ☆81 · Apr 28, 2019 · Updated 7 years ago
- Demos for the MiniWoB++ benchmark ☆21 · Feb 23, 2018 · Updated 8 years ago
- A Multipath TCP python support library 🐍 ☆10 · Feb 5, 2023 · Updated 3 years ago
- Code for Optimistic Exploration even with a Pessimistic Initialisation ☆14 · Aug 4, 2020 · Updated 5 years ago
- Clockwork VAEs in JAX/Flax ☆32 · Jul 16, 2021 · Updated 4 years ago
- Mechanism Design KU course ☆11 · Mar 5, 2025 · Updated last year
- A public dataset for Chinese chatbots ☆11 · Sep 11, 2019 · Updated 6 years ago
- Code for 'Contrastive Multi-Document Question Generation' ☆11 · Oct 16, 2022 · Updated 3 years ago
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning ☆10 · May 8, 2018 · Updated 7 years ago
- ☆14 · May 23, 2021 · Updated 4 years ago
- Official implementation for the paper "Quantum Bayesian Optimization" accepted to NeurIPS 2023. ☆12 · Jan 7, 2024 · Updated 2 years ago
- Repository for IROS 2019 ☆28 · Nov 16, 2019 · Updated 6 years ago
- 📴 OffCon^3: SOTA PyTorch SAC and TD3 Implementations (arxiv: 2101.11331) ☆25 · Jun 20, 2021 · Updated 4 years ago
- A Simple Active-and-Adaptive Baseline for Cross-Domain 3D Semantic Segmentation ☆13 · Dec 22, 2022 · Updated 3 years ago
- Java framework for experimenting with a 2-D version of the voxel-based soft robots. ☆20 · Mar 31, 2023 · Updated 3 years ago
- training BART from scratch ☆12 · Dec 31, 2021 · Updated 4 years ago
- scripts for evaluation of contextual bandit algorithms ☆46 · Apr 27, 2020 · Updated 6 years ago
- Decoupled Reward-free ExplorAtion and Execution for Meta-reinforcement learning ☆90 · Feb 13, 2023 · Updated 3 years ago
- This is a repo for all links and tools in the security and privacy field which I have found useful!! ☆20 · Aug 16, 2022 · Updated 3 years ago
- ☆13 · Jul 3, 2022 · Updated 3 years ago
- Some hard problems for reinforcement learning. ☆32 · Oct 5, 2018 · Updated 7 years ago
- Recurrent Additive Networks for Tensorflow ☆16 · Jun 30, 2017 · Updated 8 years ago
- 3D point cloud lectern (点云乐课堂) ☆12 · Dec 17, 2019 · Updated 6 years ago
- Recommendation engine and its algorithms in Python and R. ☆12 · Oct 26, 2018 · Updated 7 years ago
- Reward shaping approach for instruction following settings, leveraging language at multiple levels of abstraction. ☆21 · Mar 9, 2021 · Updated 5 years ago
- ☆27 · May 17, 2019 · Updated 6 years ago
- Code for ICLR 2022 Paper (HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning) ☆12 · Nov 28, 2023 · Updated 2 years ago
- ArXiv'18 implementation of amortized maximum likelihood (AML) for high-quality, weakly-supervised shape completion. ☆11 · Nov 30, 2018 · Updated 7 years ago
- For TAMP experiments using Drake ☆13 · Jun 4, 2024 · Updated last year
- A C++/CUDA toolkit for neural machine translation (RNN-Based NMT) across multiple GPUs ☆10 · Oct 17, 2022 · Updated 3 years ago
- Source for the sample efficient tabular RL submission to the 2019 NIPS workshop on Biological and Artificial RL ☆24 · Apr 14, 2022 · Updated 4 years ago
- ☆12 · Nov 28, 2022 · Updated 3 years ago