Blog post: how to do deterministic policy gradient with gumbel softmax and why you should do it.
☆12Jun 20, 2017Updated 8 years ago
Alternatives and similar repositories for gumbel_dpg
Users that are interested in gumbel_dpg are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Variable-order CRFs with structure learning☆17Aug 1, 2024Updated last year
- Experiments with binary LSTM using gumbel-sigmoid☆32May 28, 2020Updated 5 years ago
- UCLA LaTeX Thesis Template☆17Jun 13, 2017Updated 8 years ago
- Pytorch Implementation of AAMAS 2021 paper <Energy-Based Imitation Learning>☆11Oct 8, 2021Updated 4 years ago
- Comprehensive Information Integration Modeling Framework for Video Titling☆11Aug 27, 2020Updated 5 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆12Dec 9, 2018Updated 7 years ago
- My PhD thesis, titled "Reasonably Programmable Syntax"☆15Aug 28, 2018Updated 7 years ago
- Partial implementation of ODE-GAN technique from the paper Training Generative Adversarial Networks by Solving Ordinary Differential Equa…☆16Nov 12, 2020Updated 5 years ago
- Implementation/experiments for L4DC 2020 submission "Optimal Cost Design for Model Predictive Control"☆12Apr 23, 2021Updated 5 years ago
- ☆13Jul 2, 2020Updated 5 years ago
- APAR: LLMs Can Do Auto-Parallel Auto-Regressive Decoding☆14Jul 22, 2024Updated last year
- [AAAI-23] Improving Pareto Front Learning via Multi-Sample Hypernetworks☆10Aug 22, 2024Updated last year
- Receiver operating characteristic curve (ROC) computation code in C++☆11Jul 17, 2017Updated 8 years ago
- learning to play atari games with reinforcement learning☆10Jan 4, 2016Updated 10 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Libp2p bindings for Python☆12Mar 21, 2026Updated last month
- Collaborative inference of latent diffusion via hivemind☆12May 29, 2023Updated 2 years ago
- ☆15Sep 15, 2022Updated 3 years ago
- ☆18Dec 6, 2021Updated 4 years ago
- uct tree search + supervised lerning for atari games☆12Feb 14, 2017Updated 9 years ago
- Contains an implementation of "Imitation Learning via Kernel Mean Embedding (2018, AAAI)"☆11Oct 2, 2018Updated 7 years ago
- Distributed Feedback-Looped Networks☆10Jan 15, 2020Updated 6 years ago
- Memory-efficient transformer. Work in progress.☆19Sep 17, 2022Updated 3 years ago
- Pytorch implementation of XGNN☆10Jan 20, 2021Updated 5 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [NeurIPS 2021 | AIJ 2024] Multi-Objective Meta Learning☆17Jul 31, 2024Updated last year
- Pytorch implementation of Adaptative Dropout a.ka Standout.☆12Feb 22, 2018Updated 8 years ago
- Open AI Gym version of Berkeley AI Pacman with images as states☆13May 4, 2018Updated 8 years ago
- A badge for join telegram chat room or channel.☆15Jan 9, 2016Updated 10 years ago
- Official repository of the paper "Understanding the decisions of CNNs: an in-model approach"☆10Sep 7, 2021Updated 4 years ago
- ☆10Jun 14, 2025Updated 10 months ago
- Minimax Optimization, Stackelberg Games, Generative Adversarial Networks☆19Feb 14, 2020Updated 6 years ago
- Official python implementation of R3-Transformer☆15Nov 30, 2020Updated 5 years ago
- Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm☆44Nov 15, 2018Updated 7 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Supporting models and data to doi 10.1021/acs.jcim.1c01163☆15Oct 11, 2022Updated 3 years ago
- Implementation of the paper titled: "FACE: Feasible and actionable counterfactual recourse" by Rafael et. at. - https://arxiv.org/pdf/190…☆14Dec 12, 2020Updated 5 years ago
- pytorch implementation of VAE-Gumble-Softmax☆63Jul 6, 2020Updated 5 years ago
- ☆52Jul 3, 2021Updated 4 years ago
- Implement Categorical Variational autoencoder using Pytorch☆15Apr 25, 2018Updated 8 years ago
- Random parameter environments using gym 0.7.4 and mujoco-py 0.5.7☆20Feb 14, 2019Updated 7 years ago
- ☆25Apr 21, 2021Updated 5 years ago