Blog post: how to do deterministic policy gradient with Gumbel-softmax and why you should do it.
☆12Jun 20, 2017Updated 8 years ago
Alternatives and similar repositories for gumbel_dpg
Users that are interested in gumbel_dpg are comparing it to the libraries listed below.
- Reference implementation of algorithms for reinforcement learning and Markov decision processes.☆12Jan 28, 2021Updated 5 years ago
- ☆13Jul 2, 2025Updated 9 months ago
- Experiments with binary LSTM using gumbel-sigmoid☆32May 28, 2020Updated 5 years ago
- Poet: Product-oriented Video Captioner for E-commerce☆12Sep 21, 2020Updated 5 years ago
- UCLA LaTeX Thesis Template☆17Jun 13, 2017Updated 8 years ago
- Pytorch Implementation of AAMAS 2021 paper <Energy-Based Imitation Learning>☆11Oct 8, 2021Updated 4 years ago
- G-HER algorithm☆18May 24, 2019Updated 6 years ago
- Python neighbor-joining library. Goal: Efficient O(n^2) neighbor-joining algorithm.☆12May 5, 2014Updated 11 years ago
- My PhD thesis, titled "Reasonably Programmable Syntax"☆15Aug 28, 2018Updated 7 years ago
- Partial implementation of ODE-GAN technique from the paper Training Generative Adversarial Networks by Solving Ordinary Differential Equa…☆16Nov 12, 2020Updated 5 years ago
- Pessimistic Value Iteration for Multi-Task Data Sharing in Offline RL☆18Nov 21, 2023Updated 2 years ago
- MLE-Guided Parameter Search (AAAI 2021)☆12Sep 16, 2021Updated 4 years ago
- Implementation/experiments for L4DC 2020 submission "Optimal Cost Design for Model Predictive Control"☆12Apr 23, 2021Updated 4 years ago
- ☆13Jul 2, 2020Updated 5 years ago
- [AAAI-23] Improving Pareto Front Learning via Multi-Sample Hypernetworks☆10Aug 22, 2024Updated last year
- Code for ICML2023 Paper: Continuation Path Learning for Homotopy Optimization☆13Dec 31, 2025Updated 3 months ago
- Collaborative inference of latent diffusion via hivemind☆12May 29, 2023Updated 2 years ago
- Official implementation for the paper: "Shallow Updates for Deep Reinforcement Learning"☆18Nov 2, 2017Updated 8 years ago
- ☆18Dec 6, 2021Updated 4 years ago
- UCT tree search + supervised learning for Atari games☆12Feb 14, 2017Updated 9 years ago
- Contains an implementation of "Imitation Learning via Kernel Mean Embedding (2018, AAAI)"☆11Oct 2, 2018Updated 7 years ago
- Model-Free-Episodic-Control implementation.☆17Jun 3, 2019Updated 6 years ago
- Source code for ScaleGrad☆19Dec 28, 2021Updated 4 years ago
- Replication of the paper "Adaptive dropout for training deep neural networks" using Lasagne.☆12Sep 27, 2016Updated 9 years ago
- Pytorch implementation of XGNN☆10Jan 20, 2021Updated 5 years ago
- [NeurIPS 2021 | AIJ 2024] Multi-Objective Meta Learning☆17Jul 31, 2024Updated last year
- Pytorch implementation of Adaptive Dropout, a.k.a. Standout.☆12Feb 22, 2018Updated 8 years ago
- Open AI Gym version of Berkeley AI Pacman with images as states☆13May 4, 2018Updated 7 years ago
- facebook link prediction kaggle challenge.☆15Aug 10, 2014Updated 11 years ago
- Lectures on NLP☆13Aug 18, 2023Updated 2 years ago
- Official repository of the paper "Understanding the decisions of CNNs: an in-model approach"☆10Sep 7, 2021Updated 4 years ago
- Just a demonstration of some sampling techniques (rejection sampling, importance sampling, sampling importance resampling, Metropolis sam…☆11Aug 24, 2013Updated 12 years ago
- ☆10Jun 14, 2025Updated 10 months ago
- Implementation of the paper titled "FACE: Feasible and actionable counterfactual recourse" by Rafael et al. - https://arxiv.org/pdf/190…☆14Dec 12, 2020Updated 5 years ago
- pytorch implementation of VAE-Gumbel-Softmax☆63Jul 6, 2020Updated 5 years ago
- Implement Categorical Variational autoencoder using Pytorch☆15Apr 25, 2018Updated 7 years ago
- Random parameter environments using gym 0.7.4 and mujoco-py 0.5.7☆20Feb 14, 2019Updated 7 years ago
- ☆25Apr 21, 2021Updated 4 years ago
- Implementation of meta-tail2vec published in CIKM 2020 paper "Towards Locality-Aware Meta-Learning of Tail Node Embeddings on Networks".☆13Dec 10, 2020Updated 5 years ago