Blog post: how to do deterministic policy gradient with gumbel softmax and why you should do it.
☆12Jun 20, 2017Updated 8 years ago
Alternatives and similar repositories for gumbel_dpg
Users that are interested in gumbel_dpg are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Reference implementation of algorithms for reinforcement learning and Markov decision processes.☆12Jan 28, 2021Updated 5 years ago
- Variable-order CRFs with structure learning☆17Aug 1, 2024Updated last year
- Experiments with binary LSTM using gumbel-sigmoid☆32May 28, 2020Updated 6 years ago
- a libp2p-backed daemon wrapping the functionalities of go-libp2p for use in other languages☆11Feb 9, 2025Updated last year
- Pytorch Implementation of AAMAS 2021 paper <Energy-Based Imitation Learning>☆11Oct 8, 2021Updated 4 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆15Dec 31, 2020Updated 5 years ago
- My PhD thesis, titled "Reasonably Programmable Syntax"☆15Aug 28, 2018Updated 7 years ago
- MLE-Guided Parameter Search (AAAI 2021)☆12Sep 16, 2021Updated 4 years ago
- Implementation/experiments for L4DC 2020 submission "Optimal Cost Design for Model Predictive Control"☆12Apr 23, 2021Updated 5 years ago
- ☆13Jul 2, 2020Updated 5 years ago
- Code to reproduce experiments appearing in the academic paper Lost Relatives of the Gumbel Trick☆17Jun 14, 2017Updated 9 years ago
- [AAAI-23] Improving Pareto Front Learning via Multi-Sample Hypernetworks☆10Aug 22, 2024Updated last year
- Code for ICML2023 Paper: Continuation Path Learning for Homotopy Optimization☆13Dec 31, 2025Updated 5 months ago
- learning to play atari games with reinforcement learning☆10Jan 4, 2016Updated 10 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆16Apr 12, 2023Updated 3 years ago
- ☆15Sep 15, 2022Updated 3 years ago
- ☆18Dec 6, 2021Updated 4 years ago
- uct tree search + supervised lerning for atari games☆12Feb 14, 2017Updated 9 years ago
- Contains an implementation of "Imitation Learning via Kernel Mean Embedding (2018, AAAI)"☆11Oct 2, 2018Updated 7 years ago
- Model-Free-Episodic-Control implementation.☆17Jun 3, 2019Updated 7 years ago
- Source code for ScaleGrad☆19Dec 28, 2021Updated 4 years ago
- [not maintained anymore] [for study purpose] A simple PyTorch implementation for "Global Vectors for Word Representation".☆17Nov 7, 2019Updated 6 years ago
- A PyTorch implementation of REINFORCE Learning To Rank on OSHUMED, MQ, etc. dataset. Basic idea also appears in SIGIR'17 Reinforcement Le…☆18Dec 8, 2017Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Distributed Feedback-Looped Networks☆10Jan 15, 2020Updated 6 years ago
- Pytorch implementation of XGNN☆10Jan 20, 2021Updated 5 years ago
- Pytorch implementation of Adaptative Dropout a.ka Standout.☆12Feb 22, 2018Updated 8 years ago
- Open AI Gym version of Berkeley AI Pacman with images as states☆13May 4, 2018Updated 8 years ago
- Code for "A Framework for Controllable Pareto Front Learning with Completed Scalarization Functions and its Applications"☆16Aug 11, 2024Updated last year
- facebook link prediction kaggle challenge.☆15Aug 10, 2014Updated 11 years ago
- Recommended system algorithm implementation☆10Feb 18, 2020Updated 6 years ago
- Lectures on NLP☆13Aug 18, 2023Updated 2 years ago
- Official repository of the paper "Understanding the decisions of CNNs: an in-model approach"☆10Sep 7, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Just a demonstration of some sampling techniques (rejection sampling, importance sampling, sampling importance resampling, Metropolis sam…☆11Aug 24, 2013Updated 12 years ago
- ☆10Jun 14, 2025Updated last year
- Minimax Optimization, Stackelberg Games, Generative Adversarial Networks☆19Feb 14, 2020Updated 6 years ago
- First-Order Probabilistic Programming Language☆29Jun 3, 2019Updated 7 years ago
- Official repository for the AAAI-21 paper 'Explainable Models with Consistent Interpretations'☆18Apr 5, 2022Updated 4 years ago
- Supporting models and data to doi 10.1021/acs.jcim.1c01163☆15Oct 11, 2022Updated 3 years ago
- Implementation of the paper titled: "FACE: Feasible and actionable counterfactual recourse" by Rafael et. at. - https://arxiv.org/pdf/190…☆14Dec 12, 2020Updated 5 years ago