Blog post: how to do deterministic policy gradient with gumbel softmax and why you should do it.
☆12Jun 20, 2017Updated 8 years ago
Alternatives and similar repositories for gumbel_dpg
Users that are interested in gumbel_dpg are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Reference implementation of algorithms for reinforcement learning and Markov decision processes.☆12Jan 28, 2021Updated 5 years ago
- ☆13Jul 2, 2025Updated 10 months ago
- Variable-order CRFs with structure learning☆17Aug 1, 2024Updated last year
- Experiments with binary LSTM using gumbel-sigmoid☆32May 28, 2020Updated 6 years ago
- a libp2p-backed daemon wrapping the functionalities of go-libp2p for use in other languages☆11Feb 9, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Online Preference Alignment for Language Models via Count-based Exploration☆18Jan 14, 2025Updated last year
- Poet: Product-oriented Video Captioner for E-commerce☆12Sep 21, 2020Updated 5 years ago
- Pytorch Implementation of AAMAS 2021 paper <Energy-Based Imitation Learning>☆11Oct 8, 2021Updated 4 years ago
- G-HER algorithm☆18May 24, 2019Updated 7 years ago
- ☆15Dec 31, 2020Updated 5 years ago
- My PhD thesis, titled "Reasonably Programmable Syntax"☆15Aug 28, 2018Updated 7 years ago
- MLE-Guided Parameter Search (AAAI 2021)☆12Sep 16, 2021Updated 4 years ago
- APAR: LLMs Can Do Auto-Parallel Auto-Regressive Decoding☆14Jul 22, 2024Updated last year
- Code for ICML2023 Paper: Continuation Path Learning for Homotopy Optimization☆13Dec 31, 2025Updated 4 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Receiver operating characteristic curve (ROC) computation code in C++☆11Jul 17, 2017Updated 8 years ago
- Collaborative inference of latent diffusion via hivemind☆12May 29, 2023Updated 3 years ago
- ☆16Apr 12, 2023Updated 3 years ago
- Contains an implementation of "Imitation Learning via Kernel Mean Embedding (2018, AAAI)"☆11Oct 2, 2018Updated 7 years ago
- An implementation of the Sequence to Sequence model using the Lasagne library (WIP)☆12Aug 11, 2016Updated 9 years ago
- A PyTorch implementation of REINFORCE Learning To Rank on OSHUMED, MQ, etc. dataset. Basic idea also appears in SIGIR'17 Reinforcement Le…☆18Dec 8, 2017Updated 8 years ago
- [NeurIPS 2021 | AIJ 2024] Multi-Objective Meta Learning☆17Jul 31, 2024Updated last year
- Pytorch implementation of Adaptative Dropout a.ka Standout.☆12Feb 22, 2018Updated 8 years ago
- A badge for join telegram chat room or channel.☆15Jan 9, 2016Updated 10 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code for "A Framework for Controllable Pareto Front Learning with Completed Scalarization Functions and its Applications"☆16Aug 11, 2024Updated last year
- Recommended system algorithm implementation☆10Feb 18, 2020Updated 6 years ago
- Lectures on NLP☆13Aug 18, 2023Updated 2 years ago
- Official repository of the paper "Understanding the decisions of CNNs: an in-model approach"☆10Sep 7, 2021Updated 4 years ago
- Official python implementation of R3-Transformer☆15Nov 30, 2020Updated 5 years ago
- Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm☆44Nov 15, 2018Updated 7 years ago
- Official repository for the AAAI-21 paper 'Explainable Models with Consistent Interpretations'☆18Apr 5, 2022Updated 4 years ago
- First-Order Probabilistic Programming Language☆29Jun 3, 2019Updated 6 years ago
- Supporting models and data to doi 10.1021/acs.jcim.1c01163☆15Oct 11, 2022Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Implementation of the paper titled: "FACE: Feasible and actionable counterfactual recourse" by Rafael et. at. - https://arxiv.org/pdf/190…☆14Dec 12, 2020Updated 5 years ago
- pytorch implementation of VAE-Gumble-Softmax☆63Jul 6, 2020Updated 5 years ago
- Interpretable Explanations of Black Boxes by Meaningful Perturbation Pytorch☆12Aug 30, 2024Updated last year
- Implement Categorical Variational autoencoder using Pytorch☆15Apr 25, 2018Updated 8 years ago
- Voice to vector [Russian]☆15Feb 5, 2017Updated 9 years ago
- Implementation of meta-tail2vec published in CIKM 2020 paper "Towards Locality-Aware Meta-Learning of Tail Node Embeddings on Networks".☆13Dec 10, 2020Updated 5 years ago
- Official Python implementation of IEEE JBHI 2021 paper: "Choquet Integral and Coalition Game-based Ensemble of Deep Learning Models for C…☆16Jun 27, 2022Updated 3 years ago