yandexdataschool/gumbel_dpg

Blog post: how to do deterministic policy gradient with gumbel softmax and why you should do it.

☆12

Alternatives and similar repositories for gumbel_dpg

Users who are interested in gumbel_dpg are comparing it to the repositories listed below.

YourMapper / GTFS-realtime-JSON
Online JS Google Maps API application that pulls in real time structured JSON to create a live map of all busses/trains in a city's GTFS …
☆18 · Oct 31, 2013 · Updated 12 years ago
timvieira / rl
Reference implementation of algorithms for reinforcement learning and Markov decision processes.
☆12 · Jan 28, 2021 · Updated 5 years ago
eniompw / nanoGPTshakespeare
finetuning shakespeare on karpathy/nanoGPT
☆23 · Feb 2, 2023 · Updated 3 years ago
Hritikbansal / jpo
☆13 · Jul 2, 2025 · Updated last year
Baichenjia / Pix2Pix-eager
Tensorflow eager implementation of Pix2Pix (Image-to-image translation with conditional adversarial networks)
☆12 · Aug 12, 2019 · Updated 6 years ago
timvieira / vocrf
Variable-order CRFs with structure learning
☆17 · Aug 1, 2024 · Updated last year
yandexdataschool / gumbel_lstm
Experiments with binary LSTM using gumbel-sigmoid
☆32 · May 28, 2020 · Updated 6 years ago
JoshuaDavid / Neighbor_Joining
Python neighbor-joining library. Goal: Efficient O(n^2) neighbor-joining algorithm.
☆12 · May 5, 2014 · Updated 12 years ago
learning-at-home / go-libp2p-daemon
a libp2p-backed daemon wrapping the functionalities of go-libp2p for use in other languages
☆11 · Feb 9, 2025 · Updated last year
cicl-stanford / csm
Contains all materials for the paper "A counterfactual simulation model of causal judgment".
☆25 · Jul 15, 2021 · Updated 5 years ago
zgcgreat / ffm-tencent
☆10 · May 14, 2017 · Updated 9 years ago
mauryquijada / gtfs-mysql
This repository is a MySQL database schema for the GTFS specification.
☆22 · Feb 26, 2017 · Updated 9 years ago
shengyuzhang / Poet
Poet: Product-oriented Video Captioner for E-commerce
☆12 · Sep 21, 2020 · Updated 5 years ago
apexrl / EBIL-torch
Pytorch Implementation of AAMAS 2021 paper <Energy-Based Imitation Learning>
☆12 · Oct 8, 2021 · Updated 4 years ago
louis925 / uclathes
UCLA LaTeX Thesis Template
☆17 · Jun 13, 2017 · Updated 9 years ago
UMBC-CMSC447 / General
General things for CMSC 447.
☆21 · Feb 3, 2015 · Updated 11 years ago
Baichenjia / GHER
G-HER algorithm
☆18 · May 24, 2019 · Updated 7 years ago
shengyuzhang / VideoTitling
Comprehensive Information Integration Modeling Framework for Video Titling
☆11 · Aug 27, 2020 · Updated 5 years ago
nshepperd / ode-gan-pytorch
☆15 · Dec 31, 2020 · Updated 5 years ago
luisibanez / ImageReconstruction
Reconstruction of 3D volumetric dataset from a collection of 2D slices
☆25 · Dec 28, 2020 · Updated 5 years ago
BMS-geodev / vectra-py
WIP port of the vectra js in memory vector database.
☆27 · Nov 8, 2024 · Updated last year
cyrus- / thesis
My PhD thesis, titled "Reasonably Programmable Syntax"
☆15 · Aug 28, 2018 · Updated 7 years ago
Pratik08 / Vis-DSS
☆12 · Dec 9, 2018 · Updated 7 years ago
sustainable-computing / EnergyBoost
Code release of EnergyBoost: Learning-based Control of Home Batteries
☆27 · Apr 13, 2021 · Updated 5 years ago
Baichenjia / UTDS
Pessimistic Value Iteration for Multi-Task Data Sharing in Offline RL
☆18 · Nov 21, 2023 · Updated 2 years ago
titu1994 / pytorch_odegan
Partial implementation of ODE-GAN technique from the paper Training Generative Adversarial Networks by Solving Ordinary Differential Equa…
☆16 · Nov 12, 2020 · Updated 5 years ago
avikj / L4DC-MPC-OCD
Implementation/experiments for L4DC 2020 submission "Optimal Cost Design for Model Predictive Control"
☆12 · Apr 23, 2021 · Updated 5 years ago
wellecks / mgs
MLE-Guided Parameter Search (AAAI 2021)
☆12 · Sep 16, 2021 · Updated 4 years ago
callous-youth / IAPTT-GM
Code Repository for NeurIPS 2021 accepted paper, named "Towards Gradient-based Bilevel Optimization with non-convex Followers and Beyond…
☆11 · Mar 28, 2022 · Updated 4 years ago
KAIST-AILab / gmmil
Contains an implementation of "Imitation Learning via Kernel Mean Embedding (2018, AAAI)"
☆11 · Oct 2, 2018 · Updated 7 years ago
THUDM / APAR
APAR: LLMs Can Do Auto-Parallel Auto-Regressive Decoding
☆14 · Jul 22, 2024 · Updated 2 years ago
wokas36 / DFNets
Distributed Feedback-Looped Networks
☆10 · Jan 15, 2020 · Updated 6 years ago
matejbalog / gumbel-relatives
Code to reproduce experiments appearing in the academic paper Lost Relatives of the Gumbel Trick
☆17 · Jun 14, 2017 · Updated 9 years ago
fiezt / ICML-2020-Implicit-Stackelberg-Learning
☆13 · Jul 2, 2020 · Updated 6 years ago
agentpilled / ClipBrain
Your AI never starts from zero. Capture everything you read with Cmd+Shift+S.
☆24 · Jul 1, 2026 · Updated 3 weeks ago
Xi-L / CPL
Code for ICML2023 Paper: Continuation Path Learning for Homotopy Optimization
☆13 · Dec 31, 2025 · Updated 6 months ago
longhp1618 / MultiSample-Hypernetworks
[AAAI-23] Improving Pareto Front Learning via Multi-Sample Hypernetworks
☆10 · Aug 22, 2024 · Updated last year
wulfebw / playing_atari
learning to play atari games with reinforcement learning
☆10 · Jan 4, 2016 · Updated 10 years ago
splintersu / NetworkSimplex
A C++ implementation of Network Simplex Algorithm
☆11 · Nov 12, 2018 · Updated 7 years ago