Official implementation of the paper "Approximating two value functions instead of one: towards characterizing a new family of Deep Reinforcement Learning Algorithms": https://arxiv.org/abs/1909.01779 To appear at the next NeurIPS2019 DRL-Workshop
☆11Jul 14, 2021Updated 4 years ago
Alternatives and similar repositories for Deep-Quality-Value-Family
Users that are interested in Deep-Quality-Value-Family are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.☆11Jun 12, 2019Updated 7 years ago
- Code for the paper Recurrent Machines for Likelihood-Free Inference☆15Feb 1, 2019Updated 7 years ago
- Meeting repo for likelihood free inference meeting☆14Oct 6, 2022Updated 3 years ago
- Excursion Set Estimation☆22Sep 23, 2021Updated 4 years ago
- Code repository for the paper "Constraining Effective Field Theories with Machine Learning"☆22Sep 11, 2019Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Template for talks in remark+KaTeX.☆26Aug 27, 2020Updated 5 years ago
- Materials for DATS0001 Foundations of Data Science, ULiège☆48Jan 8, 2026Updated 5 months ago
- Full Chainer implementation of OpenAI's Reinforcement Learning using Random Network Distillation☆32Apr 15, 2019Updated 7 years ago
- ☆11Apr 20, 2021Updated 5 years ago
- A Python toolkit for (simulation-based) inference and the mechanization of science.☆54Apr 15, 2022Updated 4 years ago
- Reproducible research and reusable acyclic workflows in Python. Execute code on HPC systems as if you executed them on your personal comp…☆18Jan 11, 2022Updated 4 years ago
- Bot for Minecraft environment☆13Jun 18, 2019Updated 7 years ago
- A tool for correcting for the look-elsewhere effect in 2 dimensions☆11Mar 3, 2016Updated 10 years ago
- Manuscript and code for the paper "Gradient Energy Matching for Distributed Asynchronous Gradient Descent".☆19May 25, 2018Updated 8 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Scripts for estimating and visualizing epidemiological modeling of an epidemic (developed for COVID-19)☆13May 28, 2021Updated 5 years ago
- ☆14Aug 8, 2023Updated 2 years ago
- In Progress : State of the art Distributed Distributional Deep Deterministic Policy Gradient algorithm implementation in pytorch.☆19Jun 15, 2018Updated 8 years ago
- Codepack accompanying "Internal models for interpreting neural population activity during sensorimotor control," by Matthew D. Golub, Byr…☆16Jul 3, 2018Updated 8 years ago
- Pylearn2 in practice☆41Dec 25, 2014Updated 11 years ago
- Repository for the paper "Adversarial Variational Optimization of Non-Differentiable Simulators"☆16Dec 17, 2018Updated 7 years ago
- Research on Inverse Reinforcement Learning for self driving vehicles at UCLA☆13Nov 7, 2018Updated 7 years ago
- Creates graphs to show a publication's impact, and the impact of cited publications, and papers who've cited a publication of interest.☆16Aug 16, 2016Updated 9 years ago
- Code for Posterior Sampling for Deep Reinforcement Learning, ICML 2023☆28Mar 7, 2024Updated 2 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Nonequispaced FFTs on GPUs (based on NFFT: http://www.nfft.org)☆11Apr 30, 2018Updated 8 years ago
- Notes and slides for a course on social and scientific aspects of machine learning☆10Jun 14, 2026Updated 2 weeks ago
- Comparison of bandit algorithms from the Reinforcement Learning bible.☆17Jun 6, 2018Updated 8 years ago
- Code for the AsiaCCS 2017 paper "Discovering Logical Vulnerabilities in the Wi-Fi Handshake using Model-Based Testing".☆13Oct 12, 2018Updated 7 years ago
- Normalizing flow models allowing for a conditioning context, implemented using Jax, Flax, and Distrax.☆20Mar 10, 2024Updated 2 years ago
- Repository for 'Interpretable embeddings from molecular simulations using gaussian mixture variational autoencoders'☆20Jan 6, 2020Updated 6 years ago
- Implicit Distributional Actor Critic☆11Dec 8, 2021Updated 4 years ago
- End-to-end analysis pipeline of the hierarchical time delay cosmographic analysis presented in TDCOSMO IV☆12Jul 8, 2020Updated 5 years ago
- pacport documents☆12Jul 21, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- The XENON1T raw data processor [deprecated]☆16Jan 3, 2021Updated 5 years ago
- Official implementation of DynE, Dynamics-aware Embeddings for RL☆45Apr 28, 2021Updated 5 years ago
- Implementation of Unconstrained Monotonic Neural Network and the related experiments. These architectures are particularly useful for mod…☆129Dec 7, 2025Updated 6 months ago
- differentiable (binned) likelihoods with JAX☆28Jun 27, 2026Updated last week
- ☆46Oct 15, 2013Updated 12 years ago
- VAE + Quantile Networks for MNIST☆12Nov 29, 2018Updated 7 years ago
- Code publication to the paper "Normalized Attention Without Probability Cage"☆17Nov 9, 2021Updated 4 years ago