Official implementation of the paper "Approximating two value functions instead of one: towards characterizing a new family of Deep Reinforcement Learning Algorithms" (https://arxiv.org/abs/1909.01779). To appear at the NeurIPS 2019 DRL Workshop.
☆11 · Jul 14, 2021 · Updated 4 years ago
Alternatives and similar repositories for Deep-Quality-Value-Family
Users that are interested in Deep-Quality-Value-Family are comparing it to the libraries listed below.
- Documentation and guidelines for the Alan GPU cluster at the University of Liège. ☆21 · Jul 19, 2023 · Updated 2 years ago
- Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method. ☆11 · Jun 12, 2019 · Updated 6 years ago
- Code repository for the generalized Galton board example in the paper "Mining gold from implicit models to improve likelihood-free infere… ☆34 · Dec 2, 2019 · Updated 6 years ago
- Code repository for the paper "Constraining Effective Field Theories with Machine Learning" ☆22 · Sep 11, 2019 · Updated 6 years ago
- This is a TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning. (C51-DDPG) ☆11 · Sep 14, 2017 · Updated 8 years ago
- Full Chainer implementation of OpenAI's Reinforcement Learning using Random Network Distillation ☆32 · Apr 15, 2019 · Updated 7 years ago
- Sentiment Analysis via RNN, RNTN. Based on Stanford's Sentiment Analysis page. ☆10 · Feb 5, 2015 · Updated 11 years ago
- Code for Policy Consolidation for Continual Reinforcement Learning ☆10 · May 12, 2019 · Updated 6 years ago
- Emotiv SDK Community Edition ☆12 · Oct 9, 2015 · Updated 10 years ago
- Reproducible research and reusable acyclic workflows in Python. Execute code on HPC systems as if you executed them on your personal comp… ☆18 · Jan 11, 2022 · Updated 4 years ago
- Information and Cyber Security Certifications ☆14 · Feb 14, 2019 · Updated 7 years ago
- Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019. ☆16 · Sep 24, 2019 · Updated 6 years ago
- Repository with the code of "HNPE: Leveraging Global Parameters for Neural Posterior Estimation" ☆14 · Mar 18, 2024 · Updated 2 years ago
- Manuscript and code for the paper "Gradient Energy Matching for Distributed Asynchronous Gradient Descent". ☆19 · May 25, 2018 · Updated 7 years ago
- Scripts for estimating and visualizing epidemiological modeling of an epidemic (developed for COVID-19) ☆13 · May 28, 2021 · Updated 4 years ago
- In Progress: State of the art Distributed Distributional Deep Deterministic Policy Gradient algorithm implementation in pytorch. ☆19 · Jun 15, 2018 · Updated 7 years ago
- Semantic alignment of astronomical data with natural language using multi-modal models. (Jax) Code associated with https://arxiv.org/abs/… ☆17 · Oct 18, 2024 · Updated last year
- Codepack accompanying "Internal models for interpreting neural population activity during sensorimotor control," by Matthew D. Golub, Byr… ☆16 · Jul 3, 2018 · Updated 7 years ago
- Code for reproducing the experiment results of the paper Imitation Learning with Sinkhorn Distances. ☆14 · Aug 2, 2020 · Updated 5 years ago
- Pylearn2 in practice ☆41 · Dec 25, 2014 · Updated 11 years ago
- Procedural object generation for robotic manipulation ☆11 · Oct 6, 2018 · Updated 7 years ago
- ☆18 · Dec 26, 2024 · Updated last year
- A tensorflow implementation of hindsight experience replay ☆17 · Apr 19, 2018 · Updated 8 years ago
- Creates graphs to show a publication's impact, and the impact of cited publications, and papers who've cited a publication of interest. ☆16 · Aug 16, 2016 · Updated 9 years ago
- ZeroMQ For Robot Control ☆15 · Oct 6, 2016 · Updated 9 years ago
- Nonequispaced FFTs on GPUs (based on NFFT: http://www.nfft.org) ☆11 · Apr 30, 2018 · Updated 8 years ago
- A novel DDPG method with prioritized experience replay (IEEE SMC 2017) ☆51 · Nov 13, 2018 · Updated 7 years ago
- Code for the paper "Towards Reliable Simulation-Based Inference with Balanced Neural Ratio Estimation". ☆14 · Nov 14, 2022 · Updated 3 years ago
- Code for "Dynamic NeRFs for Soccer Scenes", by Lewin, Vandegar, Hoyoux, Barnich, and Louppe. (2023) ☆25 · May 31, 2024 · Updated last year
- ☆14 · Jun 26, 2019 · Updated 6 years ago
- Tutorial on Multi-Agent Reinforcement for Train Scheduling ☆11 · May 18, 2020 · Updated 5 years ago
- some NDK sample ☆11 · Mar 11, 2018 · Updated 8 years ago
- Normalizing flow models allowing for a conditioning context, implemented using Jax, Flax, and Distrax. ☆20 · Mar 10, 2024 · Updated 2 years ago
- Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND) ☆41 · Jan 28, 2019 · Updated 7 years ago
- Repository for 'Interpretable embeddings from molecular simulations using gaussian mixture variational autoencoders' ☆20 · Jan 6, 2020 · Updated 6 years ago
- Implicit Distributional Actor Critic ☆11 · Dec 8, 2021 · Updated 4 years ago
- End-to-end analysis pipeline of the hierarchical time delay cosmographic analysis presented in TDCOSMO IV ☆12 · Jul 8, 2020 · Updated 5 years ago
- Cross-entropy method variants for optimization in Julia ☆12 · Apr 29, 2021 · Updated 5 years ago
- Flying Cavalry Project - Ucan Kavalye Projesi ☆15 · May 12, 2022 · Updated 3 years ago