[ICLR'20] Learning to Learn by Zeroth-Order Oracle
☆14 · Feb 7, 2020 · Updated 6 years ago
Alternatives and similar repositories for ZO-L2L
Users who are interested in ZO-L2L are comparing it to the libraries listed below.
- Implementation of Population-Guided Parallel Policy Search for Reinforcement Learning ☆22 · Jan 9, 2020 · Updated 6 years ago
- Source code for ICLR 2020 paper: "Learning to Guide Random Search" ☆40 · Sep 2, 2024 · Updated last year
- ☆14 · Jun 26, 2019 · Updated 6 years ago
- Pytorch implementation of BEAR in "Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction" ☆11 · Oct 29, 2019 · Updated 6 years ago
- DNE4py is a python library that aims to run and visualize many different evolutionary algorithms with high performance using mpi4py. It a… ☆10 · Oct 13, 2020 · Updated 5 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019) ☆17 · Dec 8, 2022 · Updated 3 years ago
- This course introduced me to three cutting-edge technologies for privacy-preserving AI: Federated Learning, Differential Privacy, and Enc… ☆11 · Sep 2, 2019 · Updated 6 years ago
- Inverse Reinforcement learning proof-of-concept using the Guided Cost/Reward Learning approach ☆10 · Mar 23, 2020 · Updated 6 years ago
- [ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning ☆34 · Feb 1, 2020 · Updated 6 years ago
- ☆17 · Dec 12, 2020 · Updated 5 years ago
- A comparison of parameter space noise methods for exploration in deep reinforcement learning ☆30 · Mar 14, 2019 · Updated 7 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization ☆24 · Jun 24, 2020 · Updated 5 years ago
- ☆12 · Oct 20, 2024 · Updated last year
- using information theory to encourage agents to cooperate and compete ☆19 · Oct 4, 2018 · Updated 7 years ago
- A reinforcement learning algorithm controller for a satellite using the orekit library ☆20 · Feb 20, 2022 · Updated 4 years ago
- Experiments on AMSGrad -- pytorch version ☆12 · May 30, 2018 · Updated 7 years ago
- Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t… ☆46 · Oct 29, 2020 · Updated 5 years ago
- Example of android app written in Qt/Qml which uses MXNet for plant image recognition. ☆10 · Nov 4, 2017 · Updated 8 years ago
- Anti exploration in offline reinforcement learning ☆11 · May 17, 2021 · Updated 4 years ago
- [CoRL 2020] COG: Connecting New Skills to Past Experience with Offline Reinforcement Learning ☆35 · Oct 28, 2020 · Updated 5 years ago
- Implementation of the paper "Adaptive Skip Intervals: Temporal Abstraction for Recurrent Dynamical Models" ☆24 · Sep 7, 2018 · Updated 7 years ago
- An implementation of effective policy ensemble. ☆16 · Jul 5, 2023 · Updated 2 years ago
- Code for the paper "Learning Step-Size Adaptation in CMA-ES" ☆12 · Mar 24, 2023 · Updated 3 years ago
- Package to support the analysis of high-precision astrometry timeseries, in particular the determination of Keplerian orbits. ☆18 · May 28, 2025 · Updated 10 months ago
- PyTorch - Implicit Quantile Networks - Quantile Regression - C51 ☆22 · Jul 26, 2019 · Updated 6 years ago
- Code for ☆15 · Oct 16, 2020 · Updated 5 years ago
- Code for ICLR 2024 paper "When should we prefer Decision Transformers for Offline Reinforcement Learning?" ☆17 · Jan 31, 2024 · Updated 2 years ago
- Computes trajectories for evolutionary dynamics. ☆15 · Oct 6, 2020 · Updated 5 years ago
- Official repository for "Structure-Enhanced Pop Music Generation via Harmony-Aware Learning", ACM MM 2022. ☆15 · Mar 22, 2023 · Updated 3 years ago
- Official implementation of the paper "On the Importance of Environments in Human-Robot Coordination", published in RSS 2021. ☆16 · May 1, 2024 · Updated last year
- Official implementation of MURAL (ICML 2021) ☆17 · Sep 23, 2021 · Updated 4 years ago
- PyTorch Implementation of "NDDR-CNN: Layerwise Feature Fusing in Multi-Task CNNs by Neural Discriminative Dimensionality Reduction" ☆14 · Jun 29, 2019 · Updated 6 years ago
- Repository for DTU Special Course, focusing on Variational Inference using Normalizing Flows (VINF). Supervised by Michael Riis Andersen ☆26 · Jun 11, 2020 · Updated 5 years ago
- Minimizing Control for Credit Assignment with Strong Feedback ☆14 · Nov 3, 2024 · Updated last year
- just a few trouble shooting tips I have found for training variational autoencoders. All code in tensorflow ☆23 · Sep 18, 2016 · Updated 9 years ago
- Repo for the paper: Learning with Muscles: Benefits for Data-Efficiency and Robustness in Anthropomorphic Tasks. https://al.is.mpg.de/pub… ☆15 · Dec 1, 2022 · Updated 3 years ago
- Code for [NeurIPS'2019 Spotlight] Policy Continuation with Hindsight Inverse Dynamics ☆15 · Jan 7, 2020 · Updated 6 years ago
- BBO optimiser ☆11 · Feb 11, 2020 · Updated 6 years ago
- Lightweight simulator of a roomba-like robot ☆13 · Nov 30, 2022 · Updated 3 years ago