[ICLR'20] Learning to Learn by Zeroth-Order Oracle
☆14Feb 7, 2020Updated 6 years ago
Alternatives and similar repositories for ZO-L2L
Users that are interested in ZO-L2L are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of Population-Guided Parallel Policy Search for Reinforcement Learning☆22Jan 9, 2020Updated 6 years ago
- Source code for ICLR 2020 paper: "Learning to Guide Random Search"☆40Sep 2, 2024Updated last year
- Pytorch implementation of BEAR in "Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction"☆11Oct 29, 2019Updated 6 years ago
- DNE4py is a python library that aims to run and visualize many different evolutionary algorithms with high performance using mpi4py. It a…☆10Oct 13, 2020Updated 5 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Dec 8, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- This course introduced me to three cutting-edge technologies for privacy-preserving AI: Federated Learning, Differential Privacy, and Enc…☆11Sep 2, 2019Updated 6 years ago
- Inverse Reinforcement learning proof-of-concept using the Guided Cost/Reward Learning approach☆10Mar 23, 2020Updated 6 years ago
- Code for paper 'ZO-AdaMM: Zeroth-Order Adaptive MomentumMethod for Black-Box Optimization'☆31Jul 7, 2020Updated 5 years ago
- [ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning☆34Feb 1, 2020Updated 6 years ago
- ☆17Dec 12, 2020Updated 5 years ago
- A comparison of parameter space noise methods for exploration in deep reinforcement learning☆30Mar 14, 2019Updated 7 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Jun 24, 2020Updated 5 years ago
- ☆11Dec 9, 2020Updated 5 years ago
- using information theory to encourage agents to cooperate and compete☆19Oct 4, 2018Updated 7 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- A reinforcement learning algorithm controller for a satellite using the orekit library☆20Feb 20, 2022Updated 4 years ago
- Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t…☆46Oct 29, 2020Updated 5 years ago
- Anti exploration in offline reinforcement learning☆11May 17, 2021Updated 5 years ago
- An implementation of the paper "Solving the Rubik's Cube without Human Knowledge"☆14Dec 9, 2018Updated 7 years ago
- Implementation of the paper "Adaptive Skip Intervals: Temporal Abstraction for Recurrent Dynamical Models"☆24Sep 7, 2018Updated 7 years ago
- An implementation of effective policy ensemble.☆16Jul 5, 2023Updated 2 years ago
- PyTorch - Implicit Quantile Networks - Quantile Regression - C51☆22Jul 26, 2019Updated 6 years ago
- Code for☆15Oct 16, 2020Updated 5 years ago
- Reproducing Policy Distillation (DeepMind paper ICLR 2016)☆22Feb 17, 2020Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Codes for Understanding Architectures Learnt by Cell-based Neural Architecture Search☆28Feb 6, 2020Updated 6 years ago
- Code for ICLR 2024 paper "When should we prefer Decision Transformers for Offline Reinforcement Learning?"☆17Jan 31, 2024Updated 2 years ago
- Computes trajectories for evolutionary dynamics.☆15Oct 6, 2020Updated 5 years ago
- Official repository for "Structure-Enhanced Pop Music Generation via Harmony-Aware Learning", ACM MM 2022.☆14Mar 22, 2023Updated 3 years ago
- Official implementation of the paper "On the Importance of Environments in Human-Robot Coordination", published in RSS 2021.☆16May 1, 2024Updated 2 years ago
- Official implementation of MURAL (ICML 2021)☆17Sep 23, 2021Updated 4 years ago
- Implementation of SVRG for training neural networks☆24Nov 24, 2019Updated 6 years ago
- PyTorch Implementation of "NDDR-CNN: Layerwise Feature Fusing in Multi-Task CNNs by Neural Discriminative Dimensionality Reduction"☆14Jun 29, 2019Updated 6 years ago
- Minimizing Control for Credit Assignment with Strong Feedback☆14Nov 3, 2024Updated last year
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Repository for DTU Special Course, focusing on Variational Inference using Normalizing Flows (VINF). Supervised by Michael Riis Andersen☆27Jun 11, 2020Updated 5 years ago
- just a few trouble shooting tips I have found for training variational autoencoders. All code in tensorflow☆23Sep 18, 2016Updated 9 years ago
- Repo for the paper: Learning with Muscles: Benefits for Data-Efficiency and Robustness in Anthropomorphic Tasks. https://al.is.mpg.de/pub…☆15Dec 1, 2022Updated 3 years ago
- Code for [NeurIPS'2019 Spotlight] Policy Continuation with Hindsight Inverse Dynamics☆15Jan 7, 2020Updated 6 years ago
- D3PE (Deep Data-Driven Policy Evaluation) aims to evaluation a large set of candidate policies from a fixed dataset to select best ones.☆10Jun 2, 2022Updated 4 years ago
- BBO optimiser☆11Feb 11, 2020Updated 6 years ago
- Lightweight simulator of a roomba-like robot☆13Nov 30, 2022Updated 3 years ago