Code for Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation @ NeurIPS 2021
☆13Nov 3, 2021Updated 4 years ago
Alternatives and similar repositories for neurips2021-meta-gradient-offpolicy-evaluation
Users that are interested in neurips2021-meta-gradient-offpolicy-evaluation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- (NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games.☆28Nov 19, 2021Updated 4 years ago
- A collection of deep reinforcement learning algorithm implementations☆11Jan 9, 2020Updated 6 years ago
- ☆14Jun 26, 2019Updated 6 years ago
- A collection of Meta-Reinforcement Learning algorithms in PyTorch☆52Jul 16, 2024Updated last year
- [ICML 2021] DFAC Framework: Factorizing the Value Function via Quantile Mixture for Multi-Agent Distributional Q-Learning☆32Jun 1, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Solving the inverse kinematics problem of a 3 Link Planar Manipulator using neural networks.☆10Jul 19, 2020Updated 5 years ago
- ☆13Jun 3, 2022Updated 3 years ago
- Multi-agent reinforcement learning methods to solve the multi-ship dynamic weapon-target assignment problem☆14Jul 25, 2024Updated last year
- Variational Gaussian Process Motion Planning☆20Jul 30, 2024Updated last year
- Code for the paper "Deep FTRL-ORW: An Efficient Deep Reinforcement Learning Algorithm for Solving Imperfect Information Extensive-Form Ga…☆11Dec 1, 2022Updated 3 years ago
- Comparison between a PID, ANN & Fuzzy-PID controller while controlling a Robotic Arm in Simulink☆11Jan 31, 2022Updated 4 years ago
- [EMNLP 24] Source code for paper 'AdaZeta: Adaptive Zeroth-Order Tensor-Train Adaption for Memory-Efficient Large Language Models Fine-Tu…☆13Dec 15, 2024Updated last year
- ☆10Apr 26, 2023Updated 2 years ago
- Tutorial on Multi-Objective Recommender Systems @ KDD 2021☆19Dec 4, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Butterfly Optimization Algorithm for Weapon Target Assignment Problem☆12Jul 24, 2021Updated 4 years ago
- ☆10Apr 23, 2021Updated 4 years ago
- Source code for the Joint Shapley values: a measure of joint feature importance☆12Sep 14, 2021Updated 4 years ago
- Codes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023☆30Jul 18, 2023Updated 2 years ago
- Official implementation of "Approximating Gradients for Differentiable Quality Diversity in Reinforcement Learning"☆22Oct 3, 2022Updated 3 years ago
- MetaGenRL, a novel meta reinforcement learning algorithm. Unlike prior work, MetaGenRL can generalize to new environments that are entire…☆69Jun 5, 2020Updated 5 years ago
- Official implementation for the paper "Offline Meta RL - Identifiability Challenges and Effective Data Collection Strategies", NeurIPS 20…☆31Nov 23, 2021Updated 4 years ago
- A python API for plane detection in point clouds☆12Apr 22, 2021Updated 4 years ago
- WIP implementation of Probabilistic Differential Dynamic Programming in PyTorch☆16Jul 25, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Point cloud capture of Gocator3100 device☆13Dec 20, 2016Updated 9 years ago
- ☆15Nov 4, 2021Updated 4 years ago
- BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution☆59Oct 13, 2025Updated 6 months ago
- Graph Neural Networks utilization for Spatiotemporal graphs. These methods will be applied into the problem of forecasting traffic flow o…☆26Mar 22, 2021Updated 5 years ago
- ☆10Jul 14, 2018Updated 7 years ago
- Official PyTorch implementation of "EvoGrad: Efficient Gradient-Based Meta-Learning and Hyperparameter Optimization"☆23Oct 24, 2021Updated 4 years ago
- rbf network based control for robot manipulator☆15Feb 21, 2022Updated 4 years ago
- 本项目对于高动态飞行器过载控制系统,提出了基于角加速度反馈的自动驾驶仪设计方案,并设计了自抗扰控制系统。☆20Mar 25, 2024Updated 2 years ago
- MetaPlanner is an open source automated treatment planning method that performs meta-optimization of treatment planning hyperparameters. …☆14Nov 7, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 用react-native开发的一个简单跑马灯抽奖demo,使用了react-navigation,可以自定义奖品名称,抽奖定时等☆19Dec 18, 2017Updated 8 years ago
- Trajectory planning for ABB IRB140 industrial manipulator using MATLAB and CoppeliaSim(VREP). The manipulator' task is to spill a bottle …☆14Feb 8, 2021Updated 5 years ago
- Approximate Q-Learning algorithm designed for the WTA (weapon target assignment) problem. Created for an AI Competition.☆15Mar 7, 2023Updated 3 years ago
- Implementation of Population-Guided Parallel Policy Search for Reinforcement Learning☆22Jan 9, 2020Updated 6 years ago
- Play with agents and more.☆21Sep 18, 2023Updated 2 years ago
- Ruckig and nonlinear filters to create dynamically feasible reference trajectories☆23Jan 15, 2026Updated 3 months ago
- Combined Learning from Demonstration and Motion Planning☆14Feb 5, 2019Updated 7 years ago