proroklab / graph-conv-memory
Graph convolutional memory
☆15Updated 2 years ago
Related projects: ⓘ
- Heterogeneous Multi-Robot Reinforcement Learning☆32Updated last week
- ☆20Updated 5 months ago
- Representing robots as graphs for reinforcement-learning in PyBullet locomotion environments.☆26Updated 3 years ago
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]☆32Updated 2 years ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆35Updated 2 weeks ago
- PyTorch implementation of our paper Reinforcement Learning with Random Delays (ICLR 2020)☆37Updated 2 years ago
- Discriminative Particle Filter Reinforcement Learning for Complex Partial Observations (ICLR 2020)☆25Updated 2 years ago
- Learning to Coordinate Manipulation Skills via Skill Behavior Diversification (ICLR 2020)☆43Updated 2 years ago
- Learning multi-agent robotic mobile manipulation with deep reinforcement learning☆94Updated 2 years ago
- Model-based Policy Gradients☆29Updated 4 years ago
- Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning"☆37Updated 3 years ago
- 🧶 Minimal PyTorch Soft Actor Critic (SAC) implementation☆35Updated 2 years ago
- Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning" (ICML 2021)☆31Updated last year
- Codes for the study "Variational Recurrent Models for Solving Partially Observable Control Tasks", published as a conference paper at ICL…☆49Updated 3 years ago
- ☆68Updated 4 years ago
- ☆32Updated 2 years ago
- Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019)☆33Updated 4 years ago
- ☆30Updated 3 years ago
- ☆27Updated 3 years ago
- Code and project page for D-REX algorithm from the paper "Better-than-Demonstrator Imitation Learning via Automatically-Ranked Demonstrat…☆49Updated last year
- IMP-MARL: a Suite of Environments for Large-scale Infrastructure Management Planning via MARL☆35Updated last week
- ☆33Updated last year
- Model Predictive Actor-Critic Reinforcement Learning☆50Updated 2 years ago
- Source files to replicate experiments in my ICLR 2022 paper.☆59Updated 2 months ago
- Offline Risk-Averse Actor-Critic (O-RAAC). A model-free RL algorithm for risk-averse RL in a fully offline setting☆33Updated 3 years ago
- Working directory for dynamics learning for experimental robots.☆55Updated 3 years ago
- Official codebase for LEAP: Planning with Goal Conditioned Policies☆50Updated last year
- We investigate the effect of populations on finding good solutions to the robust MDP☆28Updated 3 years ago
- Pytorch Implementation of Policy Distillation for control, which has well-trained teachers via stable_baselines3.☆50Updated 3 years ago
- Learning robotic mobile manipulation with deep reinforcement learning☆32Updated 2 years ago