Code for the paper Alpha Zero in Continuous Action Space (A0C) (https://arxiv.org/pdf/1805.09613.pdf)
☆15Jan 19, 2021Updated 5 years ago
Alternatives and similar repositories for a0c
Users that are interested in a0c are comparing it to the libraries listed below
Sorting:
- Pytorch based implementation of Upside Down Reinforcement Learning (UDRL) by J. Schmidhuber et al.☆11May 1, 2020Updated 5 years ago
- Repository for ML Reproducibility Challenge 2020 for the Neurips paper, "The Value Equivalence Principle for Model-Based Reinforcement Le…☆18Apr 13, 2021Updated 4 years ago
- ☆15Oct 20, 2020Updated 5 years ago
- Official codebase for Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings.☆21Mar 5, 2021Updated 4 years ago
- ☆17Jun 20, 2023Updated 2 years ago
- A PyTorch implementation of DeepMind's MCTSnet☆18Dec 8, 2022Updated 3 years ago
- ☆54Feb 28, 2024Updated 2 years ago
- ☆25Dec 10, 2021Updated 4 years ago
- DyNODE: Neural Ordinary Differential Equations for Dynamics Modeling in Continuous Control☆23Sep 14, 2020Updated 5 years ago
- ☆27May 17, 2019Updated 6 years ago
- ☆74Mar 15, 2021Updated 4 years ago
- Mutual Information State Intrinsic Control (ICLR 2021 Spotlight)☆38Mar 1, 2021Updated 5 years ago
- Reinforcement learning environment for UR5e robot with OPENAI gym like format. Include both simulation and real parts.☆14Nov 2, 2021Updated 4 years ago
- Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning"☆39Jan 22, 2021Updated 5 years ago
- A generic tensorflow library for robotics: a bridge between robotics problem and modern machine learning architecture. Provides forward k…☆13Apr 12, 2024Updated last year
- A mouse brain histology tool for neuroscientists.☆13Feb 16, 2026Updated 2 weeks ago
- Tabula Rasa Tic-Tac-Toe☆10Jan 3, 2019Updated 7 years ago
- Source codes of Learning Causal Representations for Robust Domain Adaptation (IEEE TKDE)☆12Feb 14, 2022Updated 4 years ago
- Free file storage options for Heroku hosted applications☆12Jan 27, 2025Updated last year
- Contextual Bandit Spectral Representation Learner☆12Oct 25, 2022Updated 3 years ago
- Heatmap-based Out-of-Distribution Detection (WACV 2023)☆13Mar 27, 2024Updated last year
- Mirror Descent Policy Optimization☆42Oct 31, 2020Updated 5 years ago
- ROS integration for Franka Emika research robots in Cognitive Robotics TU Delft☆10Jan 18, 2023Updated 3 years ago
- J. Solomon, G. Peyré, V. Kim, S. Sra. Entropic Metric Alignment for Correspondence Problems. ACM Transactions on Graphics (Proc. SIGGRAPH…☆10Jan 7, 2017Updated 9 years ago
- Ranking-Consistent Language-Image Pretraining☆12Oct 24, 2025Updated 4 months ago
- ☆11Jun 7, 2023Updated 2 years ago
- PyTorch examples powered by Lightning☆12Dec 28, 2022Updated 3 years ago
- Supabase MCP Server enabling Cursor & Windsurf to use any method from Management API and query your database☆10Mar 3, 2025Updated 11 months ago
- Track and blur any object or person in a video.☆14Feb 10, 2024Updated 2 years ago
- Code for CoRL 2022 paper: https://arxiv.org/abs/2211.09006 (simulation environments)☆11Feb 9, 2023Updated 3 years ago
- [ICML2023] Instant Soup Cheap Pruning Ensembles in A Single Pass Can Draw Lottery Tickets from Large Models. Ajay Jaiswal, Shiwei Liu, Ti…☆11Nov 28, 2023Updated 2 years ago
- add any graph structure between objects with 2 simple classes + iteration, visitation, shortest path☆12Oct 27, 2022Updated 3 years ago