Code for the paper Alpha Zero in Continuous Action Space (A0C) (https://arxiv.org/pdf/1805.09613.pdf)
☆15Jan 19, 2021Updated 5 years ago
Alternatives and similar repositories for a0c
Users that are interested in a0c are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official codebase for Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings.☆21Mar 5, 2021Updated 5 years ago
- Tabula Rasa Tic-Tac-Toe☆10Jan 3, 2019Updated 7 years ago
- Explore and Control with Adversarial Surprise☆10Jul 20, 2021Updated 4 years ago
- Repository for ML Reproducibility Challenge 2020 for the Neurips paper, "The Value Equivalence Principle for Model-Based Reinforcement Le…☆18Apr 13, 2021Updated 4 years ago
- Pytorch based implementation of Upside Down Reinforcement Learning (UDRL) by J. Schmidhuber et al.☆12May 1, 2020Updated 5 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A PyTorch implementation of DeepMind's MCTSnet☆18Dec 8, 2022Updated 3 years ago
- Giving Up Control: Neurons as Reinforcement Learning Agents☆13May 6, 2024Updated last year
- ☆55Feb 28, 2024Updated 2 years ago
- PyTorch implementation of Sample Efficient Actor-Critic with Experience Replay(ACER)☆16Oct 7, 2020Updated 5 years ago
- Hand Mesh Recovery models on OakInk-Image dataset☆13Apr 4, 2024Updated 2 years ago
- Prioritized Sequence Experience Replay☆10Aug 16, 2021Updated 4 years ago
- ☆27May 17, 2019Updated 6 years ago
- ☆25Dec 10, 2021Updated 4 years ago
- Repository for Iterated Relearning: The Impact of Non-stationarity on Generalisation in Deep Reinforcement Learning☆11Jun 8, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆11Apr 5, 2021Updated 5 years ago
- Neural Fictitious Self-Play in Leduc Holdem☆11Jul 4, 2018Updated 7 years ago
- This is the implementation of the paper "Sparse Point Cloud Patches Rendering via Splitting 2D Gaussians"(CVPR 2025).☆18May 15, 2025Updated 10 months ago
- My Submission for the OpenAI/NeurIPS ProcGen Competition☆11Nov 12, 2020Updated 5 years ago
- ☆17Jun 20, 2023Updated 2 years ago
- An implementation of the AlphaZero algorithm for adversarial games to be used with the machine learning framework of your choice☆12Aug 30, 2020Updated 5 years ago
- Code for ICRA2021 "Policy Transfer via Kinematic Domain Randomization and Adaptation"☆12Apr 28, 2021Updated 4 years ago
- Semantic-Aware Fine-Grained Correspondence, at ECCV 2022 (Oral)☆14Oct 29, 2022Updated 3 years ago
- My Homepage☆10Mar 30, 2026Updated last week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021)☆55Jul 27, 2021Updated 4 years ago
- Code repository for the research project "You Play Ball, I Play Ball: Bayesian Multi-Agent Reinforcement Learning for Slime Volleyball", …☆17Nov 15, 2020Updated 5 years ago
- Mirror Descent Policy Optimization☆42Oct 31, 2020Updated 5 years ago