A PyTorch implementation of DeepMind's MCTSnet
☆18Dec 8, 2022Updated 3 years ago
Alternatives and similar repositories for MCTSnet
Users that are interested in MCTSnet are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- AlphaGo Zero Reinforcement Learning Sokoban Solver☆11Jun 20, 2018Updated 7 years ago
- Code for "Dream and Search to Control: Latent Space Planning for Continuous Control"☆12Jul 12, 2021Updated 4 years ago
- Source code of "Grid-to-Graph: Flexible Spatial Relational Inductive Biases for Reinforcement Learning" (AAMAS 2021).☆28Aug 19, 2021Updated 4 years ago
- (CoRL 2019 Spotlight) Asynchronous Methods for Model-Based Reinforcement Learning☆14Dec 27, 2022Updated 3 years ago
- PyTorch implementation of Munchausen Reinforcement Learning based on DQN and SAC. Handles discrete and continuous action spaces☆15Oct 3, 2021Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Code for the paper Alpha Zero in Continuous Action Space (A0C) (https://arxiv.org/pdf/1805.09613.pdf)☆15Jan 19, 2021Updated 5 years ago
- MONTE Carlo Tree Search (MCTS) is a method for finding optimal decisions in a given domain by taking random samples in the decision space…☆13Mar 21, 2021Updated 5 years ago
- Implementation for paper "A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning".☆60Sep 25, 2024Updated last year
- Code for NeurIPS 2021 paper "Curriculum Offline Imitation Learning"☆18Oct 21, 2022Updated 3 years ago
- Aligntune : A Modular Toolkit for Post Training Alignment of LLMs☆36Apr 29, 2026Updated 3 weeks ago
- A first bare bones paralleled implementation of Go Explore as described by the Uber Engineering blog post☆46Jan 25, 2019Updated 7 years ago
- Graph convolutional memory for reinforcement learning☆24Jul 10, 2021Updated 4 years ago
- Code for the paper 'Monte Carlo Tree Search for Asymmetric Trees'☆13May 24, 2018Updated 8 years ago
- Sokoban solver☆17Apr 22, 2026Updated last month
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- The simple C/C++ library for hexapod (Robot spider with 6 legs) on Arduino.☆13Dec 27, 2018Updated 7 years ago
- Monte Carlo Tree Search (MCTS) ,realize using python☆12Mar 10, 2016Updated 10 years ago
- Documentation and ressources of Kraby, an open-source hexapod robot☆14Aug 24, 2020Updated 5 years ago
- Behavior planner fusing runtime verification on traffic rules with single- and multi-agent Monte Carlo Tree Search☆11Jun 15, 2021Updated 4 years ago
- TD-VAE in PyTorch☆10May 28, 2019Updated 6 years ago
- ☆13May 10, 2021Updated 5 years ago
- A Unity WebGL project for a TicTacToe game, using Monte Carlo Tree Search (MCTS) for its AI decision making.☆13Mar 18, 2023Updated 3 years ago
- ☆22Dec 1, 2021Updated 4 years ago
- Astar and RRT implementation using matplotlib☆10May 24, 2020Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆15Sep 22, 2023Updated 2 years ago
- Supporting code for the paper "Predicting aptamer sequences that interact with target proteins using an Aptamer-Protein Interaction class…☆15Dec 31, 2021Updated 4 years ago
- PlaNet: Learning Latent Dynamics for Planning from Pixels☆10Feb 13, 2020Updated 6 years ago
- Dur360BEV: (ICRA 2025) A Real-world 360-degree Single Camera Dataset and Benchmark for Bird-Eye View Mapping in Autonomous Driving☆23Feb 2, 2026Updated 3 months ago
- Code for "Demonstration-free Autonomous Reinforcement Learning via Implicit and Bidirectional Curriculum" (ICML 2023)☆10Jul 6, 2023Updated 2 years ago
- Production build of the new website☆13May 19, 2024Updated 2 years ago
- Co-Adaptation of Algorithmic and Implementational Innovations in Inference-based Deep Reinforcement Learning (NeurIPS2021)☆20Oct 25, 2021Updated 4 years ago
- Anti exploration in offline reinforcement learning☆11May 17, 2021Updated 5 years ago
- A lightweight toolbox for debugging and benchmarking Python code☆14Oct 4, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Implementation of the Playground environment from the paper Language as a Cognitive Tool to Imagine Goals inCuriosity-Driven Exploration.☆11Mar 5, 2021Updated 5 years ago
- An implementation of the paper "Solving the Rubik's Cube without Human Knowledge"☆14Dec 9, 2018Updated 7 years ago
- A quadruped running machine in webots with three different gaits: trotting, pacing, and bounding.☆14May 22, 2022Updated 4 years ago
- Source code of Neural Logic Reinforcement Learning (https://arxiv.org/abs/1904.10729)☆77Jan 6, 2020Updated 6 years ago
- PyTorch implementation of SimPLe (Simulated Policy Learning) on the Atari 100k benchmark.☆17Dec 7, 2022Updated 3 years ago
- homework for shenlan's "Motion Planning For Mobile Robots "☆15May 14, 2020Updated 6 years ago
- An implementation of effective policy ensemble.☆16Jul 5, 2023Updated 2 years ago