UCT tree search + supervised learning for Atari games
☆12Feb 14, 2017Updated 9 years ago
Alternatives and similar repositories for uct_atari
Users that are interested in uct_atari are comparing it to the libraries listed below
- Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019.☆10Jan 10, 2019Updated 7 years ago
- Python MUD/MUX/MUSH/MU* development system☆26Oct 30, 2015Updated 10 years ago
- Reinforcement learning library for PyTorch.☆11Jun 15, 2018Updated 7 years ago
- Implementation of a BinaryNet CNN with Chainer☆11May 5, 2016Updated 9 years ago
- a libp2p-backed daemon wrapping the functionalities of go-libp2p for use in other languages☆11Feb 9, 2025Updated last year
- ☆10Sep 20, 2018Updated 7 years ago
- Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019)☆33Dec 1, 2019Updated 6 years ago
- Libp2p bindings for Python☆12Jan 26, 2026Updated last month
- A Python implementation of PROCLUS: PROjected CLUStering algorithm.☆10Jan 12, 2015Updated 11 years ago
- Recommendation engine and its algorithms in Python and R.☆12Oct 26, 2018Updated 7 years ago
- SVM Entity Relation classification for ace2005 chinese data☆14Jun 25, 2017Updated 8 years ago
- ☆16Jun 30, 2019Updated 6 years ago
- Final project for Artificial Intelligence (DATA130008.01) @ Fudan University☆12Jun 28, 2017Updated 8 years ago
- Minimal Transformer base in JAX. A single backbone for language modelling, diffusion, classification, etc...☆14May 28, 2025Updated 9 months ago
- Memory-efficient transformer. Work in progress.☆19Sep 17, 2022Updated 3 years ago
- A collection of reading material for the Workshop on "Structure & Priors in Reinforcement Learning" (SPiRL) at ICLR 2019.☆13May 5, 2021Updated 4 years ago
- Hands-On TensorBoard for PyTorch Developers, Published by Packt☆11Dec 15, 2025Updated 3 months ago
- Contains simple MPC implementation with neural network learned dynamics.☆17Feb 16, 2018Updated 8 years ago
- A ridiculously small JavaScript gomoku AI implementation, as a jQuery plugin☆17Apr 30, 2024Updated last year
- Blog post: how to do deterministic policy gradient with gumbel softmax and why you should do it.☆12Jun 20, 2017Updated 8 years ago
- Implementation of the Self Paced Reinforcement Learning Experiments☆19Sep 27, 2023Updated 2 years ago
- ☆17Dec 15, 2025Updated 3 months ago
- ☆14Dec 28, 2021Updated 4 years ago
- ☆16Mar 24, 2023Updated 2 years ago
- An implementation of the Jenkins Traub polynomial root finding algorithm☆14Aug 23, 2015Updated 10 years ago
- [AAAI26] Trade-offs in Large Reasoning Models: An Empirical Analysis of Deliberative and Adaptive Reasoning over Foundational Capabilitie…☆10Feb 7, 2026Updated last month
- [ECCV 2024] Multiscale Sliced Wasserstein Distances as Perceptual Color Difference Measures☆32Oct 28, 2024Updated last year
- Gomoku AI based on the AlphaZero algorithm☆10Feb 27, 2019Updated 7 years ago
- An attempt to apply reinforcement learning to graph signal recovery problem☆11Aug 25, 2021Updated 4 years ago
- Stochastic Variance Reduction Policy Gradient Estimation☆11Nov 6, 2018Updated 7 years ago
- A Fast and Open Source Autonomous Perception System.☆26Nov 23, 2022Updated 3 years ago
- Model Primitive Hierarchical Reinforcement Learning☆13Dec 8, 2022Updated 3 years ago
- Official implementation for our paper "Unsupervised Learning of Lagrangian Dynamics from Images for Prediction and Control"☆20Sep 2, 2022Updated 3 years ago
- Explore and find reinforcement learning environments in a list of 150+ open source environments.☆90Jan 13, 2023Updated 3 years ago
- Dataset for Image-Goal Navigation in Habitat☆11Feb 24, 2022Updated 4 years ago
- Deep learning algorithms: A sparse autoencoder (and someday more algorithms), implemented in Common Lisp.☆27Jun 10, 2010Updated 15 years ago
- Accompanying repository for Unsupervised Active Domain Randomization in Goal-Directed RL☆12Aug 4, 2020Updated 5 years ago
- Interpretability dashboard for reinforcement learners☆16Jun 4, 2019Updated 6 years ago
- ☆12Mar 12, 2024Updated 2 years ago