基于DQN的五子棋人机对弈
☆62Mar 24, 2019Updated 7 years ago
Alternatives and similar repositories for -Reinforcement-Learning-five-in-a-row-
Users that are interested in -Reinforcement-Learning-five-in-a-row- are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 通过python3.6编程,利用DQN算法实现机器学习避开障碍走到迷宫终点。(Through python3.6 programming, I use DQN algorithm to achieve machine learning and avoid obstacles…☆10Apr 15, 2018Updated 7 years ago
- 基于博弈树α-β剪枝搜索的五子棋AI☆779Jul 14, 2017Updated 8 years ago
- 引用整理https://blog.csdn.net/yellow_red_people/article/details/80465510 一文中PyTorch平台,利用DQN模型玩Flappy Bird游戏,是一个再励学习(强化学习)实验例子。☆52Feb 10, 2019Updated 7 years ago
- ☆18Oct 4, 2024Updated last year
- Code release for "TempLM: Distilling Language Models into Template-Based Generators"☆14Jul 21, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- 💪📈 Powerfolio! is a stock screener and portfolio analysis. Backtest buy-and-hold vs. trading on RSI. Build a portfolio using efficient…☆10Jun 7, 2021Updated 4 years ago
- A Read-time MIDI visualization tool using PyQt☆10Nov 24, 2020Updated 5 years ago
- Automate hyper-parameters tuning for NNs (learning rate, number of dense layers and nodes and activation function)☆14Aug 9, 2020Updated 5 years ago
- On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning☆16Apr 30, 2023Updated 2 years ago
- Public Repo for the paper "Overcoming The Spectral-Bias of Neural Value Approximation"☆11May 25, 2024Updated last year
- ☆12Aug 28, 2020Updated 5 years ago
- Predicting 2D Steady State Fluid Flow Fields using Convolutional Neural Networks☆11Oct 3, 2020Updated 5 years ago
- Dice Scores Recognition in images and live video using CNN.☆13Dec 19, 2020Updated 5 years ago
- Code and data for "Inferring Rewards from Language in Context" [ACL 2022].☆16May 22, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Input files and results of paper: Phase equilibrium of liquid water and hexagonal from ice enhanced sampling molecular dynamics simulatio…☆10Apr 9, 2021Updated 4 years ago
- Desktop Debugger for CS303 (Artificial Intelligence) Gomoku Project / 和自己的五子棋 AI 桌面对战☆16Sep 27, 2019Updated 6 years ago
- 基于Deep Qlearning Network的股票交易模型☆57May 15, 2017Updated 8 years ago
- Final Thesis at Fudan University, built a trading strategy on Bitcoin market using recurrent reinforcement learning☆27Nov 5, 2018Updated 7 years ago
- MaxSum is an algorithm about Distributed Constraint Optimization Problems (DCOPs)☆11Jan 15, 2018Updated 8 years ago
- Generation of columnar jointed rock using Voronoi method☆10Dec 13, 2019Updated 6 years ago
- Variational Discriminator Bottleneck: Improving Imitation Learning, Inverse RL, and GANs by Constraining Information Flow - Tensorlfow Im…☆13Feb 2, 2019Updated 7 years ago
- Statistical methods for estimating scaling laws in urban data☆11Dec 9, 2024Updated last year
- Matlab version of all the code of Lorena A. Barba's 12 steps to Navier stokes☆13Jul 17, 2019Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Geog 2021 Environmental Remote Sensing☆16Jan 4, 2019Updated 7 years ago
- Single Episode Policy Transfer in Reinforcement Learning☆17Jun 13, 2022Updated 3 years ago
- Adding Dreamer-v3's implementation tricks to CleanRL's PPO☆14May 19, 2023Updated 2 years ago
- Codes for understanding Reinforcement Learning( updating... )☆24Jan 2, 2019Updated 7 years ago
- Code for Visuotactile-Based Learning for Insertion with Compliant Hands☆21May 20, 2025Updated 10 months ago
- 16-811 Project.☆10Jan 12, 2018Updated 8 years ago
- Dockerfile to build an image with Nginx and Node (npm and yarn) on Alpine Linux☆19Aug 23, 2018Updated 7 years ago
- ☆14May 31, 2022Updated 3 years ago
- Reference implementation and experiments for combining reaction-diffusion and tissue growth☆13Jan 13, 2021Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- PyTorch Implementation of Ape-X (Distributed prioritized experience replay) architecture with DQN learner☆28Sep 5, 2020Updated 5 years ago
- 计算流体力学和离散元耦合手册☆10Jun 4, 2017Updated 8 years ago
- 基于RFID的非接触式的定位与追踪☆16Jan 10, 2021Updated 5 years ago
- Hyperpatameter Bayesian Optimization for Image Classification in PyTorch☆12Aug 20, 2019Updated 6 years ago
- 【编译原理】语法分析实验☆12May 29, 2019Updated 6 years ago
- Library for solving time dependent PDEs in FEniCS using Runge-Kutta ESDIRK methods☆13Sep 25, 2019Updated 6 years ago
- a simple asr system☆14Mar 8, 2018Updated 8 years ago