train AI agents to master Free-style Gomoku(五子棋)
☆24Mar 2, 2024Updated 2 years ago
Alternatives and similar repositories for gomoku_rl
Users that are interested in gomoku_rl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An automatic differentiation system for dense and sparse problems☆13Jan 16, 2025Updated last year
- 无人机编队重构☆12Jul 28, 2018Updated 7 years ago
- RenderToy is an experimental path tracing rendering library for academic purposes.☆12Apr 15, 2023Updated 3 years ago
- A collection of awesome projects using MuJoCo.☆16May 27, 2025Updated 10 months ago
- Sourcecode & CAD drawings of NimbRo-OP☆27Oct 30, 2012Updated 13 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- 实践番茄工作法:工作时屏蔽浪费时间的网站,休息时允许访问。A Chrome/Edge extension that helps you stay focused by blocking sites during work timers and letting you bro…☆13Jul 26, 2022Updated 3 years ago
- Retarget from Human Mesh Descriptions (SMPL, SMPL-X, etc) to Humanoid Poses☆21Apr 11, 2025Updated last year
- ☆11Nov 18, 2023Updated 2 years ago
- Multi Agent Reinforcement Learning for ROS in 2D Simulation Environments☆16Nov 15, 2021Updated 4 years ago
- The code for the paper, 'Meta-Curvature, Eunbyung Park and Junier Oliver, NeurIPS 2019'☆11Jan 20, 2020Updated 6 years ago
- A lightweight driving simulator, written in Julia.☆19Sep 25, 2024Updated last year
- ☆25Nov 25, 2025Updated 4 months ago
- Code for Scalable Offline Model-Based RL with Action chunking☆21Feb 20, 2026Updated last month
- [CoRL 2025] Robot Learning from Any Images☆34Nov 11, 2025Updated 5 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Research Project on Multi-robot Target Tracking via Deep Reinforcement Learning☆21Dec 17, 2020Updated 5 years ago
- Official implementation of Get a Grip: Multi-Finger Grasp Evaluation at Scale Enables Robust Sim-to-Real Transfer☆32Feb 12, 2025Updated last year
- ☆10Mar 11, 2024Updated 2 years ago
- Code-base for the paper Spectral Normalisation for Deep Reinforcement Learning: An Optimisation Perspective.☆11Jun 26, 2021Updated 4 years ago
- (NeurIPS 2025) LaRes: Evolutionary Reinforcement Learning with LLM-based Adaptive Reward Search☆23Mar 9, 2026Updated last month
- ☆16Jan 30, 2025Updated last year
- Mixed complementarity problems parameterized by "runtime"-parameters with support for implicit differentiation.☆21Nov 24, 2025Updated 4 months ago
- 集群算法olfati saber论文仿真☆22Dec 6, 2022Updated 3 years ago
- Planning with inferred internal states of other players in general-sum differential games.☆17May 3, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code for experiments on transformers using Markovian data.☆22Nov 22, 2024Updated last year
- ☆13Jul 9, 2018Updated 7 years ago
- [NeurIPS 2024] Doubly Mild Generalization for Offline Reinforcement Learning☆16Oct 29, 2025Updated 5 months ago
- Implementation of Hierarchical Control for Head-to-Head Autonomous Racing paper☆19Feb 11, 2024Updated 2 years ago
- mobile 3D model library☆23Oct 15, 2015Updated 10 years ago
- A Metropolis-Hastings MCMC sampler accelerated via diffusion models☆17Jul 25, 2024Updated last year
- Does Self-supervision Always Improve Few-shot Learning? - MLRC 2021 and ReScience-C☆22May 23, 2022Updated 3 years ago
- Proximal Policy Optimization (PPO) written in C++ with PyTorch (LibTorch)☆17Jul 21, 2024Updated last year
- Implementation of ``Actor-Critic Alignment for Offline-to-Online Reinforcement Learning''☆13Oct 12, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- PyTorch Implementation of the Sequential Multiagent Rollout algorithm☆11Jun 28, 2024Updated last year
- M^3PC: Test-Time Model Predictive Control for Pretrained Masked Trajectory Model, ICLR 2025☆19Mar 17, 2025Updated last year
- Simulation studies for research "Tac-Man: Tactile-Informed Prior-Free Manipulation of Articulated Objects".☆39Nov 20, 2025Updated 4 months ago
- 这是参加顶会的会议纪要☆16Dec 7, 2019Updated 6 years ago
- Underactuated Robotics Spring 2020 Final Project☆15May 13, 2020Updated 5 years ago
- ☆16Apr 20, 2018Updated 7 years ago
- The official repository for the paper "Real-world Reinforcement Learning from Suboptimal Interventions”.☆45Mar 19, 2026Updated 3 weeks ago