强化学习求解迷宫问题,Q-learning和监督学习
☆26Sep 20, 2020Updated 5 years ago
Alternatives and similar repositories for Maze-solver-using-reinforcement-learning
Users that are interested in Maze-solver-using-reinforcement-learning are comparing it to the libraries listed below
Sorting:
- 基于强化学习DQN实现的走迷宫程序☆11Mar 25, 2020Updated 5 years ago
- 使用强化学习和深度 Q 学习的 AI 驱动的蛇游戏。☆20Jul 19, 2023Updated 2 years ago
- Novel Visual Category Discovery with Dual Ranking Statistics and Mutual Knowledge Distillation. Bingchen Zhao and Kai Han. (NeurIPS 2021)☆12Aug 20, 2023Updated 2 years ago
- ☆13May 23, 2024Updated last year
- ☆11May 6, 2021Updated 4 years ago
- 基于强化学习的游戏空战推演☆13May 8, 2021Updated 4 years ago
- Parallel implementations of Bellman-Ford algorithm with MPI, OpenMP and CUDA.☆11Sep 25, 2018Updated 7 years ago
- A program that runs a sobel filter edge detection algorithm on an image using a single thread on the CPU, another using OpenMP to paralle…☆10Oct 18, 2017Updated 8 years ago
- A PyTorch implementation of MixNet: Mixed Depthwise Convolutional Kernels☆11Aug 5, 2019Updated 6 years ago
- MRCPSP: This is an implementation of multi-mode resource constrained project scheduling problem (MRCPSP) in MATLAB.☆11May 10, 2019Updated 6 years ago
- ☆11May 27, 2022Updated 3 years ago
- lecture32_AI挑战星际争霸II(强化学习)☆17Aug 23, 2022Updated 3 years ago
- a simple test for understanding the theory of GAN, [matlab code]☆12Nov 20, 2017Updated 8 years ago
- code and demo of the ISMIR 2021 paper CollageNet☆12Jul 12, 2021Updated 4 years ago
- Age Estimation: Implementation of DEX paper in Pytorch☆10Jan 17, 2020Updated 6 years ago
- AFFNet-Unofficial Implementation☆15Aug 23, 2023Updated 2 years ago
- Codes for AAAI22 paper "Learning to Solve Travelling Salesman Problem with Hardness-Adaptive Curriculum"☆23Mar 3, 2022Updated 4 years ago
- Implementation and analysis using CUDA and openMP☆11Dec 14, 2016Updated 9 years ago
- Open Sourced ML Research Paper Implementations in Tensorflow☆19Jan 8, 2022Updated 4 years ago
- Implementing EC2-VAE to the conditional generative model to generate music with controlling rhythm patterns☆14Aug 13, 2020Updated 5 years ago
- [CVPR 2024] Targeted Representation Alignment for Open-World Semi-Supervised Learning☆15Sep 23, 2024Updated last year
- 2021-2022国科大强化学习格斗游戏大作业☆37Jun 11, 2022Updated 3 years ago
- 轻量化卷积神经网络实现(SqueezeNet/MobileNet/ShuffleNet/MnasNet)☆12Mar 5, 2026Updated 2 weeks ago
- [CVPR'24] Solving the Catastrophic Forgetting Problem in Generalized Category Discovery https://arxiv.org/pdf/2501.05272☆16Dec 24, 2024Updated last year
- code for Automatic Modulation Open Set Recognition with diffusion models☆17Jan 4, 2025Updated last year
- Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"☆30Sep 24, 2019Updated 6 years ago
- Implementation of Learning without Prejudices: Continual Unbiased Learning via Benign and Malignant Forgetting (ICLR 2023)☆13Apr 14, 2023Updated 2 years ago
- Security-Constrained Unit Commitment Programming Project☆11Jul 17, 2025Updated 8 months ago
- PyTorch code accompanying the paper "Landmark-Guided Subgoal Generation in Hierarchical Reinforcement Learning" (NeurIPS 2021).☆31Oct 27, 2021Updated 4 years ago
- ☆23Oct 6, 2024Updated last year
- Parallel FFT for big integer multiplication. Written in three versions: MPI, OpenMP and CUDA(cufft).☆15Oct 19, 2020Updated 5 years ago
- ☆19Jun 17, 2024Updated last year
- 使用深度强化学习解决视觉跟踪和视觉导航问题☆28Mar 18, 2021Updated 5 years ago
- Adversarial Examples Detection Benchmark☆17Dec 6, 2024Updated last year
- 应用强化学习在复杂的交通环境下自动学习最佳驾驶策略的方案,在测试环境下准确率达到100%。☆21Feb 26, 2017Updated 9 years ago
- ☆20Aug 13, 2024Updated last year
- ☆18Jul 22, 2021Updated 4 years ago
- PyTorch implementation of ResNeSt : Split-Attention Networks☆18Jun 13, 2020Updated 5 years ago
- Radar datasets for self-supervised radar signal recognition. This work is published at the 35th IEEE International Workshop on Machine Le…☆32Sep 22, 2025Updated 5 months ago