ChengpengLi1003/Q-learning


A reimplementation of the classic tabular Q-learning algorithm that supports most gym environments with discrete action and state spaces, such as CliffWalking-v0.

☆10
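
For readers unfamiliar with the technique named in the description, here is a minimal sketch of tabular Q-learning with an epsilon-greedy policy. It is not the repository's code. To stay self-contained it does not use gym; the `step` function, the 4x12 grid, and all hyperparameter values are assumptions loosely modelled on CliffWalking-v0.

```python
import random

# Hypothetical 4x12 grid loosely modelled on CliffWalking-v0 (an assumption,
# not taken from the repository).
ROWS, COLS = 4, 12
START, GOAL = (3, 0), (3, 11)
ACTIONS = [(-1, 0), (0, 1), (1, 0), (0, -1)]  # up, right, down, left


def step(state, action):
    """Apply one move and return (next_state, reward, done)."""
    # Moves that would leave the grid are clipped to the border.
    r = min(max(state[0] + ACTIONS[action][0], 0), ROWS - 1)
    c = min(max(state[1] + ACTIONS[action][1], 0), COLS - 1)
    if r == ROWS - 1 and 0 < c < COLS - 1:
        # Stepping into the cliff costs -100 and sends the agent back to start.
        return START, -100, False
    return (r, c), -1, (r, c) == GOAL


def train(episodes=500, alpha=0.5, gamma=1.0, epsilon=0.1, seed=0):
    """Tabular Q-learning; returns a dict mapping state -> list of action values."""
    rng = random.Random(seed)
    q = {(r, c): [0.0] * len(ACTIONS) for r in range(ROWS) for c in range(COLS)}
    for _ in range(episodes):
        state, done = START, False
        while not done:
            # Epsilon-greedy action selection.
            if rng.random() < epsilon:
                action = rng.randrange(len(ACTIONS))
            else:
                action = max(range(len(ACTIONS)), key=lambda a: q[state][a])
            nxt, reward, done = step(state, action)
            # Q-learning target: bootstrap from the best next action,
            # with no bootstrap term on the terminal transition.
            target = reward + (0.0 if done else gamma * max(q[nxt]))
            q[state][action] += alpha * (target - q[state][action])
            state = nxt
    return q
```

Calling `train()` returns the learned Q-table; a greedy policy can be read off by taking the highest-valued action in each state.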

Alternatives and similar repositories for Q-learning

Users who are interested in Q-learning are comparing it to the libraries listed below.

JackShDr / InfluentialRS
Implementations of Influential Recommender System
☆12 · Oct 29, 2024 · Updated last year
MIRALab-USTC / GNN-LMC
The code of paper LMC: Fast Training of GNNs via Subgraph Sampling with Provable Convergence. Zhihao Shi, Xize Liang, Jie Wang. ICLR 2023…
☆48 · Feb 15, 2023 · Updated 3 years ago
laohao78 / Lerobot-Mujoco
A robot learning tutorial based on LeRobot and MuJoCo, with complete reproductions of the ACT, pi0, and SmolVLA models: data collection, training, and deployment.
☆16 · Apr 26, 2026 · Updated 2 months ago
ni-ning / AdvancePython
Advanced Python programming
☆15 · Dec 18, 2019 · Updated 6 years ago
algopapi / RetroformAgent
Langchain Agent finetuning using 7B LLAMA 2 on hotpotQA (Retroformer framework)
☆16 · Sep 5, 2023 · Updated 2 years ago
narendrant7 / dblp-coauthor-network-analysis
Analyse the social network of co-authors on the DBLP website (https://dblp.uni-trier.de) using NetworkX.
☆13 · May 27, 2020 · Updated 6 years ago
dDostalker / Penguin
An analyzer for PE files
☆16 · Jan 14, 2026 · Updated 6 months ago
xiaochuang-lxc / protocol
An ASCII Header Generator for Network Protocols
☆14 · Dec 12, 2024 · Updated last year
9beach / jech-set-theory-solutions
A solutions manual for Set Theory by Thomas Jech
☆14 · Aug 12, 2018 · Updated 7 years ago
hanningzhang / prm
☆17 · Nov 3, 2024 · Updated last year
MIRALab-USTC / DD-RetroDCVAE
A novel template-free retrosynthesizer that can generate diverse sets of reactants for a desired product via discrete conditional variati…
☆15 · Aug 7, 2022 · Updated 3 years ago
Tlntin / booking_simulator
☆11 · Jan 6, 2024 · Updated 2 years ago
strands-project / strands_executive
Executive control code for STRANDS robots.
☆11 · Feb 13, 2020 · Updated 6 years ago
MIRALab-USTC / RL-SCPO
The code of paper *Learning Robust Policy against Disturbance in Transition Dynamics via State-Conservative Policy Optimization*.
☆18 · Mar 26, 2022 · Updated 4 years ago
MIRALab-USTC / KGRPapers
Must-read papers on Knowledge Graph Reasoning (KGR)
☆21 · Mar 16, 2020 · Updated 6 years ago
Safe-RL-Power-Systems-Control / Voltage-Control
☆13 · Oct 5, 2021 · Updated 4 years ago
Carl0520 / DANN_pytorch-
Implementation of the paper Unsupervised Domain Adaptation by Backpropagation
☆11 · Dec 1, 2018 · Updated 7 years ago
menggedu / EDL
Code and data for the paper "Large language models for automatic equation discovery of nonlinear dynamics"
☆14 · Mar 6, 2025 · Updated last year
cmu-l3 / minictx-eval
Neural theorem proving evaluation via the Lean REPL
☆24 · Jul 12, 2025 · Updated last year
NatLabRockies / learning-building-control
☆20 · Jan 26, 2024 · Updated 2 years ago
ZhuXMMM / Afford-X
☆11 · Apr 23, 2025 · Updated last year
frankroeder / lanro-gym
OpenAI gym environments for goal-conditioned and language-conditioned reinforcement learning
☆14 · Jan 27, 2026 · Updated 5 months ago
braraki / logical-options-framework
☆10 · Jun 7, 2021 · Updated 5 years ago
ChirikjianLab / primp-python
[T-RO] Python implementation of PRobabilistically-Informed Motion Primitives (PRIMP)
☆14 · Apr 19, 2024 · Updated 2 years ago
zjujdj / MetalProGNet
☆15 · Jul 7, 2024 · Updated 2 years ago
ekorudiawan / DQN-robot-arm
Deep Q-learning algorithm written in PyTorch for solving a 2D robot arm reacher task
☆12 · Feb 19, 2020 · Updated 6 years ago
boyuezhong / SSGCN
SSGCN
☆11 · Jul 23, 2020 · Updated 6 years ago
darcywep / MCTS_Gobang
A basic Gobang (five-in-a-row) game based on Monte Carlo tree search (MCTS).
☆13 · Apr 8, 2021 · Updated 5 years ago
JingHuangLab / SWIT
☆11 · Jul 1, 2024 · Updated 2 years ago
labicon / POLICEd-RL
Official Code Repository for the POLICEd-RL Paper: https://www.roboticsproceedings.org/rss20/p104.html
☆14 · Mar 4, 2025 · Updated last year
zouwj16 / MUPO
Code for Policy Bifurcation in Safe Reinforcement Learning
☆10 · Jul 4, 2025 · Updated last year
fdeng18 / dreamer-pro
☆38 · Dec 26, 2022 · Updated 3 years ago
MIRALab-USTC / KGEPapers
Must-read papers on Knowledge Graph Embedding
☆29 · Oct 15, 2020 · Updated 5 years ago
ChengpengLi1003 / DotaMath
☆30 · Dec 27, 2024 · Updated last year
OpenDFM / Rememberer
[NeurIPS 2023] Large Language Models Are Semi-Parametric Reinforcement Learning Agents
☆40 · May 2, 2024 · Updated 2 years ago
jeah-z / IFP-RNN
A molecule generative model that uses interaction fingerprints (docking pose) as constraints.
☆15 · Feb 13, 2022 · Updated 4 years ago
lyxwll / Android---project
Some open-source Android projects I have come across
☆20 · Sep 11, 2017 · Updated 8 years ago
hiwonjoon / IROS2021_SORS
☆11 · Jul 29, 2021 · Updated 4 years ago
YilunZhou / RoCUS
Code repository for the CoRL 2021 paper "RoCUS: Robot Controller Understanding via Sampling"
☆11 · Mar 24, 2022 · Updated 4 years ago