KMnO4-zx/hand-on-rl

☆112

Alternatives and similar repositories for hand-on-rl

Users that are interested in hand-on-rl are comparing it to the libraries listed below.

RethinkFun / trian_ppo
☆147 · Sep 29, 2024 · Updated last year
lfy-123 / WIST
WIST: Web-Grounded Iterative Self-Play Tree for Domain-Targeted Reasoning Improvement
☆19 · Apr 10, 2026 · Updated 3 months ago
KMnO4-zx / iclr26-high-rating
☆26 · Nov 20, 2025 · Updated 8 months ago
rainfallclub / rainfallmodel
A simple and powerful tool for building Large Language Models from scratch [Train a large model from scratch]
☆19 · Sep 29, 2025 · Updated 9 months ago
hanmingbai / mimic_distill_walk
Using mimic and distill to achieve a natural, human-like walk instead of AMP
☆60 · Apr 20, 2026 · Updated 3 months ago
cyhdmjzzy / DeepEP-Code-Analysis
☆26 · Feb 27, 2026 · Updated 4 months ago
wangruohui / EfficientVideoAgent
EVA: Efficient Reinforcement Learning for End-to-End Video Agent
☆26 · May 6, 2026 · Updated 2 months ago
EvolvingLMMs-Lab / ParaVT
ParaVT: Taming the Tool Prior Paradox for Parallel Tool Use in Agentic Video Reinforcement Learning
☆54 · Jun 2, 2026 · Updated last month
senlanke / Beyondmimic_sim2sim_G1
☆26 · Jan 30, 2026 · Updated 5 months ago
fansion314 / curvefitting
An easy-to-use, MATLAB-like graphical curve-fitting tool for Python, Jupyter, and other environments.
☆10 · Jul 26, 2021 · Updated 4 years ago
KMnO4-zx / nips25-all-papers
nips25-all-papers
☆44 · Feb 26, 2026 · Updated 4 months ago
Zessay / sohu_2019
Rank 10 in the 3rd Sohu Content Recognition Challenge (2019)
☆11 · Oct 17, 2019 · Updated 6 years ago
Raxxll / ChatBI
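This project uses LangChain and large language models (such as ZhipuAI) to build an intelligent database question-answering system. The system interprets a user's query request in natural language, automatically generates and executes the corresponding SQL statement, and returns the query result to the user in natural language.
☆15 · Jul 31, 2024 · Updated last year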
jjhw / SayCan_experimental
Proof of concept of the SayCan project applied to a real UR5 robot
☆10 · May 15, 2023 · Updated 3 years ago
Hymwgk / panda_go_grasp
Uses a Panda robot arm to receive a grasp pose, then performs the grasp and some other operations
☆12 · Mar 2, 2022 · Updated 4 years ago
limafang / tiny-graphrag
☆44 · May 9, 2025 · Updated last year
QinYang12 / SVGC-AVA
☆14 · Aug 17, 2024 · Updated last year
SynergyAnalyzer / SynergyAnalyzerToolbox
The Synergy Analyzer Toolbox for MATLAB is an open source software to extract muscle and kinematic synergies
☆22 · Jun 15, 2024 · Updated 2 years ago
sanbuphy / WhisperTranslator
A free tool that helps you transcribe, translate, and summarize videos in any language.
☆18 · Feb 27, 2024 · Updated 2 years ago
travers-rhodes / kinova_gen3_control
Open-source ros_control hardware_interface::RobotHW (hardware_interface::EffortJointInterface and hardware_interface::JointStateInterface…
☆12 · Jan 22, 2024 · Updated 2 years ago
sail-sg / offbench
☆16 · Jun 1, 2023 · Updated 3 years ago
Zachary193 / Franka-Sigma.7-Teleoperation
Franka Emika Research 3 robotic arm teleoperation by Force Dimension Sigma.7
☆14 · Jul 27, 2024 · Updated last year
foreverYoungGitHub / cookiecutter-pytorch-lightning
A modern cookiecutter template for deep learning projects with pytorch lightning that use uv for dependency management
☆39 · Aug 10, 2025 · Updated 11 months ago
Rao-Kai / Numerical-Optimization-in-Robotics
☆20 · Oct 21, 2022 · Updated 3 years ago
Hymwgk / baxter_hand_eye_calibrate
Hand-eye calibration of the Baxter dual-arm collaborative robot, based on the open-source easy_handeye project (Kinect v2, eye-to-hand)
☆11 · Dec 20, 2021 · Updated 4 years ago
kyassini / manipulation_experiments
Pick & place code for testing dynamic grasping packages (GPD and GQCNN) with the Kinova Gen3 arm
☆14 · May 12, 2021 · Updated 5 years ago
lyx3911 / srtp
My SRTP project: dual-arm collaborative learning from demonstration
☆11 · Sep 14, 2020 · Updated 5 years ago
RethinkFun / sft
☆69 · Aug 23, 2024 · Updated last year
yuan5 / -biped-robot-for-running-
Design and build a biped robot that can run, with stable running as its only goal. All flashy showpiece moves are left out. If it must have a name, call it "Kuangben" (wild run)!
☆13 · Jun 18, 2019 · Updated 7 years ago
Exusial / gaussian-splatting-jittor
☆11 · Jul 14, 2024 · Updated 2 years ago
iwangjian / Midi-Tuning
[ACL 2024] Instruct Once, Chat Consistently in Multiple Rounds: An Efficient Tuning Framework for Dialogue
☆26 · Oct 18, 2025 · Updated 9 months ago
fishbotics / avoid-everything
☆18 · Jun 29, 2025 · Updated last year
tud-amr / IA-MPPI-LBM
Training and testing scripts for the prediction model used in the "Interaction-Aware Sampling-Based MPC with Learned Local Goal Predictio…
☆21 · Nov 14, 2023 · Updated 2 years ago
ros-industrial / stomp
Stochastic Trajectory Optimization for Motion Planning (STOMP)
☆30 · Jan 2, 2026 · Updated 6 months ago
IntMeGroup / LMM4LMM
[ICCV 2025 Highlight] LMM4LMM: Benchmarking and Evaluating Large-multimodal Image Generation with LMMs
☆20 · Nov 16, 2025 · Updated 8 months ago
robofit / arcor2
Solution for end-user programming of (collaborative) robots using Augmented Reality. From AR to Python and back!
☆21 · Updated this week
owenliang / qwen-dpo
DPO training for Tongyi Qianwen (Qwen)
☆67 · Sep 21, 2024 · Updated last year
maurock / urdf_to_obj
Repository to extract obj files in world frame from a URDF description
☆21 · Jan 7, 2023 · Updated 3 years ago
luotianyou349 / PnPDA
This is the official implementation of ECCV2024 paper "Plug and Play: A Representation Enhanced Domain Adapter for Collaborative Percepti…
☆19 · Aug 13, 2024 · Updated last year