Notes and code from studying reinforcement learning
☆12Jul 27, 2020Updated 5 years ago
Alternatives and similar repositories for RL_notes_and_codes
Users that are interested in RL_notes_and_codes are comparing it to the libraries listed below.
- A design optimization study of underwater vehicle using Bayesian optimization and deep learning based surrogate model☆13Mar 13, 2023Updated 3 years ago
- This is our Final Year Project in Bachelors. We try to avoid congestion on two levels i.e Intersection level and Infrastructure to Vehicl…☆10Oct 12, 2020Updated 5 years ago
- ☆11Jan 21, 2022Updated 4 years ago
- Python package for Simulink-based reinforcement learning environments.☆11Aug 20, 2021Updated 4 years ago
- standalone node and matlab wrapper for teb-planner package☆13Jul 1, 2021Updated 4 years ago
- CMU Masters Thesis Project: UAV Path Planning and Human Trajectory Prediction for Navigation through Work Sites.☆11May 4, 2021Updated 4 years ago
- Implementation of an ADAS feature in Matlab and Simulink☆12Jan 1, 2019Updated 7 years ago
- A package for integrating multiple ultrasonic sensors into ROS☆12Nov 1, 2017Updated 8 years ago
- A factor backtesting framework for everyone (stock factor test)☆30Updated this week
- ☆12Jan 3, 2022Updated 4 years ago
- Path Planner Plugin for turtlebot3 using KinoDynamic A Star in ROS melodic☆12Jun 28, 2020Updated 5 years ago
- Optimal and Full Coverage Path Planning for Agricultural Sector☆14May 19, 2021Updated 4 years ago
- Work related to the master thesis: Side-Scan Sonar Imaging and Error-State Kalman Filter Aiding Unmanned Underwater Vehicle (UUV) to Auto…☆15Jan 31, 2023Updated 3 years ago
- PyTorch implementation of R2D2 (Recurrent Replay Distributed DQN)☆12Nov 14, 2019Updated 6 years ago
- ☆12Aug 24, 2020Updated 5 years ago
- Facebear's minimal implementation of SBAC (Soft behavior regularized actor critic, NIPS22 offline RL workshop)☆11Jul 4, 2022Updated 3 years ago
- Genetic Algorithm, GA—matlab☆11Oct 16, 2019Updated 6 years ago
- ☆14Aug 26, 2018Updated 7 years ago
- Chinese-language notes for UCB CS294-112 Deep Reinforcement Learning☆51Jan 2, 2021Updated 5 years ago
- Course notes: David Silver, CS294 ...☆15Jan 7, 2019Updated 7 years ago
- A simple automatic parking system for car based on fuzzy logics by matlab.☆14Apr 26, 2017Updated 8 years ago
- This directory simulates UUV dynamics and control purely in the Matlab programming language.☆19Nov 10, 2017Updated 8 years ago
- REDSearch: A scalable, cost-efficient framework for long-horizon search agents. Features complex task synthesis, optimized mid-training, …☆70Feb 26, 2026Updated 3 weeks ago
- Reinforcement Learning Algorithms Trained with Unity☆13Apr 26, 2019Updated 6 years ago
- Training reinforcement learning trading agents on the A-share (stock) market☆341Mar 27, 2024Updated last year
- The code has been implemented in Carla Simulator with the help of Double DQN to train an agent how to drive autonomously using different …☆16Aug 20, 2019Updated 6 years ago
- SIA - C++/Python library for model-based stochastic estimation and optimal control☆23Apr 3, 2024Updated last year
- Hydronautics team simple AUV simulator based on Gazebo for testing control algorithms☆15Mar 3, 2026Updated 3 weeks ago
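- End-to-end development pipeline for a medical large language model based on qwen3: 0. tokenizer training 1. continued pretraining 2. fine-tuning 3. reinforcement learning 4. quantization 5. distillation 6. evaluation 7. LoRA model merging 8. serving 9. deployment☆32Jan 3, 2026Updated 2 months ago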
- ☆13Feb 29, 2020Updated 6 years ago
- A PyTorch implementation of SEED, originally created by Google Research for TensorFlow 2.☆15Dec 8, 2020Updated 5 years ago
- Study notes on the algorithm engineer tech stack☆15Aug 22, 2022Updated 3 years ago
- A safe and efficient autonomous driving algorithm. Winner of the 2019 DriveML Huawei Autonomous Vehicles Challenge. Built using RLLib and…☆18Jan 24, 2020Updated 6 years ago
- Matlab implementation of the Pure Pursuit algorithm☆16Oct 12, 2022Updated 3 years ago
- "4D TRAJECTORY GENERATION FOR GUIDANCE MODULE OF A UAV FOR A GATE TO GATE FLIGHT IN PRESENCE OF TURBULENCE", International Journal of A…☆16Jul 29, 2018Updated 7 years ago
- Deploy machine learning models with TensorFlow Serving and Docker☆10Dec 5, 2018Updated 7 years ago
- Codebase - Comparing DRL algorithms' ability to safely navigate challenging waters☆16Aug 18, 2021Updated 4 years ago
- Simulates the working environment for Kraken-our Autonomous Underwater Vehicle.☆18Aug 22, 2013Updated 12 years ago
- MSc Informatics dissertation project - University of Edinburgh: Curiosity in Multi-Agent Reinforcement Learning☆13Aug 16, 2019Updated 6 years ago