David Silver [Reinforcement Learning] Reinforcement Learning Course slides. This resource contains the PPT slides that accompany David Silver's reinforcement learning course.
☆15Apr 27, 2019Updated 6 years ago
Alternatives and similar repositories for DavidSilverRLPPT
Users that are interested in DavidSilverRLPPT are comparing it to the libraries listed below.
- Manage cartographer and cartographer_ros together with CLion + CMake, enabling compilation and step-by-step debugging of carto☆14Aug 27, 2021Updated 4 years ago
- 🍓 A toy object-oriented programming language written in Rust☆17Apr 10, 2024Updated last year
- [ICLR 2025] Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning (SASR)☆10Aug 26, 2025Updated 7 months ago
- ☆14Oct 11, 2022Updated 3 years ago
- Code for paper: Reward Uncertainty for Exploration in Preference-based Reinforcement Learning☆15May 26, 2022Updated 3 years ago
- The official implementation of "Mind the Gap: Offline Policy Optimization for Imperfect Rewards" (ICLR2023)☆16Mar 3, 2023Updated 3 years ago
- Classifying verified users on Twitter based on how likely they are to share fake news articles☆12Jul 22, 2023Updated 2 years ago
- Simulation scripts used to create 3D MOSFET example used in: J. E. Sanchez and Q. Chen, "Element Edge Based Discretization for TCAD Devic…☆16Nov 17, 2023Updated 2 years ago
- Implementation of Hash table for Nießner's Voxel Hashing method☆16Sep 2, 2015Updated 10 years ago
- Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning☆20Dec 30, 2022Updated 3 years ago
- Exports messages from topics in ROS bag files to CSV files. Matlab scripts then import the CSV files to Matlab workspaces.☆10Jun 15, 2016Updated 9 years ago
- a LiDAR-based Framework for Perception-aware Planning with Perturbation-induced Metric☆16Apr 18, 2025Updated 11 months ago
- Materials for the paper "Trajectory Replanning for Quadrotors Using Kinodynamic Search and Elastic Optimization"☆12Oct 16, 2017Updated 8 years ago
- ☆11Nov 29, 2021Updated 4 years ago
- Interfacing MOOS IvP to the BlueROV2☆11May 29, 2018Updated 7 years ago
- The official repository for BaMBNet☆22May 31, 2023Updated 2 years ago
- Navigation control algorithms based on artificial vector fields☆11Aug 17, 2023Updated 2 years ago
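- Assignments and labs for the Fall 2023 graduate course Network Security at the School of Computer Science and Technology; for reference only.☆14Jan 6, 2024Updated 2 years ago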
- Study notes on the new version of MuJoCo☆21Apr 2, 2023Updated 2 years ago
- Some Multi-Agent Path Planning algorithms☆13Sep 27, 2020Updated 5 years ago
- Collision Avoidance using Buffered Voronoi Cell☆14Feb 10, 2017Updated 9 years ago
- Open-source code for the paper "Preserving Relative Localization of FoV-Limited Drone Swarm via Active Mutual Observation"☆23Nov 18, 2024Updated last year
- ☆14Jul 15, 2022Updated 3 years ago
- Plugin and utils for viewing voxel grids in RViz☆16May 10, 2021Updated 4 years ago
- A programmable servo capable of making a defined full 360° turn (or more)☆13Dec 25, 2019Updated 6 years ago
- State-of-the-art background removal model, designed to effectively separate foreground from background. <metadata> gpu: T4 | collections:…☆23Mar 11, 2025Updated last year
- A program for physically simulating a dove, developed for "Data-driven Control of Flapping Flight, ACM Transactions on Gr…☆12Dec 6, 2019Updated 6 years ago
- Learning with Higher Expressive Power than Neural Networks (On Learning PDEs)☆16Feb 17, 2021Updated 5 years ago
- RAPA-Planner: Robust and Efficient Motion Planning for Quadrotors Based on Parallel RA-MPPI☆24Oct 9, 2024Updated last year
- Code for the paper "Non-Linear Trajectory Optimization for Large Step-Ups: Application to the Humanoid Robot Atlas"☆19Mar 3, 2021Updated 5 years ago
- Demonstration of various solutions solving the cart pole problem in OpenAI gym.☆18Jun 14, 2018Updated 7 years ago
- Fourier Neural Operator☆13May 24, 2023Updated 2 years ago
- This repo contains code for GeoMLE intrinsic dimension estimation algorithm☆20Jul 10, 2020Updated 5 years ago
- ☆13Sep 1, 2019Updated 6 years ago
- Spatial-temporal Trajectory Planning for UAV Teach-and-Repeat☆16Jun 30, 2019Updated 6 years ago
- An example illustrating how offline geographical map capabilities can be added to LabVIEW (using a .NET control)☆13Apr 15, 2022Updated 3 years ago
- multi objective, single objective optimization, genetic algorithm for multi-objective optimization, particle swarm intelligence, ... impl…☆15May 17, 2020Updated 5 years ago
- BiC-MPPI: Goal-Pursuing, Sampling-Based Bidirectional Rollout Clustering Path Integral for Trajectory Optimization☆20Sep 25, 2024Updated last year
- This is the code repository of a tutorial overview of Path Integral (PI) approaches for stochastic optimal control and trajectory optimiz…☆24Sep 5, 2023Updated 2 years ago