acezsq / dsx-rl
Code for Hands-on Reinforcement Learning (动手学强化学习)
☆66 · Jan 17, 2024 · Updated 2 years ago
Alternatives and similar repositories for dsx-rl
Users that are interested in dsx-rl are comparing it to the libraries listed below
- ☆11 · Sep 13, 2025 · Updated 5 months ago
- Reinforcement learning ☆34 · Oct 20, 2025 · Updated 3 months ago
- Collision Avoidance simulator for USV using Deep RL. A result of TTK4550 Fordypningsoppgave at NTNU ☆21 · Mar 21, 2024 · Updated last year
- [AAAI 2025] DocKylin: A Large Multimodal Model for Visual Document Understanding with Efficient Visual Slimming ☆36 · Jun 1, 2025 · Updated 8 months ago
- OpenAI gym environment for collision avoidance and path following with an AUV ☆35 · Aug 12, 2019 · Updated 6 years ago
- MRZ recognition from visa and passport documents. ☆21 · Jan 13, 2026 · Updated last month
- The repo contains source code of sampling-based LTL (linear temporal logic) path planning project. ☆11 · Sep 19, 2023 · Updated 2 years ago
- Debug DeepSpeed-Chat step by step in an IDE ☆10 · Apr 17, 2023 · Updated 2 years ago
- RO47005 Planning & Decision Making. Quadrotor model planner using probabilistic roadmap (PRM) and collision avoidance using Velocity Obst… ☆10 · Feb 28, 2022 · Updated 3 years ago
- https://hrl.boyuai.com/ ☆4,496 · Nov 22, 2022 · Updated 3 years ago
- Lists of RL Papers to Review ☆10 · Mar 2, 2023 · Updated 2 years ago
- PDF Extraction Toolkit (wraps and trains LayoutLM) ☆10 · Oct 8, 2021 · Updated 4 years ago
- [AAAI 2023] Official implementation of FiTs: Fine-grained Two-stage Training for Knowledge Base Question Answering ☆11 · Mar 10, 2023 · Updated 2 years ago
- Fully open reproduction of DeepSeek-R1 ☆12 · Mar 24, 2025 · Updated 10 months ago
- ☆14 · Aug 31, 2023 · Updated 2 years ago
- Hybrid Action PPO in stable-baselines3 ☆17 · Jan 14, 2025 · Updated last year
- ☆14 · Sep 6, 2024 · Updated last year
- 🖖 A graph-based note-taking system that aims to increase how often personal notes actually get used! ☆12 · Jan 17, 2021 · Updated 5 years ago
- GEMV implementation with CUTLASS ☆19 · Aug 21, 2025 · Updated 5 months ago
- MSS: Exploiting Mapping Score for CQF Start Time Planning in Time-Sensitive Networking ☆18 · Jun 26, 2023 · Updated 2 years ago
- Little gadgets or projects that I made; they may be finished, deprecated, or still to be finished. ☆11 · Mar 15, 2019 · Updated 6 years ago
- ☆13 · Jul 25, 2024 · Updated last year
- Official Codebase for "Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control" (NeurIPS 2024) ☆15 · Oct 29, 2024 · Updated last year
- This is a Gazebo Classic 11 simulator tutorial courseware. If you are a beginner, it is best to use a Chinese video tutorial. ☆13 · Apr 21, 2024 · Updated last year
- Code for Dataset and Benchmarks Submission, Neurips 2022 ☆13 · Aug 16, 2022 · Updated 3 years ago
- Implementation of Model-Distributed Inference for Large Language Models, built on top of LitGPT ☆13 · Aug 26, 2025 · Updated 5 months ago
- Towards explainable value functions in reinforcement learning. A framework for collision probability distribution estimation via deep tem… ☆14 · May 5, 2025 · Updated 9 months ago
- Python implement of paper "PD-FAC: Probability Density Factorized Multi-Agent Distributional Reinforcement Learning for Multi-Robot Relia… ☆11 · Mar 5, 2022 · Updated 3 years ago
- ICMEW:A_Generative_Compression_Framework_For_Low_Bandwidth_Video_Conference ☆10 · Dec 7, 2021 · Updated 4 years ago
- Awesome Video Coding Papers ☆13 · Feb 19, 2025 · Updated 11 months ago
- the implementation of Q_Learning ☆18 · Jun 12, 2019 · Updated 6 years ago
- A simple self-driving car implemented with python ☆14 · Jul 23, 2022 · Updated 3 years ago
- Parameter-efficient fine-tuning of ChatGLM-6B based on LoRA and P-Tuning v2 ☆55 · May 17, 2023 · Updated 2 years ago
- Demo code of ACMMM 2022 "Quality Assessment of Image Super-Resolution: Balancing Deterministic and Statistical Fidelity" ☆14 · Oct 13, 2022 · Updated 3 years ago
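- Attack-Defense is a Python reproduction of the models from selected chapters of the book 《弹道导弹攻防对抗的建模与仿真》 (Modeling and Simulation of Ballistic Missile Attack-Defense Confrontation). ☆16 · Jan 16, 2023 · Updated 3 years ago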
- A new improved sca algorithm and application for online path planning of multi-robot systems ☆14 · Mar 27, 2023 · Updated 2 years ago
- ☆15 · Nov 26, 2019 · Updated 6 years ago
- PDF Paper File Rename Software: automatically extracts the article title of a PDF paper and uses it as that PDF's file name ☆11 · May 18, 2021 · Updated 4 years ago
- Documentation for HoloOcean ☆15 · Feb 5, 2026 · Updated last week