waylandzhang / learn-reinforcement-learningView external linksLinks
《Reinforcement Learning》读书学习与视频分享笔记
☆76Apr 1, 2025Updated 10 months ago
Alternatives and similar repositories for learn-reinforcement-learning
Users that are interested in learn-reinforcement-learning are comparing it to the libraries listed below
Sorting:
- 训练自己的中文 Embedding 模型☆28Jan 6, 2025Updated last year
- A demonstration of how to train a custom tokenizer similar to TikToken.☆16Jan 6, 2025Updated last year
- 零实现 AlphaGo Zero☆17Nov 10, 2024Updated last year
- A repository for a Deep Q-Learning approach to intrusion detection for networks cyber-attacks.☆10Sep 3, 2021Updated 4 years ago
- Automatic defect recognition in X-ray testing using computer vision☆12Dec 8, 2018Updated 7 years ago
- a Federated Learning Framework adapted for resource-constrained environments, focusing on IoT devices☆10Oct 6, 2025Updated 4 months ago
- ☆10Dec 10, 2021Updated 4 years ago
- Implementation of Dynamic Computation Offloading Control Logic in a Software-Defined Vehicle (SDV) System☆11Dec 19, 2024Updated last year
- This is a DQN-based recommendation system for item-list recommendation and it finally achieved second place in the competition of RL-base…☆11Oct 8, 2021Updated 4 years ago
- This operator will manage and configure data processing unit (DPUs) to be used in accelerating/offloading k8s networking functions☆12Jan 10, 2026Updated last month
- ☆24Nov 21, 2025Updated 2 months ago
- 增加了indextts2的简单的界面与api调用方式☆20Oct 27, 2025Updated 3 months ago
- My undergraduate final project - Modeling and control of a distillation column using neural networks and reinforcement learning.☆12Apr 28, 2020Updated 5 years ago
- Official implementation of the UMDQN algorithm presented in the scientific research paper entitled "Distributional Reinforcement Learning…☆11Jun 3, 2022Updated 3 years ago
- [ICML 2024 Oral] Consistent Adversarial Robust Deep Q Networks (CAR-DQN)☆15Feb 27, 2025Updated 11 months ago
- ☆11Jan 11, 2022Updated 4 years ago
- Thesis in Federated Learning using an Edge/Cloud Computing architecture☆10Feb 26, 2021Updated 4 years ago
- Teaching the Donkey car to drive a track in the simulator using State Representation Learning and different Reinforcement Learning Algori…☆12Dec 6, 2021Updated 4 years ago
- Reinforcement learning project using deep Q-learning to control the operations of an electrical microgrid☆10Jan 3, 2023Updated 3 years ago
- Deep Reinforcement Learning based Autonomous Driving Agents☆10Jul 7, 2022Updated 3 years ago
- Source code for "Congestion-aware Distributed Task Offloading in Wireless Multi-hop Networks Using Graph Neural Networks"☆14Oct 23, 2024Updated last year
- ☆10Jul 26, 2024Updated last year
- Protect workers with TensorFlow Hard Hat object detection model on a Jetson Nano☆10Sep 27, 2022Updated 3 years ago
- Portable library for binary (bi-valued) image processing☆14Jun 12, 2024Updated last year
- Deep Q learning algorithm written on PyTorch for solving 2D robot arm reacher☆12Feb 19, 2020Updated 5 years ago
- OpenAI Whisper demo on Axera☆14Jan 15, 2026Updated last month
- ☆10Jun 11, 2018Updated 7 years ago
- A systematic design process for a self-organizing neuro-fuzzy Q-network for model-free and offline reinforcement learning.☆11May 29, 2023Updated 2 years ago
- Official implementation of the paper "MTL-Split: Multi-Task Learning for Edge Devices using Split Computing" accepted @ DAC 2024.☆10Dec 3, 2024Updated last year
- Various DQN method with cartpole☆11May 30, 2018Updated 7 years ago
- A Distribute and Reactive Approach for Real-time Task Offloading in the MEC Environment☆10May 28, 2020Updated 5 years ago
- Avoiding catastrophic failures in reinforcement learning by learning to shape rewards.☆10Nov 13, 2017Updated 8 years ago
- Official codes for "Training Deep Q-Network via Monte Carlo Tree Search for Adaptive Bitrate Control in Video Delivery"☆10Jul 21, 2023Updated 2 years ago
- Script to help download, configure, install, and run the OpenMP offloading version of OpenMC☆10May 3, 2024Updated last year
- Boosting GPU utilization for LLM serving via dynamic spatial-temporal prefill & decode orchestration☆33Jan 8, 2026Updated last month
- Real time image capture+DQN path planning☆12May 29, 2023Updated 2 years ago
- Try to replicate test with the paper "Adaptive Resource Allocation in Future Wireless Networks With Blockchain and Mobile Edge Computing"☆10Nov 1, 2023Updated 2 years ago
- Source code for the paper "Energy-Efficient Client Sampling for Federated Learning in Heterogeneous Mobile Edge Computing Networks", this…☆13Aug 22, 2024Updated last year
- ☆10Feb 9, 2025Updated last year