《Reinforcement Learning》读书学习与视频分享笔记
☆78Apr 1, 2025Updated 11 months ago
Alternatives and similar repositories for learn-reinforcement-learning
Users that are interested in learn-reinforcement-learning are comparing it to the libraries listed below
Sorting:
- 训练自己的中文 Embedding 模型☆28Jan 6, 2025Updated last year
- DQN for Stock Trading leverages Deep Q-Network (DQN) to develop an intelligent trading agent for stock markets. The project aims to maxim…☆12Jun 27, 2024Updated last year
- A demonstration of how to train a custom tokenizer similar to TikToken.☆15Jan 6, 2025Updated last year
- 零实现 AlphaGo Zero☆17Nov 10, 2024Updated last year
- A repository for a Deep Q-Learning approach to intrusion detection for networks cyber-attacks.☆10Sep 3, 2021Updated 4 years ago
- ☆85Feb 3, 2025Updated last year
- Implementation of Dynamic Computation Offloading Control Logic in a Software-Defined Vehicle (SDV) System☆11Dec 19, 2024Updated last year
- a Federated Learning Framework adapted for resource-constrained environments, focusing on IoT devices☆10Oct 6, 2025Updated 5 months ago
- Automatic defect recognition in X-ray testing using computer vision☆12Dec 8, 2018Updated 7 years ago
- ☆10Dec 10, 2021Updated 4 years ago
- ☆24Nov 21, 2025Updated 3 months ago
- This is a DQN-based recommendation system for item-list recommendation and it finally achieved second place in the competition of RL-base…☆11Oct 8, 2021Updated 4 years ago
- Protect workers with TensorFlow Hard Hat object detection model on a Jetson Nano☆10Sep 27, 2022Updated 3 years ago
- Portable library for binary (bi-valued) image processing☆14Jun 12, 2024Updated last year
- My undergraduate final project - Modeling and control of a distillation column using neural networks and reinforcement learning.☆12Apr 28, 2020Updated 5 years ago
- ☆10Jul 26, 2024Updated last year
- Teaching the Donkey car to drive a track in the simulator using State Representation Learning and different Reinforcement Learning Algori…☆12Dec 6, 2021Updated 4 years ago
- [ICML 2024 Oral] Consistent Adversarial Robust Deep Q Networks (CAR-DQN)☆15Feb 27, 2025Updated last year
- An HTTP client for the Rust AWS SDK that runs on Fastly Compute @ Edge☆10Nov 11, 2025Updated 3 months ago
- This operator will manage and configure data processing unit (DPUs) to be used in accelerating/offloading k8s networking functions☆12Feb 13, 2026Updated 3 weeks ago
- Reinforcement learning project using deep Q-learning to control the operations of an electrical microgrid☆10Jan 3, 2023Updated 3 years ago
- Official implementation of the UMDQN algorithm presented in the scientific research paper entitled "Distributional Reinforcement Learning…☆11Jun 3, 2022Updated 3 years ago
- Deep Reinforcement Learning based Autonomous Driving Agents☆10Jul 7, 2022Updated 3 years ago
- Source code for "Congestion-aware Distributed Task Offloading in Wireless Multi-hop Networks Using Graph Neural Networks"☆14Oct 23, 2024Updated last year
- Thesis in Federated Learning using an Edge/Cloud Computing architecture☆10Feb 26, 2021Updated 5 years ago
- ☆11Jan 11, 2022Updated 4 years ago
- Curated collection of AI for Science papers, organized by research domains. 收录并分类整理公众号【你好不吃虾】中分享的 AI for Science 论文。☆32Oct 28, 2025Updated 4 months ago
- Interface definitions for the Compute@Edge platform in witx.☆15Feb 11, 2022Updated 4 years ago
- Chest Xray Classifier using CNNs and Transfer Learning. The jupyter notebook of interest is titled 'Xrays_alt.ipynb'☆11May 18, 2018Updated 7 years ago
- QuantumVerse-Nexus is a cutting-edge blockchain platform designed to harness the power of quantum computing, artificial intelligence, and…☆12Nov 27, 2024Updated last year
- Implementation of the RTSS'23 Best Student Paper Award paper Progressive Neural Compression for Adaptive Image Offloading under Timing Co…☆13Mar 25, 2025Updated 11 months ago
- ☆12Mar 1, 2023Updated 3 years ago
- RedNote MCP - Xiaohongshu Content Search Tool☆21Jun 26, 2025Updated 8 months ago
- ☆10Jun 4, 2024Updated last year
- A SimPy-based Discrete Event Simulator for a relay node in a payment channel network using different submarine swap rebalancing policies,…☆11Oct 8, 2023Updated 2 years ago
- Code and data recipes for the paper: Optimal Condition Training for Target Source Separation by Efthymios Tzinis, Gordon Wichern, Paris S…☆14Feb 15, 2023Updated 3 years ago
- mobileNet SSD 基于caffe的前向检测☆10Nov 30, 2018Updated 7 years ago
- Easy Dataset Docs☆17Jan 21, 2026Updated last month
- Official implementation of paper "Neural Combinatorial Optimization for Multiobjective Task Offloading in Mobile Edge Computing"☆14Aug 26, 2025Updated 6 months ago