epignatelli / human-level-control-through-deep-reinforcement-learningView external linksLinks
A jax/stax implementation of: Mnih, V., Kavukcuoglu, K., Silver, D., Rusu, A.A., Veness, J., Bellemare, M.G., Graves, A., Riedmiller, M., Fidjeland, A.K., Ostrovski, G. and Petersen, S., 2015. Human-level control through deep reinforcement learning. nature, 518(7540), pp.529-533.
☆10Dec 7, 2020Updated 5 years ago
Alternatives and similar repositories for human-level-control-through-deep-reinforcement-learning
Users that are interested in human-level-control-through-deep-reinforcement-learning are comparing it to the libraries listed below
Sorting:
- This is the code for paper "Scalable Resource Management for Dynamic MEC: An Unsupervised Link-Output Graph Neural Network Approach"☆28Oct 7, 2023Updated 2 years ago
- kun-chat is a lightweight AI conversation app based on Ollama/kun-chat 是一款基于 Ollama 的轻量级 AI 对话应用☆10Jul 16, 2025Updated 6 months ago
- A two phase Fourier neural operator based model for predicting pressures and saturations in porous media☆13May 22, 2025Updated 8 months ago
- ☆39Jul 24, 2024Updated last year
- 免费下载网易云歌单里的所有歌 - Get your Netease Cloud Music playlist BY FREE☆11Jul 27, 2023Updated 2 years ago
- ☆10Apr 2, 2024Updated last year
- ☆11Jul 22, 2024Updated last year
- AWS virtual infrastructure simulator for training reinforcement learning based cloud capacity management systems☆11Sep 23, 2020Updated 5 years ago
- ☆16Jun 5, 2025Updated 8 months ago
- ☆13Sep 4, 2024Updated last year
- This is the MATLAB based simulation of optimized Vehicular Fog Computing framework that minimizes the latency during the computatin tasks…☆12Jul 11, 2024Updated last year
- ☆14Nov 19, 2024Updated last year
- Delay Differential Equations in Haskell☆11Dec 4, 2018Updated 7 years ago
- Haskell binding for Menoh DNN inference library☆12Nov 30, 2018Updated 7 years ago
- This project is a conversion of the source code from VHDL BY EXAMPLE by Blaine C. Readler, and some slightly modfied examples from the C…☆10Sep 4, 2017Updated 8 years ago
- 这是一个为大模型提供 A 股数据的的 MCP(Model Content Protocol) 服务。☆20Aug 31, 2025Updated 5 months ago
- The test code for the paper "Attention-based advantage actor-critic algorithm with prioritized experience replay for complex 2-D robotic …☆10Aug 7, 2022Updated 3 years ago
- A domain-specific language that allows the expression or protein interactions that can be used to build executable models.☆23Dec 13, 2022Updated 3 years ago
- Hearthstone by golang , 用go实现炉石传说☆12Mar 2, 2023Updated 2 years ago
- ☆17Jan 6, 2024Updated 2 years ago
- Pytorch implementation of ProtoAU for recommendation.☆10Dec 19, 2024Updated last year
- Haskell to D3.js binding by deep EDSL approach.☆23Sep 20, 2014Updated 11 years ago
- ☆11Jul 18, 2022Updated 3 years ago
- Source code for the paper "Energy-Efficient Client Sampling for Federated Learning in Heterogeneous Mobile Edge Computing Networks", this…☆13Aug 22, 2024Updated last year
- Dynamic Task Software Caching-Assisted Computation Offloading for Multi-Access Edge Computing☆11Dec 18, 2022Updated 3 years ago
- K8 Deep dive - Core Concepts, CRDs, Operators, Controllers, Openshift, kubebuilder, Coreos operator framework☆10May 1, 2023Updated 2 years ago
- Code and dataset for MobiHoc 2022 paper: #46 Effective Multi-User Delay-Constrained Scheduling with Deep Recurrent Reinforcement Learni…☆10Aug 16, 2022Updated 3 years ago
- I created some notebooks about different concepts of financial engineering☆10Sep 28, 2025Updated 4 months ago
- Official implementation of the paper "MTL-Split: Multi-Task Learning for Edge Devices using Split Computing" accepted @ DAC 2024.☆10Dec 3, 2024Updated last year
- ☆14Jan 24, 2024Updated 2 years ago
- ☆11Mar 13, 2024Updated last year
- ☆12May 14, 2024Updated last year
- Designing a RAG pipeline using Gemma-2b, DSPy, and Qdrant☆10Mar 19, 2024Updated last year
- ☆10Jul 16, 2025Updated 6 months ago
- Bluebell is a generic Akoma Ntoso 3 parser.☆19Jan 5, 2026Updated last month
- Demo application for the sse-eventbus library☆12Dec 7, 2025Updated 2 months ago
- ☆15Dec 26, 2021Updated 4 years ago
- CliniDeID automatically de-identifies clinical text notes according to the HIPAA Safe Harbor method. It accurately finds identifiers and …☆10Aug 13, 2023Updated 2 years ago
- Java API for the AkomaNtoso XML Schema☆15Jun 29, 2017Updated 8 years ago