☆18Sep 7, 2023Updated 2 years ago
Alternatives and similar repositories for fast-rl-with-slow-updates
Users that are interested in fast-rl-with-slow-updates are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PaloBoost is an overfitting-robust Gradient Boosting algorithm.☆15Dec 20, 2019Updated 6 years ago
- A beginner's tutorial of reinforcement learning in both Chinese and English. 一份面向初学者的强化学习教程(中英双语)☆12Aug 17, 2023Updated 2 years ago
- PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN☆46Oct 4, 2020Updated 5 years ago
- ☆11Oct 19, 2020Updated 5 years ago
- ☆16May 20, 2025Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Implementing DQNClipped and DQNReg Algorithms☆10Mar 2, 2021Updated 5 years ago
- Source code for ICML 2023 paper "Competing for Shareable Arms in Multi-Player Multi-Armed Bandits"☆10May 14, 2024Updated 2 years ago
- ☆10Sep 21, 2020Updated 5 years ago
- ☆12Jan 6, 2022Updated 4 years ago
- Code of Paper "Cooperative Sensing and Uploading for Quality-Cost Tradeoff of Digital Twins in VEC", IEEE TCE, 2024.☆12Jul 10, 2023Updated 2 years ago
- Deep Recurrent Q-Network with different exploration strategies for self-driving cars (using AirSim)☆10Sep 5, 2024Updated last year
- In this repository, we try to solve musculoskeletal tasks with `Double DQN reinforcement learning` by using a `transformer` model has bee…☆16Nov 7, 2023Updated 2 years ago
- Application of REINFORCE algorithm to downlink NOMA system☆13Jan 28, 2026Updated 4 months ago
- For my MSc final dissertation "Beamforming Optimization for Reconfigurable Intelligent Surfaces-Assisted Integrated Sensing and Communic…☆17Sep 1, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- pytorch implementation of SAC, TD3 and TD7 with Mujoco Benchmark results from 4 seeds.☆15Jul 4, 2024Updated last year
- ☆16Apr 28, 2023Updated 3 years ago
- ☆15Dec 9, 2021Updated 4 years ago
- Code for IEEE GLOBECOM 2023 paper "Caching for Edge Inference at Scale: A Mean Field Multi-Agent Reinforcement Learning Approach".☆14May 13, 2024Updated 2 years ago
- HAPS-UAV-Enabled Heterogeneous Networks: A Deep Reinforcement Learning Approach☆16Jul 13, 2023Updated 2 years ago
- This repository contains all the projects, and necessary scripts and files developed for the anti-jamming project based on ns3-gym. You c…☆15Aug 14, 2023Updated 2 years ago
- ☆57Jan 20, 2023Updated 3 years ago
- An implementation of TRPO with GAE in PyTorch☆16Jul 22, 2023Updated 2 years ago
- This is code of paper entitled "AI-based Radio Resource and Transmission Opportunity Allocation for 5G-V2X HetNets: NR and NR-U networks…☆15Sep 8, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- This framework is for resource allocation in C-V2X Mode 4☆17Nov 14, 2025Updated 6 months ago
- UMAP in pure MLX for Apple Silicon. 30x faster than umap-learn.☆43Mar 5, 2026Updated 2 months ago