A explaintable and modified version of udacity DRL homework
☆26Jul 7, 2020Updated 5 years ago
Alternatives and similar repositories for DRL_udacity
Users that are interested in DRL_udacity are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- gym-auv repository upgraded to Stable-Baselines 3☆12Aug 24, 2023Updated 2 years ago
- a collection of DRL-repo in Github☆16Oct 21, 2020Updated 5 years ago
- Audio-Visual Perception of Omnidirectional Video for Virtual Reality Applications☆15Feb 22, 2023Updated 3 years ago
- Towards Universal Internet Congestion Control Benchmarking ...☆23Aug 11, 2023Updated 2 years ago
- Source code for journal paper "Multiagent Reinforcement Learning With Sparse Interactions by Negotiation and Knowledge Transfer"☆13Dec 26, 2017Updated 8 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- The Manta v1 software architecture for Autonomous Underwater Vehicles (AUVs) - Master's thesis☆10Aug 11, 2022Updated 3 years ago
- An approach that integrates Transfomer-based attention mechanisms into model predictive control for a learnable control policy.☆14May 9, 2023Updated 2 years ago
- defogging☆11Oct 18, 2020Updated 5 years ago
- Gym Environment for AUV docking procedure☆11Sep 20, 2022Updated 3 years ago
- Comprehensive Information Integration Modeling Framework for Video Titling☆11Aug 27, 2020Updated 5 years ago
- Experiments on Model-Agnostic Meta-Learning on Few-Shot Image Classification and Meta-RL (Meta-World)☆17Mar 30, 2021Updated 5 years ago
- reinforcement learning for voltage control☆11Apr 2, 2019Updated 7 years ago
- Electricity load prediction of New York state with Matlab and TensorFlow framework.☆11Jul 25, 2021Updated 4 years ago
- Popular Deep RL algorithms implemented in PyTorch☆11Jan 30, 2021Updated 5 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 高性能短序列稀疏Mask Attention CUDA算子,针对<1K序列+75%稀疏度优化☆67Mar 18, 2026Updated 3 weeks ago
- Implementation of GAIL and AIRL using chinerrl☆16Jun 21, 2022Updated 3 years ago
- A simple Python implement of Bilateral Mesh Denoising☆10Dec 1, 2019Updated 6 years ago
- implement gat with batch☆10Nov 28, 2020Updated 5 years ago
- Code for CIKM 2021 best short paper nomination "Modeling Sequences as Distributions with Uncertainty for Sequential Recommendation" https…☆16Jun 11, 2021Updated 4 years ago
- RL Agent for Atari Game Pong☆11Aug 25, 2019Updated 6 years ago
- 深交所年报下载爬虫☆15Feb 10, 2021Updated 5 years ago
- Spectral Method for Multiple Experts Inverse Reinforcement Learning☆14Sep 6, 2014Updated 11 years ago
- ☆16Feb 15, 2018Updated 8 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- This is the source code to simulate model-based (MB) and model-free (MF) reinforcement learning algorithms with replays in grid worlds.☆14Dec 19, 2022Updated 3 years ago
- ☆16Mar 24, 2022Updated 4 years ago
- A convolutional autoencoder for feature extraction, with an SVM for image classification.☆10Jan 30, 2019Updated 7 years ago
- R-PCC: A Baseline for Range Image-based Point Cloud Compression☆37Apr 16, 2022Updated 3 years ago
- NASA Project; Plastic Marine Debris Classification-Machine Learning Software☆16Oct 12, 2021Updated 4 years ago
- ☆13Jan 13, 2019Updated 7 years ago
- The source code of team 🥇Schaferct in 2nd Bandwidth Prediction of MMSys'24.☆16May 13, 2024Updated last year
- Code from "Modeling MOOC Student Behavior with Two-Layer Hidden Markov Models"☆15Jun 2, 2018Updated 7 years ago
- 同济大学软件学院小学期数据库课设——【归宿——一个民宿预定服务网站】的前端☆13Jul 16, 2021Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- The code of the algorithm proposed in the paper "Deep Inverse Reinforcement Learning for Objective Function Identification in Bidding Mod…☆15Aug 13, 2021Updated 4 years ago
- C++ implementation of the GJK algorithm for convex polygon collision detection.☆11Aug 22, 2019Updated 6 years ago
- By using PLECS and Simulink, the goal of this project is to provide voltage and frequency restoration. The methods will include finite ti…☆14Apr 15, 2018Updated 7 years ago
- ☆16Jan 21, 2022Updated 4 years ago
- 一款基于DQN算法的牌类游戏AI框架 / An AI framework for card games based on DQN algorithm☆13Jul 25, 2024Updated last year
- ☆20Mar 4, 2018Updated 8 years ago
- OpenAI gym environment for collision avoidance and path following with an AUV☆22Sep 5, 2020Updated 5 years ago