Soft Actor-Critic implementation with SOTA model-free extension (REDQ) and SOTA model-based extension (MBPO).
☆15Feb 21, 2021Updated 5 years ago
Alternatives and similar repositories for sac-plus
Users that are interested in sac-plus are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PlaNet: Learning Latent Dynamics for Planning from Pixels☆10Feb 13, 2020Updated 6 years ago
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆187Apr 12, 2022Updated 3 years ago
- A C++ library to benchmark inverted indexes.☆21Aug 4, 2020Updated 5 years ago
- ☆18Jun 3, 2017Updated 8 years ago
- 本项目是一个基于 LangGraph和大语言模型(LLM)实现的 Agentic RAG (检索增强生成)系统。它融合了动态查询分析和自我纠错机制,能够根据用户问题的复杂度智能地选择最优的策略(直接回答、向量库检索或网络搜索),并对生成的答案进行相关性评估,从而实现更高质量…☆44Oct 21, 2025Updated 5 months ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Pytorch implementations of RL algorithms, focusing on model-based, lifelong, reset-free, and offline algorithms. Official codebase for Re…☆110Jan 23, 2022Updated 4 years ago
- The codes for the work "AV-casNet: Fully Automatic Arteriole-Venule Segmentation and Differentiation in OCT Angiography"☆15Oct 27, 2022Updated 3 years ago
- A comparison of Google SlateQ algorithm with traditional Reinforcement Learning algorithms☆39Dec 27, 2022Updated 3 years ago
- Enhance vessel structures in 3D images using Hessian/Frangi/eigenvalue filter through the ITK library☆19Jul 25, 2021Updated 4 years ago
- ☆15Feb 11, 2021Updated 5 years ago
- 6-DoF wheeled biped robot☆18Jan 19, 2022Updated 4 years ago
- ☆25Mar 7, 2025Updated last year
- ☆17Jul 11, 2020Updated 5 years ago
- cmdr cxx version, a C++17/20 header-only command-line parser with hierarchical config data manager here☆18Mar 20, 2026Updated last week
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- World Models with A3C on Carracing-v0 in gym☆32Mar 29, 2020Updated 6 years ago
- Learning bisimulation metrics for control, particularly suited to sparse reward settings☆10Feb 28, 2023Updated 3 years ago
- Agentic AI ex-US equity evaluator (LangGraph+Gemini)☆59Mar 21, 2026Updated last week
- Established a UNet model to deal with image denoising problem☆21Jun 7, 2021Updated 4 years ago
- On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning☆16Apr 30, 2023Updated 2 years ago
- A videogame made with PyGame turned into an Open AI Gym Learning Environment for Reinforcement Learning agents.☆15Jan 3, 2023Updated 3 years ago
- A curated list of awesome AI developments for ophthalmology☆19Jun 22, 2021Updated 4 years ago
- 拓片图像去噪,以UNet为基本框架,编码器基于VGG16☆22Apr 9, 2020Updated 5 years ago
- ☆14Apr 17, 2021Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ROS Package for running LQR controller in a simulated mobile robot. Capstone project for Udacity C++ Nanodegree☆14Apr 4, 2023Updated 2 years ago
- Mod Source for BOTATO, brotato auto-battler mod☆22Aug 25, 2025Updated 7 months ago
- Revisiting Peng's Q(lambda) for Modern Reinforcement Learning☆15Jul 23, 2021Updated 4 years ago
- PyTorch Implementation of Hamilton-Jacobi DQN☆16May 12, 2021Updated 4 years ago
- Implementation of the Discrete Soft Actor-Critic algorithm with RNN policy in PyTorch☆26Jan 7, 2023Updated 3 years ago
- PyTorch implementations of Reinforcement Learning algorithms in less than 200 lines☆10Apr 3, 2020Updated 5 years ago
- 2D Iterative Learning Control with Deep Reinforcement Learning Compensation for the Non-repetitive Batch Processes☆11Mar 4, 2025Updated last year
- 动手学深度学习图像配准(DLIR)☆25Oct 18, 2022Updated 3 years ago
- Reference based Image Super-Resolution via Variational AutoEncoder☆29Jul 26, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Redesigning the Pix2Pix model for small datasets with fewer parameters and different PatchGAN architecture☆23Oct 18, 2025Updated 5 months ago
- Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"☆20Oct 6, 2021Updated 4 years ago
- 基于UNet、PatchGan网络的地震叠加数据去噪方法-tensorflow、Pytorch实现☆21Jan 20, 2022Updated 4 years ago
- ☆19Mar 28, 2025Updated last year
- A System for Morphology-Task Generalization via Unified Representation and Behavior Distillation (ICLR2023)☆14Feb 3, 2023Updated 3 years ago
- A personal project where I publish my research paper notes on a weekly basis.☆13Jul 28, 2021Updated 4 years ago
- Actor-Sharer-Learner training framework for off-policy DRL algorithms☆22Dec 29, 2024Updated last year