Soft Actor-Critic implementation with SOTA model-free extension (REDQ) and SOTA model-based extension (MBPO).
☆15Feb 21, 2021Updated 5 years ago
Alternatives and similar repositories for sac-plus
Users that are interested in sac-plus are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PlaNet: Learning Latent Dynamics for Planning from Pixels☆10Feb 13, 2020Updated 6 years ago
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆186Apr 12, 2022Updated 4 years ago
- A C++ library to benchmark inverted indexes.☆21Aug 4, 2020Updated 5 years ago
- ☆18Jun 3, 2017Updated 8 years ago
- 本项目是一个基于 LangGraph和大语言模型(LLM)实现的 Agentic RAG (检索增强生成)系统。它融合了动态查询分析和自我纠错机制,能够根据用户问题的复杂度智能地选择最优的策略(直接回答、向量库检索或网络搜索),并对生成的答案进行相关性评估,从而实现更高质量…☆54Oct 21, 2025Updated 5 months ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Pytorch implementations of RL algorithms, focusing on model-based, lifelong, reset-free, and offline algorithms. Official codebase for Re…☆110Jan 23, 2022Updated 4 years ago
- The codes for the work "AV-casNet: Fully Automatic Arteriole-Venule Segmentation and Differentiation in OCT Angiography"☆15Oct 27, 2022Updated 3 years ago
- A comparison of Google SlateQ algorithm with traditional Reinforcement Learning algorithms☆38Dec 27, 2022Updated 3 years ago
- Enhance vessel structures in 3D images using Hessian/Frangi/eigenvalue filter through the ITK library☆19Jul 25, 2021Updated 4 years ago
- ☆15Feb 11, 2021Updated 5 years ago
- 6-DoF wheeled biped robot☆18Jan 19, 2022Updated 4 years ago
- ☆25Mar 7, 2025Updated last year
- ☆17Jul 11, 2020Updated 5 years ago
- cmdr cxx version, a C++17/20 header-only command-line parser with hierarchical config data manager here☆18Updated this week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Learning bisimulation metrics for control, particularly suited to sparse reward settings☆10Feb 28, 2023Updated 3 years ago
- World Models with A3C on Carracing-v0 in gym☆32Mar 29, 2020Updated 6 years ago
- Agentic AI ex-US equity evaluator (LangGraph+Gemini)☆61Updated this week
- On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning☆16Apr 30, 2023Updated 2 years ago
- Established a UNet model to deal with image denoising problem☆21Jun 7, 2021Updated 4 years ago
- A videogame made with PyGame turned into an Open AI Gym Learning Environment for Reinforcement Learning agents.☆15Jan 3, 2023Updated 3 years ago
- A curated list of awesome AI developments for ophthalmology☆20Jun 22, 2021Updated 4 years ago
- 拓片图像去噪,以UNet为基本框架,编码器基于VGG16☆22Apr 9, 2020Updated 6 years ago
- ☆14Apr 17, 2021Updated 5 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ROS Package for running LQR controller in a simulated mobile robot. Capstone project for Udacity C++ Nanodegree☆14Apr 4, 2023Updated 3 years ago
- Mod Source for BOTATO, brotato auto-battler mod☆21Aug 25, 2025Updated 7 months ago
- Revisiting Peng's Q(lambda) for Modern Reinforcement Learning☆15Jul 23, 2021Updated 4 years ago
- PyTorch Implementation of Hamilton-Jacobi DQN☆16May 12, 2021Updated 4 years ago
- Implementation of the Discrete Soft Actor-Critic algorithm with RNN policy in PyTorch☆26Jan 7, 2023Updated 3 years ago
- PyTorch implementations of Reinforcement Learning algorithms in less than 200 lines☆10Apr 3, 2020Updated 6 years ago
- 2D Iterative Learning Control with Deep Reinforcement Learning Compensation for the Non-repetitive Batch Processes☆11Mar 4, 2025Updated last year
- 动手学深度学习图像配准(DLIR)☆25Oct 18, 2022Updated 3 years ago
- Reference based Image Super-Resolution via Variational AutoEncoder☆29Jul 26, 2021Updated 4 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Redesigning the Pix2Pix model for small datasets with fewer parameters and different PatchGAN architecture☆23Oct 18, 2025Updated 6 months ago
- Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"☆20Oct 6, 2021Updated 4 years ago
- ☆19Apr 10, 2026Updated last week
- 基于UNet、PatchGan网络的地震叠加数据去噪方法-tensorflow、Pytorch实现☆21Jan 20, 2022Updated 4 years ago
- A System for Morphology-Task Generalization via Unified Representation and Behavior Distillation (ICLR2023)☆14Feb 3, 2023Updated 3 years ago
- A personal project where I publish my research paper notes on a weekly basis.☆13Jul 28, 2021Updated 4 years ago
- Actor-Sharer-Learner training framework for off-policy DRL algorithms☆22Dec 29, 2024Updated last year