Pytorch implementation of large network design in continous control RL.
☆19Jan 5, 2022Updated 4 years ago
Alternatives and similar repositories for Deeper_Larger_Actor-Critic_RL
Users that are interested in Deeper_Larger_Actor-Critic_RL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Lightweight deep RL Libraray for continuous control.☆15Mar 18, 2022Updated 4 years ago
- Benchmark data (i.e., DeepMind Control Suite and MuJoCo) for RL.☆33Jan 23, 2021Updated 5 years ago
- ☆10Mar 22, 2021Updated 5 years ago
- M-CURL: Masked Contrastive Representation Learning for Reinforcement Learning☆28Nov 5, 2020Updated 5 years ago
- reinforcement learning, navigation, unitree, velodyne, slam, collision avoidance☆29Jul 8, 2025Updated 8 months ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Offline Risk-Averse Actor-Critic (O-RAAC). A model-free RL algorithm for risk-averse RL in a fully offline setting☆35Feb 9, 2021Updated 5 years ago
- Authors' implementation of PEER☆11Jul 13, 2023Updated 2 years ago
- Code for Efficient Continuous Control with Double Actors and Regularized Critics, AAAI 2022.☆22Mar 11, 2022Updated 4 years ago
- In this project, I explore some typical value-based and policy-based RL algorithms. I do experiments on DQN and its six variants and thei…☆12Nov 18, 2020Updated 5 years ago
- ☆11Mar 18, 2021Updated 5 years ago
- ☆10Oct 31, 2021Updated 4 years ago
- Actor Prioritized Experience Replay☆18Nov 20, 2023Updated 2 years ago
- ☆22Nov 9, 2025Updated 4 months ago
- Official pytorch implementation for our ICLR 2023 paper "Latent State Marginalization as a Low-cost Approach for Improving Exploration".☆24Feb 9, 2023Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- CUSF Landing Prediction Software☆14Jan 26, 2023Updated 3 years ago
- soft q learning and soft actor critic☆16Dec 23, 2018Updated 7 years ago
- 1D3V Particle in Cell Code in JAX☆22Mar 23, 2026Updated last week
- ☆17Sep 15, 2017Updated 8 years ago
- Collision-detection and collision-avoidance navigation demonstration using a feedforward neural network.☆13Nov 4, 2018Updated 7 years ago
- Code and data for article "Learning nonlocal constitutive models with neural networks", CMAME☆12Sep 7, 2021Updated 4 years ago
- Awesome Long-CoT Data☆19Mar 26, 2025Updated last year
- Tentabot: Navigation Framework for Mobile Robots by Evaluating Motion Primitives (Tentacles)☆60Aug 26, 2025Updated 7 months ago
- Blog about ML and CFD☆10Mar 16, 2026Updated last week
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆11Aug 4, 2023Updated 2 years ago
- ☆14Nov 23, 2022Updated 3 years ago
- The code used, and a docker image to run it, of the paper `Exploiting locality and physical invariants to design effective Deep Reinforce…☆13Dec 10, 2019Updated 6 years ago
- Autonomous visual navigation using the depth images☆11Aug 15, 2019Updated 6 years ago
- MathFusion: Enhancing Mathematical Problem-solving of LLM through Instruction Fusion (ACL 2025)☆36Jul 16, 2025Updated 8 months ago
- Panorama stitching of images or real-time video streams☆10Aug 12, 2020Updated 5 years ago
- Implementation of Proximal Policy Optimization using Transformer☆12Jul 4, 2023Updated 2 years ago
- ☆19Feb 18, 2022Updated 4 years ago
- A framework for implementing path tracking algorithms at ROS and Pyhton. Including implementations of three methods: Pure Pursuit, MPC, a…☆12Jan 3, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Repository for ML Reproducibility Challenge 2020 for the Neurips paper, "The Value Equivalence Principle for Model-Based Reinforcement Le…☆18Apr 13, 2021Updated 4 years ago
- BILIBILI.☆15Jan 6, 2019Updated 7 years ago
- ☆13May 10, 2021Updated 4 years ago
- This is a DRL platform built with Gazebo for the purpose of robot navigation☆20Jul 14, 2018Updated 7 years ago
- Sample-efficient learning-based dynamic environment navigation with transferring experience from optimization-based planner☆17May 31, 2025Updated 9 months ago
- ☆16Jul 11, 2023Updated 2 years ago
- Code space for L4DC paper "State-wise Safe Reinforcement Learning With Pixel Observations"☆11Apr 5, 2024Updated last year