OpenAI Gym 课程练习笔记
☆15Apr 16, 2024Updated 2 years ago
Alternatives and similar repositories for gym-course-exercises
Users that are interested in gym-course-exercises are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The goal of this project is to develop a program for planetary soft landings using lossless convexification of non convex control bounds.☆12Mar 25, 2022Updated 4 years ago
- Privacy-preserving Voice Analysis via Disentangled Representations☆12Aug 30, 2021Updated 4 years ago
- Robust Reinforcement Learning Benchmark☆12Sep 22, 2024Updated last year
- Deep Reinforcement Learning for Keras.☆12May 23, 2022Updated 4 years ago
- Using reinforcement learning to minimize fuel consuption when landing a rover on Mars☆13Mar 21, 2022Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Proximity Operations: Planetary Landing☆11Dec 11, 2025Updated 6 months ago
- We introduce a way to extend sparse dictionary learning to deep architectures.☆17Jan 13, 2022Updated 4 years ago
- Official PyTorch Implementation for Metric Residual Networks for Sample Efficient Goal-Conditioned Reinforcement Learning☆20Jan 11, 2023Updated 3 years ago
- A clean Pytorch implementation of DDPG on continuous action space.☆31Jun 8, 2024Updated 2 years ago
- iCub simulator in Python☆11Mar 5, 2026Updated 3 months ago
- Training a vision-based agent with the Deep Q Learning Network (DQN) in Atari's Breakout environment, implementation in Tensorflow.☆18Dec 12, 2018Updated 7 years ago
- Methods for using OpenFace in R☆11Feb 26, 2024Updated 2 years ago
- Large Language Models Powered Context-aware Motion Prediction☆15Jan 12, 2026Updated 4 months ago
- Implementation of WGAN-QC☆16Nov 25, 2019Updated 6 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- A modified CARLA client which sends out CAN messages on vcan0 and future plans for a lane detection algorithm and other ideas!☆10Oct 1, 2020Updated 5 years ago
- Implementation of "Visual number sense in untrained deep neural networks" (Kim et al., Science Advances, 2021)☆11Oct 22, 2020Updated 5 years ago
- Estimating driver safety on Pointer's dataset☆11Jun 27, 2018Updated 7 years ago
- ☆14Nov 2, 2022Updated 3 years ago
- ☆27Aug 16, 2023Updated 2 years ago
- decision-making processes of human drivers☆14Mar 28, 2024Updated 2 years ago
- Multi agent PPO implementation in Pytorch for Unity ML Agents environments.☆29Jul 25, 2024Updated last year
- Efficient Adversarial Training without Attacking: Worst-Case-Aware Robust Reinforcement Learning☆28Sep 13, 2023Updated 2 years ago
- Collection of multi-robot SLAM algorithms for ROS☆16Sep 28, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Clustering Through Decision Tree Construction☆33Apr 14, 2019Updated 7 years ago
- studyforrest.org: Phase2 data (movie, eyetracking, retmapping, visual localizers) [BIDS]☆11Apr 13, 2023Updated 3 years ago
- [NeurIPS 2020 Spotlight] State-adversarial PPO for robust deep reinforcement learning☆32Nov 18, 2021Updated 4 years ago
- Object recognition with NAO using a deep learning model☆17Sep 16, 2021Updated 4 years ago
- We use reachability to ensure the safety of a decision agent acting on a dynamic system in real-time. We compute the Forward Reachable Se…☆34Jun 19, 2021Updated 4 years ago
- SpinVision is a computer vision project that analyzes the motion of a cricket ball in video footage. It detects the ball's trajectory, pr…☆14Jun 12, 2025Updated 11 months ago
- Open Access NAO (OAN). A ROS2 framework for HRI studies with NAO v6☆19Apr 29, 2024Updated 2 years ago
- Robust Reinforcement Learning Suite☆37Dec 24, 2024Updated last year
- ☆37Dec 8, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- stata command for pair wise correlation matrix☆22Jan 26, 2025Updated last year
- [TVCG 2024] ReactFace: Online Multiple Appropriate Facial Reaction Generation in Dyadic Interactions☆23Feb 28, 2025Updated last year
- A ros package for visualizing a robot face with different facial expressions☆15Sep 5, 2019Updated 6 years ago
- Code for running the transformers in the ICML 2021 paper "Thinking Like Transformers"☆18Jun 28, 2021Updated 4 years ago
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]☆42Aug 27, 2022Updated 3 years ago
- Repositories made from data collected in my research in the Master in Software Engineering.☆19Jan 23, 2019Updated 7 years ago
- ☆10Sep 13, 2025Updated 8 months ago