Model-Based RL Demo for Pendulum-v0
☆13Jun 16, 2020Updated 6 years ago
Alternatives and similar repositories for PendulumDemo
Users that are interested in PendulumDemo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of ICML2020 paper <Bidirectional Model-based Policy Optimization>☆23Mar 24, 2023Updated 3 years ago
- ☆14Jan 16, 2025Updated last year
- Fully differentiable RL environments, written in Ivy.☆66Aug 28, 2023Updated 2 years ago
- In this project I developed LSTM models for uni-variate , multivariate , multi-step time series forecasting.☆11Feb 27, 2020Updated 6 years ago
- Traffic Signal Control Using Lightweight Transformers: An Offline-to-Online RL Approach☆15May 10, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆10Apr 5, 2024Updated 2 years ago
- Quantum computing bootcamp with Qiskit☆13Jul 6, 2023Updated 2 years ago
- Implementation of Proximal Policy Optimization in Jax+Flax☆21May 18, 2023Updated 3 years ago
- Brax + Pufferlib + CARBS for gpu-accelerated robotics RL☆12Jun 12, 2025Updated last year
- Self-implemented code for Model-Based Meta-Reinforcement Learning☆17Apr 28, 2019Updated 7 years ago
- Apache Airflow CI pipeline☆19Jun 12, 2019Updated 7 years ago
- Building the Bi-LSTM & the CNN-GAN models to compose Classical Music in different eras☆12Aug 2, 2021Updated 4 years ago
- ☆18Aug 10, 2020Updated 5 years ago
- Codebase for a Marimba playing robot☆16Nov 6, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A simple, flexible, productive static site generator written entirely in Ink☆20Jul 19, 2021Updated 4 years ago
- Thesis: Application of Reinforcement Learning for the Control of Nonlinear Dynamical Systems☆18Apr 16, 2020Updated 6 years ago
- DINO-based perceptual losses and FDD feature extraction☆33Jan 7, 2026Updated 5 months ago
- Tensorflow implementation of MuZero algorithm☆11Aug 23, 2022Updated 3 years ago
- A repo based on XiLin Li's PSGD repo that extends some of the experiments.☆14Oct 7, 2024Updated last year
- Reinforcement Learning Methods with PyTorch☆38Jan 16, 2020Updated 6 years ago
- Scalable Computation of Hessian Diagonals☆14Jun 2, 2024Updated 2 years ago
- This project is a implementation in PyTorch for ZO-AdaMU optimization: Adapting Perturbation with the Momentum and Uncertainty in Zeroth-…☆14Dec 12, 2023Updated 2 years ago
- Official PyTorch implementation of "EdVAE: Mitigating Codebook Collapse with Evidential Discrete Variational Autoencoders"☆14Sep 20, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Model based RL for fault-rotor quadrotor☆18Jan 16, 2020Updated 6 years ago
- This Repository allows to super fast download historical ohlcv data from binance.☆12Nov 27, 2020Updated 5 years ago
- A Bitmex client☆14Jan 4, 2023Updated 3 years ago
- A Github Action that'll convert a PDF Writeup into images for your README☆13Jan 12, 2021Updated 5 years ago
- Spin up an EC2 instance in AWS well-endowed for deep learning using Terraform☆17Jul 9, 2020Updated 5 years ago
- Fourier Spatial-Temporal Network for Multivariate Time Series Forecasting☆11Jan 1, 2023Updated 3 years ago
- Building an Agent to Trade with Reinforcement Learning☆41Dec 29, 2025Updated 6 months ago
- A C++ pytorch implementation of MuZero☆40May 18, 2026Updated last month
- Some microbenchmarks and design docs before commencement☆11Feb 1, 2021Updated 5 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- LayZ is a simple cross-platform renderer engine written in C/C++.☆12Oct 13, 2020Updated 5 years ago
- Grokking on modular arithmetic in less than 150 epochs in MLX☆15Oct 24, 2024Updated last year
- V-MPO torch version with DMLab30 and GTrXL☆13Mar 1, 2021Updated 5 years ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆43Sep 19, 2022Updated 3 years ago
- Position Coupling: Improving Length Generalization of Arithmetic Transformers Using Task Structure (NeurIPS 2024) + Arithmetic Transfor…☆14Oct 26, 2025Updated 8 months ago
- MetaProbformer for Charging Load Probabilistic Forecasting of Electric Vehicle Charging Stations [T-ITS, 2023]☆30Apr 17, 2023Updated 3 years ago
- JAX implementation of the T5 model: Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer☆24Jun 10, 2023Updated 3 years ago