A simple example of Randomized Ensembled Double Q-Learning (REDQ)
☆19 Sep 3, 2021 Updated 4 years ago
Alternatives and similar repositories for REDQ_simple_example
Users who are interested in REDQ_simple_example are comparing it to the libraries listed below
- ☆24 Mar 26, 2023 Updated 2 years ago
- Implementation of 6 DQN extension methods using PyTorch. (RAINBOW) ☆16 Dec 7, 2020 Updated 5 years ago
- Implementation of distributed reinforcement learning with distributed TensorFlow ☆57 Jun 5, 2021 Updated 4 years ago
- State Lattice based A star path planner ☆10 May 27, 2016 Updated 9 years ago
- An implementation of DreamerV2 written in JAX, with support for running multiple random seeds of an experiment on a single GPU. ☆18 Jan 16, 2023 Updated 3 years ago
- This is a TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning. (C51-DDPG) ☆11 Sep 14, 2017 Updated 8 years ago
- Deep Multi-Agent Reinforcement Learning with StarCraft 2 ☆10 Sep 27, 2020 Updated 5 years ago
- Code for paper Rethinking the Data Annotation Process for Multi-view 3D Pose Estimation with Active Learning and Self-Training ☆22 Apr 18, 2023 Updated 2 years ago
- ☆11 Sep 1, 2017 Updated 8 years ago
- kaggle - RSNA STR Pulmonary Embolism Detection ☆10 Nov 22, 2020 Updated 5 years ago
- A mini racetrack world for developing and testing robots with AWS RoboMaker and Gazebo simulations. ☆15 Sep 8, 2020 Updated 5 years ago
- ☆11 Aug 17, 2025 Updated 7 months ago
- ☆20 Dec 8, 2022 Updated 3 years ago
- Hugging Face tutorials ☆15 Jun 3, 2021 Updated 4 years ago
- presentations ☆44 Dec 8, 2018 Updated 7 years ago
- Reinforcement Learning for Tetris ☆19 Oct 24, 2016 Updated 9 years ago
- Simple Tensorflow implementation of "SDIT: Scalable and Diverse Cross-domain Image Translation" (ACM-MM 2019) ☆16 Oct 14, 2019 Updated 6 years ago
- code for CoRL 2020 paper "Contrastive Variational Model-Based Reinforcement Learning for Complex Observations" ☆24 Dec 29, 2021 Updated 4 years ago
- Introduction to Deep Reinforcement Learning ☆88 Nov 24, 2025 Updated 3 months ago
- A Python reference implementation of rigid body dynamics algorithms ☆18 Jul 17, 2023 Updated 2 years ago
- [Assignments] CS231N: Convolutional Neural Networks for Visual Recognition (2016 & 2017) ☆47 Feb 12, 2024 Updated 2 years ago
- 3rd place solution for ALASKA2 Image Steganalysis on Kaggle ☆12 Mar 3, 2021 Updated 5 years ago
- AGAC: Adversarially Guided Actor-Critic ☆47 Sep 16, 2021 Updated 4 years ago
- ☆40 Jul 29, 2019 Updated 6 years ago
- ☆15 Nov 27, 2020 Updated 5 years ago
- ☆39 Jan 8, 2020 Updated 6 years ago
- Today I Learned ☆21 Jan 1, 2025 Updated last year
- Implementations of autoencoders (VAE, AAE, and others) ☆11 Oct 1, 2018 Updated 7 years ago
- Tic Tac Toe with Alpha Zero method - My first work ☆18 Aug 23, 2018 Updated 7 years ago
- Visual Verb Sense Disambiguation ☆13 Apr 26, 2019 Updated 6 years ago
- [AAAI 2026] TrajEvo: Designing Trajectory Prediction Heuristics via LLM-driven Evolution ☆14 Aug 1, 2025 Updated 7 months ago
- Escaping from Collapsing Modes in a Constrained Space (ECCV 2018) ☆13 Sep 1, 2018 Updated 7 years ago
- Amazon EC2 Deployment: Complete CI/CD Pipeline using GitHub Actions and AWS CodeDeploy ☆25 Jan 29, 2024 Updated 2 years ago
- Brain Agent for Large-Scale and Multi-Task Agent Learning ☆93 Jan 4, 2024 Updated 2 years ago
- A repo for mpc learner. ☆20 May 30, 2023 Updated 2 years ago
- Official implementation for GraphDE: A Generative Framework for Debiased Learning and Out-of-Distribution Detection on Graphs (NeurIPS 20… ☆20 Oct 14, 2022 Updated 3 years ago
- ☆33 May 13, 2021 Updated 4 years ago
- [ICML 2023] Meta-SAGE: Scale Meta-Learning Scheduled Adaptation with Guided Exploration for Mitigating Scale Shift on Combinatorial Optim… ☆10 Dec 19, 2023 Updated 2 years ago
- ☆13 Jul 15, 2024 Updated last year