locuslab / aseLinks
Analogous Safe-state Exploration (ASE) is an algorithm for provably safe and optimal exploration in MDPs with unknown, stochastic dynamics.
☆11Updated 4 years ago
Alternatives and similar repositories for ase
Users that are interested in ase are comparing it to the libraries listed below
Sorting:
- Code for "Boosted Generative Models", AAAI 2018.☆20Updated 7 years ago
- codes for TokenManipulationGAN☆7Updated 5 years ago
- Symbolic Brittleness in Sequence Models: on Systematic Generalization in Symbolic Mathematics (AAAI 2022)☆14Updated 3 years ago
- Variable-order CRFs with structure learning☆16Updated 10 months ago
- NeurIPS 2019 Paper Implementation☆12Updated 2 years ago
- A framework for implementing equivariant DL☆10Updated 4 years ago
- Code for our ICLR Trustworthy ML 2020 workshop paper "Improved Image Wasserstein Attacks and Defenses"☆14Updated 5 years ago
- Deep reinforcement learning for adaptation in evolutionary algorithms☆9Updated 5 years ago
- Implementation of the LOSSGRAD optimization algorithm☆15Updated 6 years ago
- Flexible Reinforcement Learning Framework with PyTorch☆22Updated 4 years ago
- ☆12Updated 3 years ago
- Variational Walkback, NIPS'17☆28Updated 7 years ago
- ☆12Updated 6 years ago
- ☆18Updated 3 years ago
- Code for Unbiased Implicit Variational Inference (UIVI)☆14Updated 6 years ago
- Understanding RL vision Distill article☆23Updated 2 years ago
- Investigate the speed of adaptation of structural causal models☆15Updated 4 years ago
- Official repository for the paper "Goal-Conditioned Generators of Deep Policies"☆11Updated last week
- 🤖 Implementation of Self Normalizing Networks (SNN) in PyTorch.☆12Updated 8 years ago
- NER System Developed at CMU☆11Updated 7 years ago
- An attempt to merge ESBN with Transformers, to endow Transformers with the ability to emergently bind symbols☆16Updated 3 years ago
- Reference implementation of algorithms for reinforcement learning and Markov decision processes.☆12Updated 4 years ago
- ☆12Updated 4 years ago
- Code for "MIM: Mutual Information Machine" paper.☆16Updated 2 years ago
- Python package for graph statistics☆9Updated 4 years ago
- Fast IdEntification of State-of-The-Art models using adaptive bandit algorithms☆14Updated 2 years ago
- This is a tutorial written for Caffe2 which mocks google AlphaGo Fan and AlphaGo Zero.☆8Updated 6 years ago
- ☆10Updated 2 years ago
- A study of the downstream instability of word embeddings☆12Updated 2 years ago
- This repository contains a Pytorch implementation of the article "The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Network…☆9Updated 4 years ago