woutervanheeswijk / cliff_walking_publicLinks
Cliff walking reinforcement learning example, with a variety of RL algorithms
☆13Updated last year
Alternatives and similar repositories for cliff_walking_public
Users that are interested in cliff_walking_public are comparing it to the libraries listed below
Sorting:
- ☆17Updated 3 years ago
- Code for our ICLR Trustworthy ML 2020 workshop paper "Improved Image Wasserstein Attacks and Defenses"☆14Updated 5 years ago
- ☆17Updated last week
- A template gymnasium environment for users to build upon☆18Updated 7 months ago
- ☆19Updated 7 months ago
- Model-Based Transfer Learning for Contextual Reinforcement Learning (NeurIPS 2024)☆24Updated 5 months ago
- Kalman Optimization for Value Approximation☆11Updated 5 years ago
- Multi-agent active perception with prediction rewards☆12Updated 4 years ago
- High-performance tokenized language data-loader for Python C++ extension☆13Updated 10 months ago
- Evolutionary Algorithms implementations, for various (discrete & continuous) optimization problems, including for autonomous agent contro…☆13Updated 8 months ago
- Distributed Tensorflow, Keras and PyTorch on Apache Spark/Flink & Ray☆27Updated 4 months ago
- Personal solutions to the Triton Puzzles☆18Updated 10 months ago
- Interactive scalable auditing of model biases and vulnerabilities with interpretable mitigation☆23Updated 3 years ago
- Official Implementation of "CheckEmbed: Effective Verification of LLM Solutions to Open-Ended Tasks"☆19Updated this week
- A question bank for interview questions for data related roles☆10Updated last year
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Updated last year
- Profile repository of Pietro Monticone.☆11Updated last week
- High dimensional black-box optimizer using Latent Action Monte Carlo Tree Search algorithm☆28Updated 2 years ago
- TaskMet Task-driven Metric Learning for Model Learning☆19Updated last year
- PyTorch implementation of algorithms in https://arxiv.org/abs/2207.09238☆14Updated 2 years ago
- Official code for paper: Conservative objective models are a special kind of contrastive divergence-based energy model☆14Updated last year
- Minimum Description Length probing for neural network representations☆19Updated 4 months ago
- ☆16Updated last year
- Some microbenchmarks and design docs before commencement☆12Updated 4 years ago
- Accelerated Stochastic Power Iteration with Momentum☆9Updated 7 years ago
- Easily serialize dataclasses to and from tensors (PyTorch, NumPy)☆18Updated 4 years ago
- Building on the MLFlow toolset this project aims to extend the functionality for MLFlow, increase the automation and therefore reduce the…☆14Updated 2 years ago
- A tool for the creation and visualization of citation networks which combines citation data obtained from parsing the paper's PDF files a…☆14Updated 2 years ago
- Code repo for paper: ICML 2020 paper Natural lottery ticket winner: RL for ordinary neural circuits☆13Updated 5 years ago
- code for paper "Accessing higher dimensions for unsupervised word translation"☆21Updated last year