woutervanheeswijk / cliff_walking_public

Cliff walking reinforcement learning example, with a variety of RL algorithms

☆12

Alternatives and similar repositories for cliff_walking_public

Users who are interested in cliff_walking_public are comparing it to the repositories listed below

EliorBenYosef / evolutionary-algorithms
Evolutionary Algorithms implementations, for various (discrete & continuous) optimization problems, including for autonomous agent contro…
☆13 Updated 9 months ago
Farama-Foundation / A2Perf
A2Perf is a benchmark for evaluating agents on sequential decision problems that are relevant to the real world. This repository contains…
☆10 Updated 9 months ago
Farama-Foundation / gymnasium-env-template
A template gymnasium environment for users to build upon
☆20 Updated 8 months ago
ttchengab / MnistGAN
☆16 Updated 3 years ago
uvadlc / uvadlc_practicals_2021
Repository for the code assignment of the Deep Learning 1 course, Fall 2021 edition
☆10 Updated 2 years ago
mrandri19 / smolppl
A Probabilistic Programming Language in 70 lines of Python. Code for the blog post https://mrandri19.github.io/2022/01/12/a-PPL-in-70-lin…
☆17 Updated 3 years ago
LMU-Seminar-LLMs / AutoTestGen
Automatic Test Generator
☆12 Updated 3 months ago
SqueezeAILab / open_source_projects
Open Source Projects from Pallas Lab
☆20 Updated 3 years ago
laurimi / multiagent-prediction-reward
Multi-agent active perception with prediction rewards
☆11 Updated 4 years ago
HazyResearch / Accelerated-PCA
Accelerated Stochastic Power Iteration with Momentum
☆9 Updated 7 years ago
harish-kamath / rqae
Residual Quantization Autoencoder, used for interpreting LLMs
☆12 Updated 5 months ago
RylanSchaeffer / Stanford-AI-Alignment-Double-Descent-Tutorial
Code for Arxiv Double Descent Demystified: Identifying, Interpreting & Ablating the Sources of a Deep Learning Puzzle
☆27 Updated last year
akash-agni / ReadThePaper
This repo will be an effort to learn and implement some of the milestone papers and models in Deep Learning based language models.
☆10 Updated 2 years ago
microsoft / MAMBA
Imitation learning from multiple experts
☆12 Updated 2 years ago
clinicalml / realhumaneval
☆21 Updated 7 months ago
ducminhkhoi / autograd_pytorch
Building your own autograd mechanism based on PyTorch tensor only (not Variable, can be seen as numpy array)
☆21 Updated last year
aitor-martinez-seras / SNN-Automotive-Object-Detection
Code of the paper "Efficient Object Detection in Autonomous Driving using Spiking Neural Networks: Performance, Energy Consumption Analys…
☆25 Updated last year
kachayev / dataclasses-tensor
Easily serialize dataclasses to and from tensors (PyTorch, NumPy)
☆18 Updated 4 years ago
ernoult / scalingDTP
"Towards Scaling Difference Target Propagation by Learning Backprop Targets" (ICML 2022)
☆12 Updated 2 years ago
dbpedia / ontology-time-machine
☆11 Updated 5 months ago
GATECH-EIC / SuperTickets
[ECCV 2022] SuperTickets: Drawing Task-Agnostic Lottery Tickets from Supernets via Jointly Architecture Searching and Parameter Pruning
☆20 Updated 2 years ago
christopher-beckham / coms-are-energy-models
Official code for paper: Conservative objective models are a special kind of contrastive divergence-based energy model
☆14 Updated last year
tbroderick / ml_6036_2020_captions
Captions for the 6.036 Introduction to Machine Learning (Fall 2020) lecture videos
☆11 Updated 4 years ago
DPBayes / jax-chacha-prng
A cryptographically-secure pseudo-random number generator for JAX based on the 20 round ChaCha cipher.
☆12 Updated last year
google-deepmind / agent_debugger
Causal Analysis of Agent Behavior for AI Safety
☆18 Updated 2 years ago
sdicastro / KOVA
Kalman Optimization for Value Approximation
☆11 Updated 5 years ago
mohmdelsayed / HesScale
Scalable Computation of Hessian Diagonals
☆13 Updated last year
nutansahoo / InterviewQs
A question bank for interview questions for data related roles
☆10 Updated last year
etimush / ARC_NCA
Repo for solving ARC problems with a Neural Cellular Automata
☆17 Updated last month
FrancescoSaverioZuppichini / pytorch-2.0-benchmark
Benchmarking different models with PyTorch 2.0
☆21 Updated 2 years ago