woutervanheeswijk / cliff_walking_publicLinks
Cliff walking reinforcement learning example, with a variety of RL algorithms
☆14Updated last year
Alternatives and similar repositories for cliff_walking_public
Users that are interested in cliff_walking_public are comparing it to the libraries listed below
Sorting:
- A2Perf is a benchmark for evaluating agents on sequential decision problems that are relevant to the real world. This repository contains…☆10Updated last year
- Supplementary material for our paper "Compute Trends Across Three Eras of Machine Learning".☆43Updated 3 years ago
- Awesome Orchest projects, both official and submitted by the community.☆25Updated 2 years ago
- A template gymnasium environment for users to build upon☆21Updated last year
- Hyperparameter tuning via uncertainty modeling☆48Updated last year
- A Probabilistic Programming Language in 70 lines of Python. Code for the blog post https://mrandri19.github.io/2022/01/12/a-PPL-in-70-lin…☆19Updated 3 years ago
- Library for the Test-based Calibration Error (TCE) metric to quantify the degree to classifier calibration.☆13Updated 2 years ago
- ☆17Updated 5 years ago
- Causal Analysis of Agent Behavior for AI Safety☆19Updated 2 years ago
- A Gentle Principled Introduction to Deep Reinforcement Learning☆19Updated 7 months ago
- AI Assistant for Building Reliable, High-performing and Fair Multilingual NLP Systems☆48Updated 3 years ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆24Updated 2 weeks ago
- Everything for the Paper: 'Evoke: Evoking Critical Thinking Abilities in LLMs via Reviewer-Author Prompt Editing'☆17Updated last year
- ML/DL Math and Method notes☆64Updated last year
- A tutorial on locality sensitive hashing, using MinHashing for document similarity and CosineSimilarity for Euclidean space similarity.☆34Updated 4 years ago
- ☆19Updated 4 years ago
- Code from Machine Learning competitions on Kaggle☆11Updated 4 years ago
- Core Utilities for NVIDIA Merlin☆19Updated last year
- An OpenAI wrapper for PyReason to use in a Grid World reinforcement learning setting☆31Updated last year
- Benchmarking PyTorch 2.0 different models☆20Updated 2 years ago
- Color detection beginner data science project☆13Updated 4 years ago
- ☆11Updated 5 years ago
- 3rd party dependencies for DALI project☆10Updated last week
- A python package for running directed acyclic graphs of asynchronous I/O operations☆17Updated 4 years ago
- [EMNLP 2024 Main] Virtual Personas for Language Models via an Anthology of Backstories☆32Updated 11 months ago
- Implements a LLM similar to Meta's Llama 2 from the ground up in PyTorch, for educational purposes.☆37Updated 8 months ago
- A Survey Analyzing Generalization in Deep Reinforcement Learning☆35Updated last year
- 🚀 Deep Learning GPU Selector☆22Updated 4 months ago
- A simple OpenAI Gym environment for Neural Architecture Search (NAS)☆30Updated 5 years ago
- Smart reproducible analytical pipeline inspection☆19Updated 3 weeks ago