woutervanheeswijk / cliff_walking_public
Cliff walking reinforcement learning example, with a variety of RL algorithms
☆13 · Updated last year
Alternatives and similar repositories for cliff_walking_public
Users who are interested in cliff_walking_public compare it to the repositories listed below
- A2Perf is a benchmark for evaluating agents on sequential decision problems that are relevant to the real world. This repository contains… ☆10 · Updated 10 months ago
- Causal Analysis of Agent Behavior for AI Safety ☆18 · Updated 2 years ago
- Hyperparameter tuning via uncertainty modeling ☆47 · Updated last year
- ☆16 · Updated last year
- Understanding RL vision Distill article ☆23 · Updated 2 years ago
- ☆20 · Updated last year
- 3rd party dependencies for DALI project ☆10 · Updated last week
- Library for the Test-based Calibration Error (TCE) metric to quantify the degree of classifier calibration. ☆13 · Updated last year
- Produce intelligence by means of natural selection without objective/reward optimization ☆14 · Updated 3 years ago
- A tutorial on locality sensitive hashing, using MinHashing for document similarity and CosineSimilarity for Euclidean space similarity. ☆33 · Updated 4 years ago
- A portal for direct sale between farmers and consumers. ☆9 · Updated 4 years ago
- Accelerated Stochastic Power Iteration with Momentum ☆9 · Updated 7 years ago
- Color detection beginner data science project ☆13 · Updated 4 years ago
- The tool to read/get/extract and write/change/modify BIOS/UEFI settings from Linux terminal. ☆6 · Updated 2 years ago
- A simple OpenAI Gym environment for Neural Architecture Search (NAS) ☆30 · Updated 5 years ago
- Supplementary material for our paper "Compute Trends Across Three Eras of Machine Learning". ☆41 · Updated 3 years ago
- Benchmarking different models with PyTorch 2.0 ☆20 · Updated 2 years ago
- ⛰️ RockyML - A High-Performance Scientific Computing Framework for Non-smooth Machine Learning Problems ☆19 · Updated 2 years ago
- Interactive scalable auditing of model biases and vulnerabilities with interpretable mitigation ☆25 · Updated 3 years ago
- OLMost every training recipe you need to perform data interventions with the OLMo family of models. ☆39 · Updated this week
- AI Assistant for Building Reliable, High-performing and Fair Multilingual NLP Systems ☆46 · Updated 2 years ago
- Core Utilities for NVIDIA Merlin ☆19 · Updated last year
- Renee: End-to-end training of extreme classification models ☆23 · Updated last year
- Documentation for dynamic machine learning systems. ☆29 · Updated 10 months ago
- Implementation of transformers based architecture in PyTorch. ☆54 · Updated 4 years ago
- 11-785 Introduction to Deep Learning (IDeeL) website with logistics and select course materials ☆53 · Updated last week
- A Python tool to solve logic games with AI, Deep Learning and Computer Vision ☆17 · Updated 4 years ago
- ☆20 · Updated 2 weeks ago
- Analysing ML conference data and plotting interesting statistics. ☆11 · Updated 2 years ago
- Everything for the Paper: 'Evoke: Evoking Critical Thinking Abilities in LLMs via Reviewer-Author Prompt Editing' ☆16 · Updated last year