woutervanheeswijk / cliff_walking_public
Cliff walking reinforcement learning example, with a variety of RL algorithms
☆13Updated last year
Alternatives and similar repositories for cliff_walking_public
Users who are interested in cliff_walking_public are comparing it to the repositories listed below.
- A2Perf is a benchmark for evaluating agents on sequential decision problems that are relevant to the real world. This repository contains…☆10Updated last year
- A template gymnasium environment for users to build upon☆21Updated 11 months ago
- Benchmarking different models with PyTorch 2.0☆20Updated 2 years ago
- Causal Analysis of Agent Behavior for AI Safety☆18Updated 2 years ago
- Hyperparameter tuning via uncertainty modeling☆48Updated last year
- ☆10Updated last year
- Color detection beginner data science project☆13Updated 4 years ago
- Clean RL implementation using MLX☆33Updated last year
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆24Updated this week
- ☆19Updated 2 months ago
- ☆26Updated last year
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …☆14Updated last year
- CS412-Introduction-to-Data-Mining☆13Updated 9 years ago
- OLMost every training recipe you need to perform data interventions with the OLMo family of models.☆50Updated last week
- A Gentle Principled Introduction to Deep Reinforcement Learning☆19Updated 6 months ago
- Understanding RL vision Distill article☆24Updated 2 years ago
- Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task.☆15Updated last year
- A tutorial on locality sensitive hashing, using MinHashing for document similarity and CosineSimilarity for Euclidean space similarity.☆34Updated 4 years ago
- Implements an LLM similar to Meta's Llama 2 from the ground up in PyTorch, for educational purposes.☆37Updated 8 months ago
- Everything for the Paper: 'Evoke: Evoking Critical Thinking Abilities in LLMs via Reviewer-Author Prompt Editing'☆17Updated last year
- Library for the Test-based Calibration Error (TCE) metric to quantify the degree of classifier calibration.☆13Updated 2 years ago
- ☆29Updated 2 years ago
- ☆15Updated 4 months ago
- ☆16Updated last year
- ☆29Updated last year
- Minimum Description Length probing for neural network representations☆20Updated 8 months ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Updated 2 years ago
- Supplementary material for our paper "Compute Trends Across Three Eras of Machine Learning".☆43Updated 3 years ago
- Exploration into the Firefly algorithm in Pytorch☆41Updated 7 months ago
- ☆22Updated 8 months ago