fostiropoulos / ablator
Model Ablation Tool-Kit for Deep Learning Model
☆35Updated 10 months ago
Related projects ⓘ
Alternatives and complementary repositories for ablator
- PyTorch code corresponding to my blog series on adversarial examples and (confidence-calibrated) adversarial training.☆67Updated last year
- Course to learn the basics of self supervised learning☆15Updated 2 months ago
- Extensive acceptance rates and information of main AI conferences☆144Updated 3 months ago
- Code for Contrastive Preference Learning (CPL)☆153Updated 8 months ago
- ☆127Updated this week
- ☆73Updated 4 months ago
- Example of how to use Weights & Biases on Slurm☆109Updated 2 years ago
- ViT Prisma is a mechanistic interpretability library for Vision Transformers (ViTs).☆174Updated this week
- Betty: an automatic differentiation library for generalized meta-learning and multilevel optimization☆332Updated 4 months ago
- Codebase to fully reproduce the results of "No Representation, No Trust: Connecting Representation, Collapse, and Trust Issues in PPO" (M…☆15Updated 3 months ago
- a simple and scalable agent for training adaptive policies with sequence-based RL☆89Updated this week
- Gradient Boosting Reinforcement Learning (GBRL)☆87Updated this week
- This repository collects all relevant resources about interpretability in LLMs☆282Updated last week
- ☆200Updated 9 months ago
- Efficient baselines for autocurricula in JAX.☆173Updated 2 months ago
- NEVIS'22: Benchmarking the next generation of never-ending learners☆98Updated last year
- about me☆21Updated 2 weeks ago
- A lightweight research framework☆21Updated 8 months ago
- Assignments for Berkeley CS 285: Deep Reinforcement Learning (Fall 2021)☆146Updated 2 years ago
- A continually updated list of literature on Reinforcement Learning from AI Feedback (RLAIF)☆137Updated last month
- Implementations of robust Dual Curriculum Design (DCD) algorithms for unsupervised environment design.☆126Updated 2 months ago
- Must-read Papers on Large Language Model (LLM) Planning.☆365Updated 4 months ago
- A bibliography and survey of the papers surrounding o1☆643Updated this week
- Cost aware hyperparameter tuning algorithm☆119Updated 4 months ago
- ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.☆213Updated 3 weeks ago
- Awesome In-Context RL: A curated list of In-Context Reinforcement Learning☆84Updated this week
- ☆85Updated this week
- LLM finetuning in resource-constrained environments.☆41Updated 4 months ago
- git extension for {collaborative, communal, continual} model development☆205Updated 5 months ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆105Updated 2 months ago