CalculatedContent / setol_paperLinks
SETOL: SemiEmpirical Theory of (Deep) Learning
☆28Updated 5 months ago
Alternatives and similar repositories for setol_paper
Users that are interested in setol_paper are comparing it to the libraries listed below
Sorting:
- ☆150Updated 4 months ago
- Official repository for the paper "Grokfast: Accelerated Grokking by Amplifying Slow Gradients"☆572Updated last year
- ☆236Updated last month
- 🧱 Modula software package☆322Updated 4 months ago
- Official JAX implementation of xLSTM including fast and efficient training and inference code. 7B model available at https://huggingface.…☆105Updated last year
- Official codebase for "Quantile Reward Policy Optimization: Alignment with Pointwise Regression and Exact Partition Functions" (Matrenok …☆29Updated last month
- Deep Learning, an Energy Approach☆229Updated 7 months ago
- ☆69Updated 9 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆181Updated 6 months ago
- nanoGPT-like codebase for LLM training☆113Updated 2 months ago
- 🪄 Interpreto is an interpretability toolbox for LLMs☆95Updated 2 weeks ago
- Open source interpretability artefacts for R1.☆165Updated 8 months ago
- code for training & evaluating Contextual Document Embedding models☆201Updated 7 months ago
- ☆131Updated last year
- Library for text-to-text regression, applicable to any input string representation and allows pretraining and fine-tuning over multiple r…☆305Updated 3 weeks ago
- Dermatology ddx dataset, Jax implementations of Monte Carlo conformal prediction, plausibility regions and statistical annotation aggrega…☆679Updated last year
- Attribution-based Parameter Decomposition☆33Updated 6 months ago
- Arrakis is a library to conduct, track and visualize mechanistic interpretability experiments.☆31Updated 8 months ago
- Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'☆234Updated 5 months ago
- Lightweight and educational reimplementation of TabPFN https://arxiv.org/pdf/2511.03634☆61Updated last month
- ☆83Updated 10 months ago
- Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).☆236Updated last year
- ☆45Updated 7 months ago
- 🧠 Starter templates for doing interpretability research☆76Updated 2 years ago
- A Python toolbox for conformal prediction research on deep learning models, using PyTorch.☆439Updated 2 months ago
- Mechanistic Interpretability Visualizations using React☆306Updated last year
- ☆380Updated 4 months ago
- A toolkit that provides a range of model diffing techniques including a UI to visualize them interactively.☆49Updated this week
- An extension of the nanoGPT repository for training small MOE models.☆224Updated 10 months ago
- ☆58Updated last year