CalculatedContent / setol_paperLinks
SETOL: SemiEmpirical Theory of (Deep) Learning
☆28Updated 2 months ago
Alternatives and similar repositories for setol_paper
Users that are interested in setol_paper are comparing it to the libraries listed below
Sorting:
- ☆230Updated this week
- Attribution-based Parameter Decomposition☆31Updated 4 months ago
- 🧱 Modula software package☆282Updated last month
- Flexible library for merging large language models (LLMs) via evolutionary optimization (ACL 2025 Demo).☆88Updated 2 months ago
- Arrakis is a library to conduct, track and visualize mechanistic interpretability experiments.☆31Updated 5 months ago
- Library for text-to-text regression, applicable to any input string representation and allows pretraining and fine-tuning over multiple r…☆268Updated this week
- Dermatology ddx dataset, Jax implementations of Monte Carlo conformal prediction, plausibility regions and statistical annotation aggrega…☆650Updated last year
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆164Updated 3 months ago
- nanoGPT-like codebase for LLM training☆108Updated 4 months ago
- Mechanistic Interpretability Visualizations using React☆291Updated 9 months ago
- Deep Learning, an Energy Approach☆215Updated 4 months ago
- Open source interpretability artefacts for R1.☆161Updated 5 months ago
- ☆348Updated last month
- ☆142Updated last month
- Official codebase for "Quantile Reward Policy Optimization: Alignment with Pointwise Regression and Exact Partition Functions" (Matrenok …☆27Updated 2 months ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆147Updated last week
- Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).☆221Updated 9 months ago
- ☆81Updated 7 months ago
- Official repository for the paper "Grokfast: Accelerated Grokking by Amplifying Slow Gradients"☆562Updated last year
- code for training & evaluating Contextual Document Embedding models☆197Updated 5 months ago
- ☆65Updated 6 months ago
- ☆124Updated 9 months ago
- Sparsify transformers with SAEs and transcoders☆631Updated this week
- ☆54Updated 10 months ago
- A Python toolbox for conformal prediction research on deep learning models, using PyTorch.☆426Updated this week
- ☆14Updated 11 months ago
- Official JAX implementation of xLSTM including fast and efficient training and inference code. 7B model available at https://huggingface.…☆103Updated 9 months ago
- ☆216Updated 10 months ago
- ☆150Updated last year
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆98Updated 2 weeks ago