CalculatedContent / setol_paper
SETOL: SemiEmpirical Theory of (Deep) Learning
☆27 · Updated last month
Alternatives and similar repositories for setol_paper
Users who are interested in setol_paper are comparing it to the libraries listed below.
- Deep Learning, an Energy Approach ☆204 · Updated 2 months ago
- Flexible library for merging large language models (LLMs) via evolutionary optimization (ACL 2025 Demo). ☆82 · Updated 3 weeks ago
- Attribution-based Parameter Decomposition ☆29 · Updated 2 months ago
- ☆141 · Updated 2 weeks ago
- nanoGPT-like codebase for LLM training ☆103 · Updated 3 months ago
- ☆81 · Updated 6 months ago
- Dermatology ddx dataset, Jax implementations of Monte Carlo conformal prediction, plausibility regions and statistical annotation aggrega… ☆651 · Updated last year
- ☆228 · Updated 3 weeks ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs. ☆156 · Updated 2 months ago
- Arrakis is a library to conduct, track and visualize mechanistic interpretability experiments. ☆31 · Updated 4 months ago
- ☆63 · Updated 5 months ago
- Open source interpretability artefacts for R1. ☆157 · Updated 4 months ago
- Official JAX implementation of xLSTM including fast and efficient training and inference code. 7B model available at https://huggingface.… ☆101 · Updated 7 months ago
- ☆119 · Updated 8 months ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources ☆143 · Updated 3 months ago
- PyTorch library for Active Fine-Tuning ☆90 · Updated last week
- An example starter repo for Python projects ☆295 · Updated 2 months ago
- code for training & evaluating Contextual Document Embedding models ☆197 · Updated 3 months ago
- ☆198 · Updated 5 months ago
- A toolkit for describing model features and intervening on those features to steer behavior. ☆198 · Updated 9 months ago
- ☆14 · Updated 10 months ago
- 🧱 Modula software package ☆231 · Updated 2 weeks ago
- ⏰ AI conference deadline countdowns ☆280 · Updated last week
- Steering vectors for transformer language models in Pytorch / Huggingface ☆121 · Updated 6 months ago
- Efficient LLM inference on Slurm clusters using vLLM. ☆77 · Updated this week
- A lightweight library for Bayesian analysis of LLM evals (ICML 2025 Spotlight Position Paper) ☆20 · Updated 3 months ago
- ☆53 · Updated 9 months ago
- Official repository for the paper "Grokfast: Accelerated Grokking by Amplifying Slow Gradients" ☆561 · Updated last year
- A Python toolbox for conformal prediction research on deep learning models, using PyTorch. ☆419 · Updated this week
- Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research). ☆214 · Updated 8 months ago