CalculatedContent / setol_paperLinks
SETOL: SemiEmpirical Theory of (Deep) Learning
☆28Updated 3 months ago
Alternatives and similar repositories for setol_paper
Users that are interested in setol_paper are comparing it to the libraries listed below
Sorting:
- Attribution-based Parameter Decomposition☆31Updated 4 months ago
 - ☆142Updated last month
 - Open source interpretability artefacts for R1.☆163Updated 6 months ago
 - ☆66Updated 7 months ago
 - Official codebase for "Quantile Reward Policy Optimization: Alignment with Pointwise Regression and Exact Partition Functions" (Matrenok …☆27Updated 3 months ago
 - ☆230Updated this week
 - Arrakis is a library to conduct, track and visualize mechanistic interpretability experiments.☆31Updated 6 months ago
 - The simplest, fastest repository for training/finetuning medium-sized GPTs.☆170Updated 4 months ago
 - ☆55Updated 11 months ago
 - Unified access to Large Language Model modules using NNsight☆52Updated last week
 - Mechanistic Interpretability Visualizations using React☆297Updated 10 months ago
 - 🧱 Modula software package☆299Updated 2 months ago
 - ☆14Updated last year
 - nanoGPT-like codebase for LLM training☆110Updated this week
 - ☆81Updated 8 months ago
 - Deep Learning, an Energy Approach☆218Updated 4 months ago
 - code for training & evaluating Contextual Document Embedding models☆199Updated 5 months ago
 - Flexible library for merging large language models (LLMs) via evolutionary optimization (ACL 2025 Demo).☆91Updated 2 months ago
 - Official JAX implementation of xLSTM including fast and efficient training and inference code. 7B model available at https://huggingface.…☆104Updated 9 months ago
 - Open source replication of Anthropic's Crosscoders for Model Diffing☆59Updated last year
 - PyTorch library for Active Fine-Tuning☆93Updated last month
 - A toolkit for describing model features and intervening on those features to steer behavior.☆211Updated 11 months ago
 - Training-Ready RL Environments + Evals☆158Updated this week
 - $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆147Updated last month
 - Official repository for the paper "Grokfast: Accelerated Grokking by Amplifying Slow Gradients"☆563Updated last year
 - Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).☆224Updated 10 months ago
 - ☆355Updated 2 months ago
 - Library for text-to-text regression, applicable to any input string representation and allows pretraining and fine-tuning over multiple r…☆281Updated this week
 - ☆47Updated last month
 - Mediterranean Machine Learning school 2025 tutorials☆39Updated last month