Stochastic Parameter Decomposition
☆66Feb 27, 2026Updated this week
Alternatives and similar repositories for spd
Users that are interested in spd are comparing it to the libraries listed below
Sorting:
- Code repo for the model organisms and convergent directions of EM papers.☆53Sep 22, 2025Updated 5 months ago
- Attribution-based Parameter Decomposition☆34Jun 11, 2025Updated 8 months ago
- ☆48May 27, 2025Updated 9 months ago
- ☆12Jul 17, 2022Updated 3 years ago
- Tools for studying developmental interpretability in neural networks.☆127Dec 30, 2025Updated 2 months ago
- ☆18Feb 4, 2025Updated last year
- This project demonstrates function-calling with Python and Ollama, utilizing the Africa's Talking API to send airtime and messages to pho…☆18Feb 21, 2026Updated last week
- A Rust library and CLI for computing optimal and heuristic tree decompositions☆12Feb 19, 2025Updated last year
- ☆250Feb 22, 2024Updated 2 years ago
- ☆74Feb 18, 2026Updated 2 weeks ago
- ControlArena is a collection of settings, model organisms and protocols - for running control experiments.☆158Updated this week
- Open source interpretability artefacts for R1.☆172Apr 21, 2025Updated 10 months ago
- ☆12Feb 26, 2026Updated last week
- ☆14Jan 23, 2026Updated last month
- Official repository for "DYPLOC: Dynamic Planning of Content Using Mixed Language Models for Opinion Text Generation"☆10May 20, 2022Updated 3 years ago
- Residual Quantization Autoencoder, used for interpreting LLMs☆14Jan 1, 2025Updated last year
- Library on Arduino to add over the air (OTA) Update Capabilities to bw16/rtl8720DN☆11Aug 6, 2024Updated last year
- Federated reconnaissance mini-ImageNet benchmark and baseline models☆13Sep 2, 2021Updated 4 years ago
- Pytorch implementation of HCNAF: Hyper-Conditioned Neural Autoregressive Flow (CVPR 2020)☆15Jun 14, 2020Updated 5 years ago
- BachDuet enables a human performer to improvise a duet counterpoint with a computer agent in real time.☆14Aug 8, 2022Updated 3 years ago
- ☆10Jun 3, 2019Updated 6 years ago
- The contrastive token loss function for reducing generative repetition of autoregressive neural language models.☆13May 11, 2022Updated 3 years ago
- ☆10Nov 18, 2024Updated last year
- A scalable anonymous blocklisting scheme☆12Oct 6, 2023Updated 2 years ago
- Image Tokenizer Needs Post-Training☆24Oct 4, 2025Updated 5 months ago
- ☆15Aug 19, 2024Updated last year
- Official code for the paper "Does CLIP's Generalization Performance Mainly Stem from High Train-Test Similarity?" (ICLR 2024)☆10Aug 26, 2024Updated last year
- ☆13Sep 2, 2023Updated 2 years ago
- All-in-one tool to generate, and correctly drizzle, HST, JWST, and Roman PSFs.☆13Updated this week
- An implementation of macroexpand-time conditionalization.☆13Nov 20, 2023Updated 2 years ago
- Code for Augment & Reduce, a scalable stochastic algorithm for large categorical distributions☆10May 16, 2018Updated 7 years ago
- interactively identify related Authors on arxiv☆14Sep 22, 2023Updated 2 years ago
- An implementation of Compositional Attention: Disentangling Search and Retrieval by MILA☆14Jun 1, 2022Updated 3 years ago
- SED fitting comparison on CANDELS observations☆15Dec 4, 2022Updated 3 years ago
- Tiny AI model embedded in NES ROMs to generate character names in-game.☆29Sep 28, 2025Updated 5 months ago
- see github.com/understanding-search/maze-transformer☆10Dec 8, 2023Updated 2 years ago
- Python version of MothNet: Computational model of the moth olfactory network 🐛☆12Oct 17, 2023Updated 2 years ago
- Official Pytorch Implementation of "Cross-Attention Head Position Patterns Can Align with Human Visual Concepts in Text-to-Image Generati…☆12Aug 26, 2025Updated 6 months ago
- A varitation graph tool☆10Dec 23, 2019Updated 6 years ago