Code associated with papers on superposition (in ML interpretability)
☆38Sep 13, 2022Updated 3 years ago
Alternatives and similar repositories for superposition
Users that are interested in superposition are comparing it to the libraries listed below.
- Config files for my GitHub profile.☆25Mar 23, 2025Updated last year
- ☆28May 4, 2023Updated 2 years ago
- Understanding how features learned by neural networks evolve throughout training☆41Oct 24, 2024Updated last year
- Continual Memorization of Factoids in Large Language Models☆12Nov 20, 2024Updated last year
- ☆43Nov 16, 2021Updated 4 years ago
- BFloat16 Fused Adam Operator for PyTorch☆17Nov 16, 2024Updated last year
- ☆17Feb 14, 2024Updated 2 years ago
- ☆12Apr 26, 2024Updated last year
- Implicit Convolutional Kernels for Steerable CNNs [NeurIPS'23]☆30Feb 25, 2025Updated last year
- Variational Autoencoder with non-euclidean (hyperbolic) latent space☆12Nov 25, 2022Updated 3 years ago
- Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8.☆46Jul 17, 2024Updated last year
- A repo based on XiLin Li's PSGD repo that extends some of the experiments.☆14Oct 7, 2024Updated last year
- Layerwise Batch Entropy Regularization☆24Aug 3, 2022Updated 3 years ago
- ☆17Jul 9, 2025Updated 8 months ago
- ☆22Mar 23, 2022Updated 4 years ago
- Implementation of Bitune: Bidirectional Instruction-Tuning☆27Jun 19, 2025Updated 9 months ago
- [NeurIPS 2024] Official Implementation of "SDformer: Similarity-driven Discrete Transformer For Time Series Generation"☆13May 23, 2025Updated 10 months ago
- ☆12Sep 26, 2019Updated 6 years ago
- ☆26Updated this week
- ☆16Dec 30, 2024Updated last year
- Fast, High-Fidelity LLM Decoding with Regex Constraints☆21Jul 26, 2024Updated last year
- Syntax-aware Word Mover’s Distance for Sentence Similarity Modeling☆20Nov 6, 2023Updated 2 years ago
- Code for "Preference Tuning For Toxicity Mitigation Generalizes Across Languages." Paper accepted at Findings of EMNLP 2024☆18Mar 25, 2025Updated last year
- A searchable (vector + FTS) index of every issue of the Whole Earth Catalog☆31Mar 20, 2026Updated last week
- ☆40Mar 17, 2026Updated last week
- ☆13Apr 9, 2022Updated 3 years ago
- ☆15Jun 18, 2025Updated 9 months ago
- ☆18Mar 13, 2026Updated 2 weeks ago
- The most parameter efficient machine learning models on a few popular benchmarks☆42May 15, 2022Updated 3 years ago
- ☆11Nov 27, 2019Updated 6 years ago
- gRelay is an open source project written in Go that provides the circuit break pattern with a relay idea behind.☆31Sep 1, 2022Updated 3 years ago
- ☆25Jan 1, 2025Updated last year
- Official implementation of Categorical Flow Maps on text.☆49Feb 16, 2026Updated last month
- Implementation of the algorithm detailed in paper "Evolutionary design of molecules based on deep learning and a genetic algorithm"☆24Dec 15, 2023Updated 2 years ago
- ☆16Jan 3, 2023Updated 3 years ago
- ☆14Feb 2, 2025Updated last year
- Learning world model learning from scratch☆50Feb 5, 2026Updated last month
- official code for paper Probing the Decision Boundaries of In-context Learning in Large Language Models. https://arxiv.org/abs/2406.11233…☆19Jul 27, 2025Updated 8 months ago
- ☆14Mar 2, 2023Updated 3 years ago