Open-source release accompanying Gao et al. 2025
☆513 · Dec 11, 2025 · Updated 4 months ago
Alternatives and similar repositories for circuit_sparsity
Users that are interested in circuit_sparsity are comparing it to the libraries listed below.
- Code to reproduce key results accompanying "SAEs (usually) Transfer Between Base and Chat Models" ☆13 · Jul 18, 2024 · Updated last year
- Multi-Layer Sparse Autoencoders (ICLR 2025) ☆29 · Feb 6, 2026 · Updated 2 months ago
- Official repository for the paper Local Linear Attention: An Optimal Interpolation of Linear and Softmax Attention For Test-Time Regressi… ☆23 · Oct 1, 2025 · Updated 6 months ago
- Code for the "Overcoming Sparsity Artifacts in Crosscoders to Interpret Chat-Tuning" paper. ☆17 · Nov 21, 2025 · Updated 4 months ago
- Code for ICLR 2023 Harnessing Out-Of-Distribution Examples via Augmenting Content and Style ☆13 · Jul 3, 2023 · Updated 2 years ago
- Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research). ☆254 · Feb 27, 2026 · Updated last month
- Marketplace ML experiment - training without backprop ☆27 · Sep 9, 2025 · Updated 7 months ago
- ☆17 · Jul 9, 2025 · Updated 9 months ago
- ☆33 · Jan 6, 2025 · Updated last year
- Open source replication of Anthropic's Crosscoders for Model Diffing ☆64 · Oct 27, 2024 · Updated last year
- Official Repo for Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics ☆72 · Mar 26, 2026 · Updated 2 weeks ago
- OpenTinker is an RL-as-a-Service infrastructure for foundation models ☆658 · Mar 21, 2026 · Updated 3 weeks ago
- ☆13 · Oct 5, 2025 · Updated 6 months ago
- An implementation of "Subspace Representations for Soft Set Operations and Sentence Similarities" (NAACL 2024) ☆10 · May 31, 2024 · Updated last year
- Steering Vector Repo from "Extracting Latent Steering Vectors from Pretrained Language Models" - ACL2022 Findings ☆11 · Mar 14, 2022 · Updated 4 years ago
- Minimal implementation of a Byte Pair Encoding (BPE) tokenizer in Zig ☆14 · Apr 7, 2025 · Updated last year
- Training Sparse Autoencoders on Language Models ☆1,312 · Mar 19, 2026 · Updated 3 weeks ago
- ☆83 · Feb 25, 2025 · Updated last year
- TopoLM: brain-like spatio-functional organization in a topographic language model ☆29 · May 23, 2025 · Updated 10 months ago
- An agent for CUDA compute-communication kernel co-design ☆34 · Mar 24, 2026 · Updated 2 weeks ago
- Script for downloading GitHub. ☆13 · Sep 24, 2020 · Updated 5 years ago
- ☆26 · Jan 14, 2025 · Updated last year
- look how they massacred my boy ☆63 · Oct 16, 2024 · Updated last year
- Ring-V2 is a reasoning MoE LLM provided and open-sourced by InclusionAI. ☆97 · Oct 23, 2025 · Updated 5 months ago
- Open source interpretability artefacts for R1. ☆172 · Apr 21, 2025 · Updated 11 months ago
- ☆159 · Dec 30, 2025 · Updated 3 months ago
- Official implementation of Categorical Flow Maps on text. ☆51 · Feb 16, 2026 · Updated last month
- Code implementing "Efficient Parallelization of a Ubiquitous Sequential Computation" (Heinsen, 2023) ☆98 · Dec 5, 2024 · Updated last year
- ☆13 · Aug 7, 2021 · Updated 4 years ago
- 3D Gif maker ☆18 · Feb 18, 2024 · Updated 2 years ago
- ☆28 · Nov 28, 2024 · Updated last year
- ☆19 · Dec 4, 2025 · Updated 4 months ago
- Scaling Long-Horizon LLM Agent via Context-Folding ☆143 · Jan 26, 2026 · Updated 2 months ago
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters ☆132 · Dec 3, 2024 · Updated last year
- Official implementation for "Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows", NeurIPS 2022, O… ☆12 · Jan 31, 2023 · Updated 3 years ago
- Pure MLX implementations of UMAP, t-SNE, PaCMAP, TriMap, DREAMS, CNE, MMAE, and NNDescent for Apple Silicon. Metal GPU for computation an… ☆80 · Mar 20, 2026 · Updated 3 weeks ago
- ☆26 · Dec 8, 2025 · Updated 4 months ago
- [ICCV 2025] Factorized Learning for Temporally Grounded Video-Language Models ☆24 · Jan 1, 2026 · Updated 3 months ago
- Official implementation of the paper "Linear Transformers with Learnable Kernel Functions are Better In-Context Models" ☆169 · Jan 16, 2025 · Updated last year