Open-source release accompanying Gao et al. 2025
☆510Dec 11, 2025Updated 3 months ago
Alternatives and similar repositories for circuit_sparsity
Users that are interested in circuit_sparsity are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code to reproduce key results accompanying "SAEs (usually) Transfer Between Base and Chat Models"☆13Jul 18, 2024Updated last year
- Multi-Layer Sparse Autoencoders (ICLR 2025)☆29Feb 6, 2026Updated last month
- Code for the "Overcoming Sparsity Artifacts in Crosscoders to Interpret Chat-Tuning" paper.☆16Nov 21, 2025Updated 4 months ago
- Official repository for the paper Local Linear Attention: An Optimal Interpolation of Linear and Softmax Attention For Test-Time Regressi…☆23Oct 1, 2025Updated 5 months ago
- Code for ICLR 2023 Harnessing Out-Of-Distribution Examples via Augmenting Content and Style☆13Jul 3, 2023Updated 2 years ago
- Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).☆251Feb 27, 2026Updated 3 weeks ago
- [MM 2025] Towards Modality Generalization: A Benchmark and Prospective Analysis☆28May 22, 2025Updated 10 months ago
- ☆17Jul 9, 2025Updated 8 months ago
- Reasoning-based Evaluation and Ranking of Translations.☆20Jul 18, 2025Updated 8 months ago
- [Technical Report] A Comprehensive Evaluation of Nano Banana Pro on 14 Low-Level Vision Tasks and 40 Datasets☆71Dec 24, 2025Updated 2 months ago
- ☆33Jan 6, 2025Updated last year
- Open source replication of Anthropic's Crosscoders for Model Diffing☆63Oct 27, 2024Updated last year
- OpenTinker is an RL-as-a-Service infrastructure for foundation models☆645Updated this week
- Official Repo for Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics☆72Jan 13, 2026Updated 2 months ago
- ☆18Dec 7, 2024Updated last year
- Download, parse, and filter data from Court Listener, part of the FreeLaw projects. Data-ready for The-Pile.☆15Jun 3, 2023Updated 2 years ago
- ☆13Oct 5, 2025Updated 5 months ago
- An implementation of "Subspace Representations for Soft Set Operations and Sentence Similarities" (NAACL 2024)☆10May 31, 2024Updated last year
- Steering Vector Repo from "Extracting Latent Steering Vectors from Pretrained Language Models" - ACL2022 Findings☆11Mar 14, 2022Updated 4 years ago
- An agent for CUDA compute-communication kernel co-design☆32Mar 11, 2026Updated last week
- Minimal implementation of a Byte Pair Encoding (BPE) tokenizer in Zig☆14Apr 7, 2025Updated 11 months ago
- Training Sparse Autoencoders on Language Models☆1,272Updated this week
- ☆83Feb 25, 2025Updated last year
- Long-range camera-conditioned scene generation from one single image.☆107Dec 23, 2025Updated 3 months ago
- TopoLM: brain-like spatio-functional organization in a topographic language model☆29May 23, 2025Updated 10 months ago
- Ring-V2 is a reasoning MoE LLM provided and open-sourced by InclusionAI.☆97Oct 23, 2025Updated 5 months ago
- Script for downloading GitHub.☆13Sep 24, 2020Updated 5 years ago
- ☆26Jan 14, 2025Updated last year
- look how they massacred my boy☆63Oct 16, 2024Updated last year
- ☆157Dec 30, 2025Updated 2 months ago
- Scaling Long-Horizon LLM Agent via Context-Folding☆131Jan 26, 2026Updated last month
- A library for efficient patching and automatic circuit discovery.☆93Dec 31, 2025Updated 2 months ago
- ☆28Nov 28, 2024Updated last year
- ☆19Dec 4, 2025Updated 3 months ago
- Simple migration engine for Peewee☆19Updated this week
- Objective Develop a web-based chatbot application where users upload resumes (PDF or image). The system should: 1. Extract resume content…☆21Sep 30, 2025Updated 5 months ago
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters☆133Dec 3, 2024Updated last year
- A training framework for large-scale language models based on Megatron-Core, the COOM Training Framework is designed to efficiently handl…☆25Nov 14, 2025Updated 4 months ago
- Learning Universal Predictors☆82Aug 1, 2024Updated last year