☆85Oct 11, 2022Updated 3 years ago
Alternatives and similar repositories for progress-measures-paper
Users that are interested in progress-measures-paper are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Omnigrok: Grokking Beyond Algorithmic Data☆64Feb 24, 2023Updated 3 years ago
- ☆11Feb 11, 2020Updated 6 years ago
- Tools for understanding how transformer predictions are built layer-by-layer☆585Aug 7, 2025Updated 8 months ago
- Code repository for "The Clock and the Pizza: Two Stories in Mechanistic Explanation of Neural Networks"☆19Nov 24, 2023Updated 2 years ago
- Deep Networks Grok All the Time and Here is Why☆38Apr 20, 2026Updated last week
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆39Mar 25, 2026Updated last month
- ☆206Nov 17, 2024Updated last year
- Xmixers: A collection of SOTA efficient token/channel mixers☆28Sep 4, 2025Updated 7 months ago
- [KDD 2023] code for "Test accuracy vs. generalization gap: model selection in NLP without accessing training or testing data" https://arx…☆12Oct 17, 2022Updated 3 years ago
- (Model-written) LLM evals library☆18Dec 13, 2024Updated last year
- [NAACL 2025] A Closer Look into Mixture-of-Experts in Large Language Models☆61Feb 7, 2025Updated last year
- ☆18Feb 28, 2025Updated last year
- Evolutionary Search for expert-level performance on any task with environmental feedback☆14Oct 12, 2025Updated 6 months ago
- ☆33Jul 8, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Sparse probing paper full code.☆67Dec 17, 2023Updated 2 years ago
- A library for mechanistic interpretability of GPT-style language models☆3,357Apr 24, 2026Updated last week
- Repo for the paper "Large Language Models Struggle to Learn Long-Tail Knowledge"☆78Apr 12, 2023Updated 3 years ago
- Using sparse coding to find distributed representations used by neural networks.☆301Nov 10, 2023Updated 2 years ago
- ☆33Nov 11, 2024Updated last year
- AVPipe :-)☆12Jul 16, 2021Updated 4 years ago
- Evaluate interpretability methods on localizing and disentangling concepts in LLMs.☆58Oct 30, 2025Updated 6 months ago
- Code for paper "Prompt Engineering a Prompt Engineer" (https://arxiv.org/abs/2311.05661)☆10Aug 1, 2024Updated last year
- [CVPR 2025] PVC: Progressive Visual Token Compression for Unified Image and Video Processing in Large Vision-Language Models☆54Jun 12, 2025Updated 10 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Code for safety test in "Keeping LLMs Aligned After Fine-tuning: The Crucial Role of Prompt Templates"☆22Sep 21, 2025Updated 7 months ago
- Machine Learning from Human Preferences☆32Mar 23, 2026Updated last month
- Localizing Memorized Sequences in Language Models☆22Oct 15, 2025Updated 6 months ago
- ☆46Feb 16, 2022Updated 4 years ago
- Test-time-training on nearest neighbors for large language models☆50Apr 18, 2024Updated 2 years ago
- LCA-on-the-line (ICML 2024 Oral)☆14Feb 13, 2025Updated last year
- ☆12Aug 6, 2024Updated last year
- Implementation of approximate free-energy minimization in PyTorch☆21Oct 16, 2021Updated 4 years ago
- DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization☆31Jan 31, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- parallel differential expression for single-cell perturbation sequencing☆25Apr 7, 2026Updated 3 weeks ago
- Code associated with ICML (2024). "Defense against Backdoor Attack on Pre-trained Language Models via Head Pruning and Attention Normaliz…☆10Feb 22, 2026Updated 2 months ago
- The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We s…☆66Dec 16, 2022Updated 3 years ago
- ☆14Mar 31, 2024Updated 2 years ago
- Training Sparse Autoencoders on Language Models☆1,335Apr 22, 2026Updated last week
- Repository for the paper: "TiC-LM: A Web-Scale Benchmark for Time-Continual LLM Pretraining" ACL Oral 2025☆23Apr 19, 2026Updated last week
- Attribution-based Parameter Decomposition☆34Jun 11, 2025Updated 10 months ago