harish-kamath / rqaeLinks
Residual Quantization Autoencoder, used for interpreting LLMs
☆12Updated 5 months ago
Alternatives and similar repositories for rqae
Users that are interested in rqae are comparing it to the libraries listed below
Sorting:
- Minimum Description Length probing for neural network representations☆18Updated 4 months ago
- Aioli: A unified optimization framework for language model data mixing☆27Updated 5 months ago
- ☆29Updated 2 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Updated last year
- Efficient Scaling laws and collaborative pretraining.☆16Updated 4 months ago
- ☆18Updated last year
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32Updated last year
- Code for experiments on self-prediction as a way to measure introspection in LLMs☆14Updated 6 months ago
- Understanding how features learned by neural networks evolve throughout training☆35Updated 8 months ago
- ☆23Updated 6 months ago
- Closed-form polynomial approximations to neural networks☆13Updated 4 months ago
- Implementation of Spectral State Space Models☆16Updated last year
- Code accompanying the paper "A Language Model's Guide Through Latent Space". It contains functionality for training and using concept vec…☆20Updated last year
- ☆22Updated 8 months ago
- ☆13Updated 3 weeks ago
- Codes and files for the paper Are Emergent Abilities in Large Language Models just In-Context Learning☆33Updated 5 months ago
- Repo for solving arc problems with an Neural Cellular Automata☆16Updated last month
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features"☆10Updated 2 months ago
- An attempt to merge ESBN with Transformers, to endow Transformers with the ability to emergently bind symbols☆16Updated 3 years ago
- ☆16Updated last year
- ☆35Updated 2 years ago
- ☆18Updated 3 weeks ago
- Simplifying parsing of large jsonline files in NLP Workflows☆12Updated 3 years ago
- Simple repository for training small reasoning models☆33Updated 4 months ago
- Lottery Ticket Adaptation☆39Updated 7 months ago
- ☆23Updated 2 months ago
- Scripts for downloading and pre-processing the `proof-pile`, a high quality dataset of mathematical text and code.☆19Updated 2 years ago
- Few-shot Learning with Auxiliary Data☆28Updated last year
- Python package for generating datasets to evaluate reasoning and retrieval of large language models☆18Updated this week
- Source-to-Source Debuggable Derivatives in Pure Python☆15Updated last year