jxbz / entropix
Computing the information content of trained neural networks
☆21Updated 3 years ago
Alternatives and similar repositories for entropix:
Users that are interested in entropix are comparing it to the libraries listed below
- Minimum Description Length probing for neural network representations☆18Updated this week
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks"☆17Updated 3 weeks ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Updated last year
- This repo contains code for the paper: "Can Foundation Models Help Us Achieve Perfect Secrecy?"☆24Updated last year
- Official code for the paper: "Metadata Archaeology"☆18Updated last year
- An attempt to merge ESBN with Transformers, to endow Transformers with the ability to emergently bind symbols☆14Updated 3 years ago
- Understanding how features learned by neural networks evolve throughout training☆32Updated 3 months ago
- Investigate the speed of adaptation of structural causal models☆16Updated 3 years ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆23Updated this week
- Efficient Dictionary Learning with Switch Sparse Autoencoders (SAEs)☆20Updated last month
- Official implementation of the paper "Interventions, Where and How? Experimental Design for Causal Models at Scale", NeurIPS 2022.☆19Updated 2 years ago
- code for paper "Accessing higher dimensions for unsupervised word translation"☆21Updated last year
- Privacy-Preserving Bandits (MLSys'20)☆23Updated 2 years ago
- PyTorch implementation for "Long Horizon Temperature Scaling", ICML 2023☆20Updated last year
- Implementation of the LOSSGRAD optimization algorithm☆15Updated 5 years ago
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing☆47Updated 3 years ago
- Implementation of N-Grammer in Flax☆16Updated 2 years ago
- This is the official repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Updated 11 months ago
- Sparse and discrete interpretability tool for neural networks☆59Updated 11 months ago
- Evaluation of neuro-symbolic engines☆34Updated 5 months ago
- Recycling diverse models☆44Updated 2 years ago
- The repository contains code for Adaptive Data Optimization☆20Updated last month
- Implements EvoNorms B0 and S0 as proposed in Evolving Normalization-Activation Layers.☆11Updated 4 years ago