jxbz / entropix
Computing the information content of trained neural networks
☆22 Updated 4 years ago
Alternatives and similar repositories for entropix
Users that are interested in entropix are comparing it to the libraries listed below
- Official code for the paper: "Metadata Archaeology" ☆19 Updated 2 years ago
- This repo contains code for the paper: "Can Foundation Models Help Us Achieve Perfect Secrecy?" ☆24 Updated 2 years ago
- Google Research ☆46 Updated 3 years ago
- Code for "Counterfactual Token Generation in Large Language Models", Arxiv 2024. ☆30 Updated last year
- Repository for the PopulAtion Parameter Averaging (PAPA) paper ☆28 Updated last year
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training" ☆24 Updated this week
- An attempt to merge ESBN with Transformers, to endow Transformers with the ability to emergently bind symbols ☆16 Updated 4 years ago
- Implements EvoNorms B0 and S0 as proposed in Evolving Normalization-Activation Layers. ☆11 Updated 5 years ago
- Multi-framework implementation of Deep Kernel Shaping and Tailored Activation Transformations, which are methods that modify neural netwo… ☆74 Updated 4 months ago
- solo-learn: a library of self-supervised methods for visual representation learning powered by Pytorch Lightning ☆23 Updated 2 years ago
- Ludwig benchmark ☆19 Updated 3 years ago
- Developing adversarial examples and showing their semantic generalization for the OpenAI CLIP model (https://github.com/openai/CLIP) ☆26 Updated 4 years ago
- ☆38 Updated last year
- Implementation of the models and datasets used in "An Information-theoretic Approach to Distribution Shifts" ☆25 Updated 4 years ago
- Understanding how features learned by neural networks evolve throughout training ☆39 Updated last year
- ☆10 Updated last year
- Experiments for the NeurIPS 2021 paper "Cockpit: A Practical Debugging Tool for the Training of Deep Neural Networks" ☆13 Updated 4 years ago
- Implementation of Metaformer, but in an autoregressive manner ☆26 Updated 3 years ago
- Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012 ☆49 Updated 3 years ago
- ☆23 Updated 11 months ago
- A python library for highly configurable transformers - easing model architecture search and experimentation. ☆49 Updated 3 years ago
- Code repository for the AISTATS 2021 paper "Towards Understanding the Optimal Behaviors of Deep Active Learning Algorithms" ☆15 Updated 4 years ago
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer ☆44 Updated last year
- A sample pattern for running CI tests on Modal ☆18 Updated 7 months ago
- Hyperparameter tuning via uncertainty modeling ☆48 Updated last year
- Python package for generating datasets to evaluate reasoning and retrieval of large language models ☆19 Updated last month
- Recycling diverse models ☆46 Updated 2 years ago
- Code for the anonymous submission "Cockpit: A Practical Debugging Tool for Training Deep Neural Networks" ☆31 Updated 4 years ago
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing ☆50 Updated 3 years ago
- Official code for paper "Non-Adversarial Image Synthesis with Generative Latent Nearest Neighbors" ☆28 Updated 5 years ago