vertaix / Vendi-Score
☆106Updated this week
Alternatives and similar repositories for Vendi-Score:
Users that are interested in Vendi-Score are comparing it to the libraries listed below
- ☆52Updated last year
- ☆16Updated 9 months ago
- Sequence Modeling with Multiresolution Convolutional Memory (ICML 2023)☆121Updated last year
- Why Do We Need Weight Decay in Modern Deep Learning? [NeurIPS 2024]☆58Updated 3 months ago
- Language models scale reliably with over-training and on downstream tasks☆96Updated 9 months ago
- Experiment with diffusion models that you can run on your local jupyter instances☆56Updated 2 months ago
- Revisiting Efficient Training Algorithms For Transformer-based Language Models (NeurIPS 2023)☆80Updated last year
- ☆43Updated 5 months ago
- Implementation of Gated State Spaces, from the paper "Long Range Language Modeling via Gated State Spaces", in Pytorch☆97Updated last year
- ☆82Updated 11 months ago
- ☆58Updated 3 years ago
- ☆164Updated last year
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"☆66Updated 2 months ago
- ☆51Updated 7 months ago
- Personal implementation of ASIF by Antonio Norelli☆25Updated 7 months ago
- Implementation of Discrete Key / Value Bottleneck, in Pytorch☆87Updated last year
- Natural Language Descriptions of Deep Visual Features, ICLR 2022☆62Updated last year
- Data for "Datamodels: Predicting Predictions with Training Data"☆94Updated last year
- Unofficial implementation of Conformal Language Modeling by Quach et al☆29Updated last year
- Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models☆42Updated last year
- ☆78Updated last year
- Implementation of Bitune: Bidirectional Instruction-Tuning☆16Updated 7 months ago
- Sparse and discrete interpretability tool for neural networks☆58Updated 11 months ago
- ☆80Updated 5 months ago
- Sequence Modeling with Structured State Spaces☆61Updated 2 years ago
- SDLG is an efficient method to accurately estimate aleatoric semantic uncertainty in LLMs☆25Updated 7 months ago
- This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine…☆34Updated last year
- Code Release for "Broken Neural Scaling Laws" (BNSL) paper☆57Updated last year
- Training and evaluating NBM and SPAM for interpretable machine learning.☆75Updated last year
- Transformers with doubly stochastic attention☆44Updated 2 years ago