lxuechen / ml-swissknife
An ML research codebase built with friends :)
☆20Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for ml-swissknife
- ☆25Updated 4 months ago
- Latest Weight Averaging (NeurIPS HITY 2022)☆20Updated last year
- A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643☆69Updated last year
- Code for "Training Neural Networks with Fixed Sparse Masks" (NeurIPS 2021).☆56Updated 2 years ago
- Revisiting Efficient Training Algorithms For Transformer-based Language Models (NeurIPS 2023)☆79Updated last year
- ☆18Updated last month
- Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"☆18Updated 2 months ago
- The repository contains code for Adaptive Data Optimization☆18Updated last month
- Code Release for "Broken Neural Scaling Laws" (BNSL) paper☆57Updated last year
- ☆40Updated 2 years ago
- Is In-Context Learning Sufficient for Instruction Following in LLMs?☆25Updated 5 months ago
- Official Repository for Dataset Inference for LLMs☆23Updated 3 months ago
- ☆47Updated 9 months ago
- ☆45Updated 9 months ago
- Official code for the paper "Attention as a Hypernetwork"☆23Updated 5 months ago
- Code accompanying our paper "Feature Learning in Infinite-Width Neural Networks" (https://arxiv.org/abs/2011.14522)☆58Updated 3 years ago
- Few-shot Learning with Auxiliary Data☆26Updated 11 months ago
- ☆44Updated last year
- ☆17Updated 2 years ago
- ☆37Updated 3 years ago
- ☆53Updated 3 weeks ago
- ☆34Updated 2 years ago
- ☆26Updated last year
- Code for the paper "Data Feedback Loops: Model-driven Amplification of Dataset Biases"☆15Updated 2 years ago
- Data for "Datamodels: Predicting Predictions with Training Data"☆91Updated last year
- ☆50Updated 6 months ago
- Efficient Scaling laws and collaborative pretraining.☆13Updated this week
- ☆26Updated 3 weeks ago
- The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns …☆16Updated last year
- ☆25Updated last month