facebookresearch / pal
PAL: Predictive Analysis & Laws of Large Language Models
☆32Updated 3 months ago
Alternatives and similar repositories for pal:
Users that are interested in pal are comparing it to the libraries listed below
- Recycling diverse models☆44Updated 2 years ago
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [to appear at ICLR 2025]☆18Updated last month
- ☆26Updated 2 years ago
- Official PyTorch implementation for NeurIPS'24 paper "Knowledge Composition using Task Vectors with Learned Anisotropic Scaling"☆19Updated last month
- Patching open-vocabulary models by interpolating weights☆91Updated last year
- Latest Weight Averaging (NeurIPS HITY 2022)☆30Updated last year
- The official repository for HyperZ⋅Z⋅W Operator Connects Slow-Fast Networks for Full Context Interaction.☆36Updated last week
- ☆22Updated 2 years ago
- Implementation of Bitune: Bidirectional Instruction-Tuning☆19Updated 10 months ago
- Official code for the ICML 2024 paper "The Entropy Enigma: Success and Failure of Entropy Minimization"☆50Updated 10 months ago
- Research on Tabular Foundation Models☆45Updated 4 months ago
- Automatic identification of regions in the latent space of a model that correspond to unique concepts, namely to concepts with a semantic…☆13Updated last year
- Aioli: A unified optimization framework for language model data mixing☆23Updated 2 months ago
- Efficient Computation of d-Dimensional Earth Mover's Distance☆9Updated last year
- ☆24Updated last year
- Towards Understanding the Mixture-of-Experts Layer in Deep Learning☆27Updated last year
- Official implementation for Sparse MetA-Tuning (SMAT)☆16Updated 9 months ago
- A scalable implementation of diffusion and flow-matching with XGBoost models, applied to calorimeter data.☆18Updated 5 months ago
- Official repo of Progressive Data Expansion: data, code and evaluation☆28Updated last year
- TabDPT: Scaling Tabular Foundation Models☆26Updated this week
- Advances in Neural Information Processing Systems (NeurIPS 2021)☆22Updated 2 years ago
- Official code and data for NeurIPS 2023 paper "ImageNet-Hard: The Hardest Images Remaining from a Study of the Power of Zoom and Spatial …☆38Updated last year
- ☆58Updated last year
- Model Stock: All we need is just a few fine-tuned models☆107Updated 6 months ago
- Google Research☆46Updated 2 years ago
- ☆69Updated 3 weeks ago
- ☆51Updated 10 months ago
- One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation☆39Updated 6 months ago
- Data for "Datamodels: Predicting Predictions with Training Data"☆96Updated last year
- ☆15Updated last year