facebookresearch / pal
PAL: Predictive Analysis & Laws of Large Language Models
☆31 Updated 2 months ago
Alternatives and similar repositories for pal:
Users who are interested in pal are comparing it to the libraries listed below
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google DeepMind ☆59 Updated 6 months ago
- Recycling diverse models ☆44 Updated 2 years ago
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [to appear at ICLR 2025] ☆18 Updated last week
- Aioli: A unified optimization framework for language model data mixing ☆22 Updated 2 months ago
- Latest Weight Averaging (NeurIPS HITY 2022) ☆28 Updated last year
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions. ☆20 Updated 9 months ago
- Official repo of Progressive Data Expansion: data, code and evaluation ☆28 Updated last year
- Official code for the ICML 2024 paper "The Entropy Enigma: Success and Failure of Entropy Minimization" ☆49 Updated 9 months ago
- ☆15 Updated last year
- Model Stock: All we need is just a few fine-tuned models ☆107 Updated 6 months ago
- Towards Understanding the Mixture-of-Experts Layer in Deep Learning ☆25 Updated last year
- Automatic identification of regions in the latent space of a model that correspond to unique concepts, namely to concepts with a semantic… ☆13 Updated last year
- Personal implementation of ASIF by Antonio Norelli ☆25 Updated 10 months ago
- MambaFormer in-context learning experiments and implementation for https://arxiv.org/abs/2402.04248 ☆50 Updated 9 months ago
- ☆18 Updated 8 months ago
- Official PyTorch implementation for NeurIPS'24 paper "Knowledge Composition using Task Vectors with Learned Anisotropic Scaling" ☆19 Updated last month
- Implementation of Bitune: Bidirectional Instruction-Tuning ☆19 Updated 9 months ago
- ☆65 Updated this week
- Official implementation for Sparse MetA-Tuning (SMAT) ☆16 Updated 8 months ago
- This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine… ☆35 Updated 2 years ago
- ☆10 Updated 3 months ago
- Making Heads or Tails Towards Semantically Consistent Visual Counterfactuals ☆30 Updated 2 years ago
- ☆40 Updated 8 months ago
- ☆58 Updated last year
- Code for paper "Can contrastive learning avoid shortcut solutions?" NeurIPS 2021. ☆47 Updated 2 years ago
- Quantification of Uncertainty with Adversarial Models ☆28 Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆55 Updated 6 months ago
- ☆30 Updated 2 months ago
- Repository for the PopulAtion Parameter Averaging (PAPA) paper ☆26 Updated 11 months ago
- Implementation of MambaFormer in Pytorch ++ Zeta from the paper: "Can Mamba Learn How to Learn? A Comparative Study on In-Context Learnin… ☆20 Updated last week