facebookresearch / pal
PAL: Predictive Analysis & Laws of Large Language Models
☆36 Updated 4 months ago
Alternatives and similar repositories for pal
Users who are interested in pal are comparing it to the libraries listed below.
- Aioli: A unified optimization framework for language model data mixing ☆27 Updated 4 months ago
- Recycling diverse models ☆44 Updated 2 years ago
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions. ☆20 Updated last year
- Towards Understanding the Mixture-of-Experts Layer in Deep Learning ☆30 Updated last year
- Codebase for the paper "The Remarkable Robustness of LLMs: Stages of Inference?" ☆18 Updated 11 months ago
- ICLR 2025 - official implementation for "I-Con: A Unifying Framework for Representation Learning" ☆92 Updated this week
- ☆72 Updated last month
- ☆25 Updated last year
- One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation ☆40 Updated 7 months ago
- Official implementation for Sparse MetA-Tuning (SMAT) ☆16 Updated 11 months ago
- Visualize multi-model embedding spaces. The first goal is to quickly get a lay of the land of any embedding space. Then be able to scroll… ☆27 Updated last year
- The official evaluation suite and dynamic data release for MixEval. ☆11 Updated 8 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆57 Updated 9 months ago
- ☆58 Updated last year
- Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24] ☆56 Updated 5 months ago
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [to appear at ICLR 2025] ☆19 Updated 2 weeks ago
- Model Stock: All we need is just a few fine-tuned models ☆116 Updated 8 months ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models… ☆34 Updated last year
- ☆51 Updated last year
- ☆32 Updated 5 months ago
- We study toy models of skill learning. ☆28 Updated 4 months ago
- [NeurIPS'24] Official PyTorch implementation for paper "Knowledge Composition using Task Vectors with Learned Anisotropic Scaling" ☆20 Updated 3 months ago
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google DeepMind ☆63 Updated 8 months ago
- Minimum Description Length probing for neural network representations ☆19 Updated 4 months ago
- Lottery Ticket Adaptation ☆39 Updated 6 months ago
- Official repo of dataset-decomposition paper [NeurIPS 2024] ☆18 Updated 5 months ago
- This repo is based on https://github.com/jiaweizzhao/GaLore ☆28 Updated 8 months ago
- ☆179 Updated last year
- Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging" ☆26 Updated 7 months ago
- Building modular LMs with parameter-efficient fine-tuning. ☆105 Updated this week