facebookresearch / palLinks
PAL: Predictive Analysis & Laws of Large Language Models
☆39Updated last year
Alternatives and similar repositories for pal
Users that are interested in pal are comparing it to the libraries listed below
Sorting:
- Recycling diverse models☆46Updated 2 years ago
- Train, tune, and infer Bamba model☆137Updated 7 months ago
- Model Stock: All we need is just a few fine-tuned models☆128Updated 5 months ago
- Towards Understanding the Mixture-of-Experts Layer in Deep Learning☆34Updated 2 years ago
- ☆82Updated last month
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google Deepmind☆72Updated last year
- ICLR 2025 - official implementation for "I-Con: A Unifying Framework for Representation Learning"☆121Updated 6 months ago
- ☆59Updated last year
- Code and pretrained models for the paper: "MatMamba: A Matryoshka State Space Model"☆62Updated last year
- ☆15Updated last year
- Implementation of 🌻 Mirasol, SOTA Multimodal Autoregressive model out of Google Deepmind, in Pytorch☆91Updated 2 years ago
- One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation☆46Updated 2 months ago
- ☆27Updated last year
- Minimum Description Length probing for neural network representations☆20Updated 11 months ago
- A regression-alike loss to improve numerical reasoning in language models - ICML 2025☆27Updated 4 months ago
- Codebase the paper "The Remarkable Robustness of LLMs: Stages of Inference?"☆19Updated 6 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆62Updated last year
- Repository containing awesome resources regarding Hugging Face tooling.☆48Updated 2 years ago
- ☆33Updated last year
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Updated last year
- Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind☆57Updated 7 months ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆38Updated last year
- ☆69Updated 5 months ago
- Official repo of Progressive Data Expansion: data, code and evaluation☆29Updated 2 years ago
- We view Large Language Models as stochastic language layers in a network, where the learnable parameters are the natural language prompts…☆95Updated last year
- ☆52Updated last year
- An automated data pipeline scaling RL to pretraining levels☆72Updated 2 months ago
- Library for the Test-based Calibration Error (TCE) metric to quantify the degree to classifier calibration.☆13Updated 2 years ago
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [ICLR 2025]☆26Updated 2 months ago
- Aioli: A unified optimization framework for language model data mixing☆32Updated 11 months ago