facebookresearch / palLinks
PAL: Predictive Analysis & Laws of Large Language Models
☆38Updated last year
Alternatives and similar repositories for pal
Users that are interested in pal are comparing it to the libraries listed below
Sorting:
- ☆82Updated 2 months ago
- ☆59Updated last year
- ICLR 2025 - official implementation for "I-Con: A Unifying Framework for Representation Learning"☆127Updated this week
- Recycling diverse models☆46Updated 3 years ago
- A regression-alike loss to improve numerical reasoning in language models - ICML 2025☆28Updated 5 months ago
- Tree prompting: easy-to-use scikit-learn interface for improved prompting.☆41Updated 2 years ago
- Towards Understanding the Mixture-of-Experts Layer in Deep Learning☆35Updated 2 years ago
- Train, tune, and infer Bamba model☆137Updated 8 months ago
- OLMost every training recipe you need to perform data interventions with the OLMo family of models.☆64Updated this week
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google Deepmind☆74Updated last year
- Codebase the paper "The Remarkable Robustness of LLMs: Stages of Inference?"☆19Updated 7 months ago
- Repository containing awesome resources regarding Hugging Face tooling.☆48Updated 2 years ago
- ☆52Updated last year
- Aioli: A unified optimization framework for language model data mixing☆32Updated last year
- Model Stock: All we need is just a few fine-tuned models☆129Updated 6 months ago
- Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind☆59Updated 8 months ago
- KV Cache Steering for Inducing Reasoning in Small Language Models☆46Updated 6 months ago
- ☆71Updated 6 months ago
- One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation☆46Updated 3 months ago
- Implementation of 🌻 Mirasol, SOTA Multimodal Autoregressive model out of Google Deepmind, in Pytorch☆91Updated 2 years ago
- ☆80Updated last year
- ML/DL Math and Method notes☆66Updated 2 years ago
- Code and pretrained models for the paper: "MatMamba: A Matryoshka State Space Model"☆62Updated last year
- ☆33Updated last year
- ☆71Updated last year
- Supercharge huggingface transformers with model parallelism.☆78Updated 6 months ago
- Automatic identification of regions in the latent space of a model that correspond to unique concepts, namely to concepts with a semantic…☆14Updated 2 years ago
- Google Research☆46Updated 3 years ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆40Updated 2 years ago
- Official Code for Rectified LpJEPA: Joint-Embedding Predictive Architectures with Sparse and Maximum-Entropy Representations☆39Updated this week