facebookresearch / palLinks
PAL: Predictive Analysis & Laws of Large Language Models
☆38Updated last year
Alternatives and similar repositories for pal
Users that are interested in pal are comparing it to the libraries listed below
Sorting:
- ☆82Updated 2 months ago
- Train, tune, and infer Bamba model☆138Updated 7 months ago
- ICLR 2025 - official implementation for "I-Con: A Unifying Framework for Representation Learning"☆126Updated 7 months ago
- A regression-alike loss to improve numerical reasoning in language models - ICML 2025☆27Updated 5 months ago
- ☆59Updated last year
- OLMost every training recipe you need to perform data interventions with the OLMo family of models.☆64Updated this week
- ☆52Updated last year
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google Deepmind☆72Updated last year
- Recycling diverse models☆46Updated 3 years ago
- Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind☆59Updated 7 months ago
- Repository containing awesome resources regarding Hugging Face tooling.☆48Updated 2 years ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆61Updated last year
- Towards Understanding the Mixture-of-Experts Layer in Deep Learning☆34Updated 2 years ago
- Codebase the paper "The Remarkable Robustness of LLMs: Stages of Inference?"☆19Updated 7 months ago
- Conference schedule, top papers, and analysis of the data for NeurIPS 2023!☆120Updated 2 years ago
- PyTorch implementation of Soft MoE by Google Brain in "From Sparse to Soft Mixtures of Experts" (https://arxiv.org/pdf/2308.00951.pdf)☆82Updated 2 years ago
- Source code of "Dr.LLM: Dynamic Layer Routing in LLMs"☆41Updated 3 months ago
- Supercharge huggingface transformers with model parallelism.☆77Updated 6 months ago
- Model Stock: All we need is just a few fine-tuned models☆128Updated 5 months ago
- One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation☆46Updated 3 months ago
- UQ: Assessing Language Models on Unsolved Questions☆30Updated 5 months ago
- Code and pretrained models for the paper: "MatMamba: A Matryoshka State Space Model"☆62Updated last year
- Aioli: A unified optimization framework for language model data mixing☆32Updated last year
- Minimum Description Length probing for neural network representations☆20Updated last year
- Tree prompting: easy-to-use scikit-learn interface for improved prompting.☆41Updated 2 years ago
- ML/DL Math and Method notes☆66Updated 2 years ago
- Interactive coding assistant for data scientists and machine learning developers, empowered by large language models.☆99Updated last year
- ☆80Updated last year
- ☆13Updated last year
- We view Large Language Models as stochastic language layers in a network, where the learnable parameters are the natural language prompts…☆95Updated last year