facebookresearch / palLinks
PAL: Predictive Analysis & Laws of Large Language Models
☆37Updated 7 months ago
Alternatives and similar repositories for pal
Users that are interested in pal are comparing it to the libraries listed below
Sorting:
- ICLR 2025 - official implementation for "I-Con: A Unifying Framework for Representation Learning"☆111Updated 2 months ago
- ☆76Updated last week
- Towards Understanding the Mixture-of-Experts Layer in Deep Learning☆31Updated last year
- ☆59Updated last year
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google Deepmind☆67Updated 11 months ago
- Recycling diverse models☆45Updated 2 years ago
- PyTorch implementation of Soft MoE by Google Brain in "From Sparse to Soft Mixtures of Experts" (https://arxiv.org/pdf/2308.00951.pdf)☆77Updated last year
- Train, tune, and infer Bamba model☆131Updated 2 months ago
- ☆63Updated 3 weeks ago
- Model Stock: All we need is just a few fine-tuned models☆122Updated 3 weeks ago
- Conference schedule, top papers, and analysis of the data for NeurIPS 2023!☆120Updated last year
- ML/DL Math and Method notes☆63Updated last year
- Library for the Test-based Calibration Error (TCE) metric to quantify the degree to classifier calibration.☆13Updated last year
- Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind☆56Updated 3 months ago
- Implementation of a modular, high-performance, and simplistic mamba for high-speed applications☆36Updated 9 months ago
- Repository containing awesome resources regarding Hugging Face tooling.☆48Updated last year
- Code and pretrained models for the paper: "MatMamba: A Matryoshka State Space Model"☆61Updated 9 months ago
- Implementation of MambaByte in "MambaByte: Token-free Selective State Space Model" in Pytorch and Zeta☆122Updated 2 weeks ago
- Generating and validating natural-language explanations for the brain.☆56Updated this week
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆60Updated last year
- Tree prompting: easy-to-use scikit-learn interface for improved prompting.☆40Updated last year
- Supercharge huggingface transformers with model parallelism.☆77Updated last month
- ☆51Updated last year
- Repository for the paper: "TiC-CLIP: Continual Training of CLIP Models".☆103Updated last year
- Google Research☆45Updated 2 years ago
- We view Large Language Models as stochastic language layers in a network, where the learnable parameters are the natural language prompts…☆94Updated last year
- Code for paper: "Privately generating tabular data using language models".☆15Updated 2 years ago
- Implementation of MambaFormer in Pytorch ++ Zeta from the paper: "Can Mamba Learn How to Learn? A Comparative Study on In-Context Learnin…☆20Updated 2 weeks ago
- A regression-alike loss to improve numerical reasoning in language models - ICML 2025☆24Updated 2 weeks ago
- Code for the paper: "No Zero-Shot Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance" [NeurI…☆90Updated last year