facebookresearch / pal
PAL: Predictive Analysis & Laws of Large Language Models
☆31Updated 3 weeks ago
Alternatives and similar repositories for pal:
Users who are interested in pal are comparing it to the repositories listed below
- Code for paper: "Privately generating tabular data using language models".☆14Updated last year
- Recycling diverse models☆44Updated 2 years ago
- Research on Tabular Foundation Models☆38Updated last month
- The official repository for HyperZ⋅Z⋅W Operator Connects Slow-Fast Networks for Full Context Interaction.☆31Updated 2 weeks ago
- ☆26Updated last year
- Implementation of a modular, high-performance, and simplistic mamba for high-speed applications☆33Updated 2 months ago
- Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24]☆49Updated last month
- Aioli: A unified optimization framework for language model data mixing☆19Updated last week
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google DeepMind☆55Updated 4 months ago
- Implementation of MambaFormer in Pytorch ++ Zeta from the paper: "Can Mamba Learn How to Learn? A Comparative Study on In-Context Learnin…☆20Updated this week
- Official PyTorch implementation for NeurIPS'24 paper "Knowledge Composition using Task Vectors with Learned Anisotropic Scaling"☆16Updated 2 months ago
- Official implementation for Sparse MetA-Tuning (SMAT)☆16Updated 7 months ago
- Implementation of TableFormer, Robust Transformer Modeling for Table-Text Encoding, in PyTorch☆37Updated 2 years ago
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Updated 7 months ago
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks"☆17Updated 3 weeks ago
- Towards Understanding the Mixture-of-Experts Layer in Deep Learning☆22Updated last year
- Official code for the ICML 2024 paper "The Entropy Enigma: Success and Failure of Entropy Minimization"☆47Updated 8 months ago
- Visualize multi-model embedding spaces. The first goal is to quickly get a lay of the land of any embedding space. Then be able to scroll…☆27Updated 8 months ago
- ☆29Updated 8 months ago
- PyTorch implementation of Soft MoE by Google Brain in "From Sparse to Soft Mixtures of Experts" (https://arxiv.org/pdf/2308.00951.pdf)☆69Updated last year
- Implementation of Bitune: Bidirectional Instruction-Tuning☆17Updated 7 months ago
- ☆51Updated 7 months ago
- ☆12Updated 5 months ago
- ☆58Updated 10 months ago
- ☆11Updated last month
- Official repo of Progressive Data Expansion: data, code and evaluation☆27Updated last year
- ☆25Updated last year
- ☆28Updated last year
- Exploring an idea where one forgets about efficiency and carries out attention across each edge of the nodes (tokens)☆44Updated 3 months ago