RandallBalestriero / SplineLLMLinks
☆16Updated last year
Alternatives and similar repositories for SplineLLM
Users that are interested in SplineLLM are comparing it to the libraries listed below
Sorting:
- Minimum Description Length probing for neural network representations☆18Updated 5 months ago
- Efficient Scaling laws and collaborative pretraining.☆16Updated 5 months ago
- Repository for the PopulAtion Parameter Averaging (PAPA) paper☆26Updated last year
- Official repository for the paper "Neural Differential Equations for Learning to Program Neural Nets Through Continuous Learning Rules" (…☆22Updated last month
- Official implementation for Sparse MetA-Tuning (SMAT)☆16Updated 3 weeks ago
- Profile repository of Pietro Monticone.☆11Updated last week
- ☆22Updated 3 years ago
- ☆29Updated 2 years ago
- Official code for the paper: "Metadata Archaeology"☆19Updated 2 years ago
- An attempt to merge ESBN with Transformers, to endow Transformers with the ability to emergently bind symbols☆16Updated 3 years ago
- Repo for solving arc problems with an Neural Cellular Automata☆17Updated last month
- Deep Networks Grok All the Time and Here is Why☆37Updated last year
- code for paper "Accessing higher dimensions for unsupervised word translation"☆21Updated 2 years ago
- ☆11Updated last year
- Repo for: When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment☆38Updated 2 years ago
- Quantification of Uncertainty with Adversarial Models☆30Updated 2 years ago
- ARLC, a probabilistic abductive reasoner for solving Raven's progressive matrices.☆18Updated 2 months ago
- The Energy Transformer block, in JAX☆57Updated last year
- ☆18Updated 2 years ago
- Recycling diverse models☆45Updated 2 years ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆24Updated 2 weeks ago
- Automatically generate simple meta-learning tasks from a very large space☆15Updated last year
- implementation of dualformer☆17Updated 4 months ago
- Tasks for describing differences between text distributions.☆16Updated 11 months ago
- Official code for the paper "Compositional Generalization from First Principles" (NeurIPS 2023)☆11Updated last year
- Plancraft is a minecraft environment and agent suite to test planning capabilities in LLMs☆15Updated this week
- Automatic Integration for Neural Spatio-Temporal Point Process models (AI-STPP) is a new paradigm for exact, efficient, non-parametric inf…☆24Updated 9 months ago
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [to appear at ICLR 2025]☆19Updated last month
- Code for LaMPP: Language Models as Probabilistic Priors for Perception and Action☆37Updated 2 years ago
- Official PyTorch implementation of NPwSA: "Neural Processes with Stochastic Attention: Paying more attention to the context dataset (ICLR…☆10Updated 3 years ago