mlbio-epfl / joint-inference
[ICLR 2025] Large (Vision) Language Models are Unsupervised In-Context Learners
☆11Updated last month
Alternatives and similar repositories for joint-inference
Users that are interested in joint-inference are comparing it to the libraries listed below
Sorting:
- One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation☆39Updated 7 months ago
- ☆68Updated 9 months ago
- ☆31Updated last year
- ICLR 2025 - official implementation for "I-Con: A Unifying Framework for Representation Learning"☆80Updated 2 weeks ago
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"☆99Updated 4 months ago
- Unofficial implementation of Conformal Language Modeling by Quach et al☆28Updated last year
- ☆53Updated 5 months ago
- ☆31Updated 4 months ago
- Evaluation of neuro-symbolic engines☆35Updated 9 months ago
- Implementation of Spectral State Space Models☆16Updated last year
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [to appear at ICLR 2025]☆19Updated 2 months ago
- ☆31Updated last year
- ☆18Updated last month
- Understanding how features learned by neural networks evolve throughout training☆34Updated 6 months ago
- ☆27Updated last year
- Latent Large Language Models☆18Updated 8 months ago
- Efficient Scaling laws and collaborative pretraining.☆16Updated 3 months ago
- ☆31Updated last year
- Automatic identification of regions in the latent space of a model that correspond to unique concepts, namely to concepts with a semantic…☆13Updated last year
- A collection of various LLM sampling methods implemented in pure Pytorch☆24Updated 5 months ago
- Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind☆50Updated 3 months ago
- Efficient encoder-decoder architecture for small language models (≤1B parameters) with cross-architecture knowledge distillation and visi…☆23Updated 3 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆57Updated 8 months ago
- ☆71Updated 8 months ago
- ☆56Updated last week
- Using FlexAttention to compute attention with different masking patterns☆43Updated 7 months ago
- A system for automating selection and optimization of pre-trained models from the TAO Model Zoo☆25Updated 10 months ago
- ☆75Updated 7 months ago
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)☆30Updated 2 months ago
- We study toy models of skill learning.☆26Updated 3 months ago