mlbio-epfl / joint-inference
[ICLR 2025] Large (Vision) Language Models are Unsupervised In-Context Learners
☆19 Updated 2 months ago
Alternatives and similar repositories for joint-inference
Users who are interested in joint-inference are comparing it to the repositories listed below.
- Official implementation of MAIA, A Multimodal Automated Interpretability Agent ☆85 Updated last month
- Official Code for Paper: Beyond Matryoshka: Revisiting Sparse Coding for Adaptive Representation ☆117 Updated last month
- Unofficial Implementation of Selective Attention Transformer ☆17 Updated 9 months ago
- Remasking Discrete Diffusion Models with Inference-Time Scaling ☆36 Updated 5 months ago
- [ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight) ☆107 Updated last month
- One-shot Entropy Minimization ☆175 Updated 2 months ago
- ☆34 Updated 5 months ago
- [ICCV 2025] Auto Interpretation Pipeline and many other functionalities for Multimodal SAE Analysis. ☆146 Updated last month
- Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025 ☆30 Updated 3 months ago
- ☆16 Updated 7 months ago
- ☆47 Updated 6 months ago
- Code for Heima ☆51 Updated 3 months ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains ☆47 Updated 3 months ago
- ☆89 Updated 2 months ago
- Implementation of the paper: "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models" ☆103 Updated last week
- Official implementation of Phi-Mamba. A MOHAWK-distilled model (Transformers to SSMs: Distilling Quadratic Knowledge to Subquadratic Mode… ☆113 Updated 11 months ago
- [NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623 ☆86 Updated 10 months ago
- AnchorAttention: Improved attention for LLMs long-context training ☆212 Updated 6 months ago
- [NeurIPS 2024] Official Repository of The Mamba in the Llama: Distilling and Accelerating Hybrid Models ☆226 Updated 3 months ago
- OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement ☆104 Updated 2 weeks ago
- Code for "Reasoning to Learn from Latent Thoughts" ☆115 Updated 4 months ago
- Implementation of 🥥 Coconut, Chain of Continuous Thought, in PyTorch ☆178 Updated last month
- ☆83 Updated 11 months ago
- Esoteric Language Models ☆91 Updated 2 weeks ago
- [ICLR 2025] Official PyTorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia… ☆25 Updated 2 weeks ago
- ☆34 Updated 7 months ago
- The code implementation of Symbolic-MoE ☆37 Updated 5 months ago
- Enhancing Large Vision Language Models with Self-Training on Image Comprehension. ☆70 Updated last year
- Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models." ☆42 Updated 9 months ago
- Repo for paper https://arxiv.org/abs/2504.13837 ☆180 Updated last month