pixeli99 / MixLN
Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxiang Li, Lu Yin, Shiwei Liu
☆16Updated last month
Alternatives and similar repositories for MixLN:
Users that are interested in MixLN are comparing it to the libraries listed below
- ☆38Updated 3 months ago
- The official repo of continuous speculative decoding☆24Updated 3 months ago
- Official implementation of ECCV24 paper: POA☆24Updated 6 months ago
- This repo contains code for "VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by VIdeo SpatioTemporal Augmentation"☆11Updated last month
- This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs.☆18Updated 7 months ago
- [EMNLP 2024] Official code for "Beyond Embeddings: The Promise of Visual Table in Multi-Modal Models"☆14Updated 4 months ago
- Official Pytorch Implementation of Self-emerging Token Labeling☆32Updated 10 months ago
- Official Repository of Personalized Visual Instruct Tuning☆26Updated 3 months ago
- ☆17Updated last month
- TinyFusion: Diffusion Transformers Learned Shallow☆74Updated 2 months ago
- [ICML 2024] When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models☆28Updated 8 months ago
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆34Updated 7 months ago
- Pytorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models☆28Updated 10 months ago
- Retrieval-Augmented Personalization☆13Updated 2 months ago
- NeuMeta transforms neural networks by allowing a single model to adapt on the fly to different sizes, generating the right weights when n…☆39Updated 3 months ago
- This is the official repo for ByteVideoLLM/Dynamic-VLM☆19Updated 2 months ago
- Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents"☆41Updated last week
- [Under Review] Official PyTorch implementation code for realizing the technical part of Phantom of Latent representing equipped with enla…☆49Updated 4 months ago
- Official PyTorch Implementation for Task Vectors are Cross-Modal☆21Updated 2 months ago
- Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment☆41Updated last month
- OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation, arXiv 2024