pixeli99 / MixLN
Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxiang Li, Lu Yin, Shiwei Liu
☆15Updated 2 weeks ago
Alternatives and similar repositories for MixLN:
Users that are interested in MixLN are comparing it to the libraries listed below
- ☆36Updated 2 months ago
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆31Updated 6 months ago
- Official implementation of ECCV24 paper: POA☆24Updated 5 months ago
- The official repo of continuous speculative decoding☆19Updated last month
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*☆31Updated this week
- Official Repository of Personalized Visual Instruct Tuning☆26Updated 2 months ago
- [EMNLP 2024] Official code for "Beyond Embeddings: The Promise of Visual Table in Multi-Modal Models"☆14Updated 2 months ago
- Official Pytorch Implementation of Self-emerging Token Labeling☆32Updated 9 months ago
- ☆15Updated last week
- This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs.☆17Updated 6 months ago
- ☆20Updated 6 months ago
- NeuMeta transforms neural networks by allowing a single model to adapt on the fly to different sizes, generating the right weights when n…☆38Updated 2 months ago
- TinyFusion: Diffusion Transformers Learned Shallow☆70Updated last month
- [Under Review] Official PyTorch implementation code for realizing the technical part of Phantom of Latent representing equipped with enla…☆48Updated 3 months ago
- ☆21Updated this week
- Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment☆35Updated last week
- Official Implementation of HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editing☆21Updated last month
- This repo contains code for "VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by VIdeo SpatioTemporal Augmentation"☆10Updated this week
- Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents"☆40Updated 8 months ago
- A benchmark dataset and simple code examples for measuring the perception and reasoning of multi-sensor Vision Language models.☆16Updated 2 weeks ago
- Official Implementation for "Editing Massive Concepts in Text-to-Image Diffusion Models"☆17Updated 9 months ago
- [ICML 2024] When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models☆28Updated 6 months ago
- This repository is the implementation of the paper Training Free Pretrained Model Merging (CVPR2024).☆25Updated 10 months ago
- SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image and Video Generation (arXiv: 2410.12761)☆19Updated 2 months ago
- ☆12Updated 3 months ago
- Pytorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models☆28Updated 9 months ago
- Open source community's implementation of the model from "LANGUAGE MODEL BEATS DIFFUSION — TOKENIZER IS KEY TO VISUAL GENERATION"☆15Updated last month
- ☆15Updated 11 months ago
- ☆29Updated last week