mts-ai / ReplaceMeLinks
☆35Updated 5 months ago
Alternatives and similar repositories for ReplaceMe
Users that are interested in ReplaceMe are comparing it to the libraries listed below
Sorting:
- Official implementation of ECCV24 paper: POA☆24Updated last year
- Control LLM☆20Updated 6 months ago
- [ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia…☆26Updated 3 months ago
- The official implementation of ICLR 2025 paper "Polynomial Composition Activations: Unleashing the Dynamics of Large Language Models".☆15Updated 6 months ago
- Official repository for ICML 2024 paper "MoRe Fine-Tuning with 10x Fewer Parameters"☆21Updated 2 weeks ago
- Official PyTorch Implementation for Paper "No More Adam: Learning Rate Scaling at Initialization is All You Need"☆54Updated 9 months ago
- HGRN2: Gated Linear RNNs with State Expansion☆55Updated last year
- Official Pytorch Implementation of Self-emerging Token Labeling☆35Updated last year
- Pytorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models☆27Updated last year
- Code for the paper "Cottention: Linear Transformers With Cosine Attention"☆20Updated last year
- Official implementation of the ICML 2024 paper RoSA (Robust Adaptation)☆44Updated last year
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆34Updated last year
- This repository contains papers for a comprehensive survey on accelerated generation techniques in Large Language Models (LLMs).☆11Updated last year
- This is a simple torch implementation of the high performance Multi-Query Attention☆15Updated 2 years ago
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)☆35Updated 7 months ago
- [COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"☆18Updated 6 months ago
- This repository is the implementation of the paper Training Free Pretrained Model Merging (CVPR2024).☆31Updated last year
- Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025☆31Updated 6 months ago
- research work on multimodal cognitive ai☆67Updated 4 months ago
- Distributed Optimization Infra for learning CLIP models☆27Updated last year
- ☆16Updated last year
- The official repo of continuous speculative decoding☆30Updated 7 months ago
- Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data☆35Updated last year
- This repository includes the code to download the curated HuggingFace papers into a single markdown formatted file☆15Updated last year
- ☆19Updated 9 months ago
- ☆29Updated 3 months ago
- Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging"☆31Updated 11 months ago
- MobileLLM-R1☆54Updated last month
- Official PyTorch implementation of "LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging" (ICML 2024)☆30Updated last year
- Official repository for the paper "SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention"☆100Updated last year