mts-ai / ReplaceMeLinks
☆32Updated 4 months ago
Alternatives and similar repositories for ReplaceMe
Users that are interested in ReplaceMe are comparing it to the libraries listed below
Sorting:
- Official implementation of ECCV24 paper: POA☆24Updated last year
- The official implementation of ICLR 2025 paper "Polynomial Composition Activations: Unleashing the Dynamics of Large Language Models".☆15Updated 5 months ago
- Official repository for ICML 2024 paper "MoRe Fine-Tuning with 10x Fewer Parameters"☆20Updated 4 months ago
- Pytorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models☆27Updated last year
- Official PyTorch Implementation for Paper "No More Adam: Learning Rate Scaling at Initialization is All You Need"☆53Updated 8 months ago
- Control LLM☆19Updated 6 months ago
- [NeurIPS 2025] Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation, arXiv 2024☆62Updated 2 weeks ago
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆34Updated last year
- Official Pytorch Implementation of Self-emerging Token Labeling☆35Updated last year
- Official repository for the paper "NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks". This rep…☆59Updated 11 months ago
- Autoregressive Image Generation☆32Updated 3 months ago
- [ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia…☆26Updated 2 months ago
- Official implementation of the ICML 2024 paper RoSA (Robust Adaptation)☆44Updated last year
- HGRN2: Gated Linear RNNs with State Expansion☆54Updated last year
- Official implementation of RMoE (Layerwise Recurrent Router for Mixture-of-Experts)☆24Updated last year
- Official PyTorch implementation of "LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging" (ICML 2024)☆30Updated last year
- Official Repository for Task-Circuit Quantization☆24Updated 4 months ago
- [ICML'25] "Rethinking Addressing in Language Models via Contextualized Equivariant Positional Encoding" by Jiajun Zhu, Peihao Wang, Ruisi…☆13Updated 4 months ago
- Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025☆31Updated 5 months ago
- A benchmark dataset and simple code examples for measuring the perception and reasoning of multi-sensor Vision Language models.☆19Updated 9 months ago
- This repository includes the code to download the curated HuggingFace papers into a single markdown formatted file☆14Updated last year
- This is a simple torch implementation of the high performance Multi-Query Attention☆15Updated 2 years ago
- Code for "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"☆18Updated 6 months ago
- The official repo of continuous speculative decoding☆30Updated 6 months ago
- Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24]☆58Updated 10 months ago
- [AAAI 2025] Does VLM Classification Benefit from LLM Description Semantics?☆22Updated 2 months ago
- Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"☆35Updated last year
- ☆17Updated 3 weeks ago
- ☆47Updated last year
- Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging"☆30Updated 11 months ago