Code for "Merging Text Transformers from Different Initializations"
☆20 · Feb 2, 2025 · Updated last year
Alternatives and similar repositories for merging-text-transformers
Users that are interested in merging-text-transformers are comparing it to the libraries listed below.
- A framework for merging models solving different tasks with different initializations into one multi-task model without any additional tr… ☆313 · Jan 18, 2024 · Updated 2 years ago
- LLM-Merging: Building LLMs Efficiently through Merging ☆209 · Sep 24, 2024 · Updated last year
- A curated list of Model Merging methods. ☆95 · Dec 3, 2025 · Updated 4 months ago
- Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning ☆30 · Jan 25, 2023 · Updated 3 years ago
- This repository is the implementation of the paper Training Free Pretrained Model Merging (CVPR2024). ☆34 · Mar 5, 2024 · Updated 2 years ago
- ☆77 · Apr 29, 2024 · Updated last year
- Code release for Dataless Knowledge Fusion by Merging Weights of Language Models (https://openreview.net/forum?id=FCnohuR6AnM) ☆92 · Jul 25, 2023 · Updated 2 years ago
- Code for the paper "No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations" ☆12 · Oct 31, 2024 · Updated last year
- ☆214 · Feb 3, 2024 · Updated 2 years ago
- Active Learning in the era of Foundation Models ☆12 · Apr 16, 2025 · Updated last year
- [CVPR 2025] LoRA Recycle: Unlocking Tuning-Free Few-Shot Adaptability in Visual Foundation Models by Recycling Pre-Tuned LoRAs ☆14 · Jun 20, 2025 · Updated 10 months ago
- Embedding Recycling for Language models ☆38 · Jul 11, 2023 · Updated 2 years ago
- Official code repository for the WACV 2022 paper "Visualizing Paired Image Similarity in Transformer Networks" ☆22 · Apr 13, 2022 · Updated 4 years ago
- Manage ML configuration with pydantic ☆16 · Mar 18, 2026 · Updated last month
- Code for "Inducer-tuning: Connecting Prefix-tuning and Adapter-tuning" (EMNLP 2022) and "Empowering Parameter-Efficient Transfer Learning… ☆11 · Feb 6, 2023 · Updated 3 years ago
- A collection of papers on zero-shot/data-free knowledge distillation from 2019 to 2021 ☆11 · Sep 8, 2021 · Updated 4 years ago
- Official implementation of COLosSAL [MICCAI 2023] ☆15 · Jul 22, 2023 · Updated 2 years ago
- ☆34 · Apr 14, 2025 · Updated last year
- ☆17 · Apr 11, 2024 · Updated 2 years ago
- Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024] ☆53 · Dec 22, 2025 · Updated 3 months ago
- [NeurIPS 2025] Official implementation of the paper "BecomingLit: Relightable Gaussian Avatars with Hybrid Neural Shading" ☆28 · Updated this week
- ☆13 · Apr 3, 2024 · Updated 2 years ago
- A Streamlit app to add structured tags to a dataset card ☆22 · Jun 30, 2022 · Updated 3 years ago
- Code for Zero-Shot Tokenizer Transfer ☆144 · Jan 14, 2025 · Updated last year
- [AAAI-25 Oral] Adaptive Calibration ☆15 · Jul 6, 2025 · Updated 9 months ago
- Code for Tangent Model Composition for Ensembling and Continual Fine-tuning (ICCV 2023) and Tangent Transformers for Composition, Privacy… ☆14 · May 14, 2024 · Updated last year
- ☆24 · Jun 7, 2021 · Updated 4 years ago
- Code for GFlowNet-DPO (Direct Preference Optimization) EMNLP 2024 Main ☆19 · Feb 22, 2026 · Updated last month
- The implementation for FREE-Merging: Fourier Transform for Model Merging with Lightweight Experts (ICCV25) ☆14 · Jun 26, 2025 · Updated 9 months ago
- ReX - typesetting mathematics ☆24 · Mar 10, 2026 · Updated last month
- Providing the answer to "How to do patching on all available SAEs on GPT-2?". It is an official repository of the implementation of the p… ☆13 · Jan 26, 2025 · Updated last year
- The official implementation of "Low-power, Continuous Remote Behavioral Localization with Event Cameras" (CVPR 2024) ☆12 · Sep 25, 2024 · Updated last year
- A JPEG codec implemented from scratch (Python JPEG codec) ☆10 · Jul 29, 2022 · Updated 3 years ago
- ☆19 · Dec 23, 2024 · Updated last year
- CoMM: Collaborative Multi-Agent, Multi-Reasoning-Path Prompting for Complex Problem Solving (NAACL 2024 Findings) ☆16 · Apr 26, 2024 · Updated last year
- ☆14 · Apr 27, 2022 · Updated 3 years ago
- An event based dataset loader under one common python API. ☆10 · Mar 22, 2022 · Updated 4 years ago
- [NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging ☆80 · Mar 1, 2025 · Updated last year
- ☆18 · Oct 7, 2022 · Updated 3 years ago