Official code for paper "Shadow-FT: Tuning Instruct via Base"
☆50Apr 18, 2026Updated last month
Alternatives and similar repositories for Shadow-FT
Users that are interested in Shadow-FT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS 2024] For paper Parameter Competition Balancing for Model Merging☆48Oct 11, 2024Updated last year
- Adapter-X: A Novel General Parameter-Efficient Fine-Tuning Framework for Vision☆11Jul 22, 2024Updated last year
- Reasoning Activation in LLMs via Small Model Transfer (NeurIPS 2025)☆22Oct 16, 2025Updated 7 months ago
- ACL24☆11Jun 7, 2024Updated 2 years ago
- This is the open-source code for TokenCarve.☆25Jan 23, 2026Updated 4 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official repository for Activation-Informed Merging (AIM) of Large Language Models☆23Feb 10, 2025Updated last year
- Efficient and Effective Weight-Ensembling Mixture of Experts for Multi-Task Model Merging. Arxiv, 2024.☆16Oct 28, 2024Updated last year
- ☆22Oct 22, 2024Updated last year
- Official code for our paper "Model Composition for Multimodal Large Language Models" (ACL 2024)☆31Jan 8, 2025Updated last year
- ☆21Jul 24, 2025Updated 10 months ago
- Modified Logisic Regression for the Positive and Unlabeled Learning Problem☆12Feb 1, 2014Updated 12 years ago
- ☆14May 4, 2024Updated 2 years ago
- official code repo for paper "Merging Models on the Fly Without Retraining: A Sequential Approach to Scalable Continual Model Merging"☆25Oct 11, 2025Updated 8 months ago
- [ICML 2024] Model Tailor: Mitigating Catastrophic Forgetting in Multi-modal Large Language Models☆34Sep 1, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Please visit https://github.com/HKUSTDial/NL2SQL360 to get the official code!☆10Sep 1, 2024Updated last year
- Reverse engineered ChatGPT API☆10Feb 14, 2023Updated 3 years ago
- ☆26Oct 9, 2025Updated 8 months ago
- B站爬虫☆15Dec 10, 2023Updated 2 years ago
- A book about Ph.D. student and research career planning☆29Oct 21, 2025Updated 7 months ago
- Repo for paper "Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability"☆107Apr 23, 2026Updated last month
- Code repository for the robust active label correction paper.☆11Apr 12, 2018Updated 8 years ago
- homework in SCUT_SE☆12Nov 9, 2021Updated 4 years ago
- DQN with freezing target network in tensorflow on pygame FlappyBird☆11Dec 19, 2018Updated 7 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- dijkstra algorithm optimized with heap☆14Dec 20, 2018Updated 7 years ago
- This repo lists some researches and applications in PU learning.☆12Mar 12, 2020Updated 6 years ago
- A unified framework for controllable caption generation across images, videos, and audio. Supports multi-modal inputs and customizable ca…☆54Jul 24, 2025Updated 10 months ago
- ☆22Jun 1, 2025Updated last year
- ☆22Feb 4, 2026Updated 4 months ago
- Next-Generation AI-Assisted Kernel Engineering for Multi-Chip Systems☆64Updated this week
- Solution for Team Rayee, ranks 1st place in reconstructing partial textured objects (track 2), and 2nd overall in the SHARP Challenge at …☆21Oct 12, 2022Updated 3 years ago
- ☆31Jun 22, 2025Updated 11 months ago
- Low-Rank Rescaled Vision Transformer Fine-Tuning: A Residual Design Approach, CVPR 2024☆26Jul 25, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Accepted By The 39th Annual Conference on Neural Information Processing Systems Datasets and Benchmarks Track☆25Nov 17, 2025Updated 6 months ago
- Source code of "FlowWalker: A Memory-efficient and High-performance GPU-based Dynamic Graph Random Walk Framework"☆11Oct 23, 2024Updated last year
- Non-Autoregressive Math Word Problem Solver with Unified Tree Structure☆12Jan 13, 2024Updated 2 years ago
- 投资组合评比器,基于python3.8和mysql8.0。由Concyclics和wingholy完成主要编程工作,可实现对于蛋卷基金和且慢基金平台投资组合信息的收集和对比。☆11Jun 18, 2021Updated 4 years ago
- [ACL 2025 Main] (🏆 Outstanding Paper Award) Rethinking the Role of Prompting Strategies in LLM Test-Time Scaling: A Perspective of Proba…☆18Aug 15, 2025Updated 9 months ago
- Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic☆32Feb 18, 2026Updated 3 months ago
- A user-friendly & efficient knowledge distillation framework for LLMs, supporting off-policy, on-policy (OPD), cross-tokenizer, multimoda…☆195Jun 5, 2026Updated last week