Official code for paper "Shadow-FT: Tuning Instruct via Base"
☆50Apr 18, 2026Updated 2 months ago
Alternatives and similar repositories for Shadow-FT
Users that are interested in Shadow-FT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The implementation for FREE-Merging: Fourier Transform for Model Merging with Lightweight Experts (ICCV25)☆16Jun 26, 2025Updated last year
- [NeurIPS 2024] For paper Parameter Competition Balancing for Model Merging☆48Oct 11, 2024Updated last year
- [ICLR 2026] Official Implementation of ProxyThinker: Test-Time Guidance through Small Visual Reasoners.☆22Sep 24, 2025Updated 9 months ago
- Adapter-X: A Novel General Parameter-Efficient Fine-Tuning Framework for Vision☆11Jul 22, 2024Updated last year
- Reasoning Activation in LLMs via Small Model Transfer (NeurIPS 2025)☆22Oct 16, 2025Updated 8 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Code for Research Project TLDR☆25Jul 28, 2025Updated 11 months ago
- ACL24☆11Jun 7, 2024Updated 2 years ago
- Official repository for Activation-Informed Merging (AIM) of Large Language Models☆24Feb 10, 2025Updated last year
- Efficient and Effective Weight-Ensembling Mixture of Experts for Multi-Task Model Merging. Arxiv, 2024.☆16Oct 28, 2024Updated last year
- ☆22Oct 22, 2024Updated last year
- Github repository for CLAPACK (fork of CLAPACK 3.2.1 patched for our needs)☆10Aug 15, 2018Updated 7 years ago
- SG-Bench: Evaluating LLM Safety Generalization Across Diverse Tasks and Prompt Types☆26Nov 29, 2024Updated last year
- ☆14May 4, 2024Updated 2 years ago
- Unofficial implementation of the Ask-LLM paper 'How to Train Data-Efficient LLMs', arXiv:2402.09668.☆12Jun 19, 2024Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- official code repo for paper "Merging Models on the Fly Without Retraining: A Sequential Approach to Scalable Continual Model Merging"☆25Oct 11, 2025Updated 8 months ago
- Official implementation of paper "HiAE: A High-Throughput Authenticated Encryption Algorithm for Cross-Platfor Efficiency"☆19Nov 11, 2025Updated 7 months ago
- [ICML 2024] Model Tailor: Mitigating Catastrophic Forgetting in Multi-modal Large Language Models☆34Sep 1, 2024Updated last year
- Please visit https://github.com/HKUSTDial/NL2SQL360 to get the official code!☆10Sep 1, 2024Updated last year
- Reverse engineered ChatGPT API☆10Feb 14, 2023Updated 3 years ago
- B站爬虫☆15Dec 10, 2023Updated 2 years ago
- A book about Ph.D. student and research career planning☆29Oct 21, 2025Updated 8 months ago
- Repo for paper "Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability"☆108Apr 23, 2026Updated 2 months ago
- homework in SCUT_SE☆12Nov 9, 2021Updated 4 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Interpreting Chest X-rays Like a Radiologist: A Benchmark with Clinical Reasoning, release the dataset and the model weight☆13May 26, 2025Updated last year
- ☆23Jun 1, 2025Updated last year
- HiAE - A High-Throughput Authenticated Encryption Algorithm for Cross-Platform Efficiency.☆19May 27, 2026Updated last month
- Next-Generation AI-Assisted Kernel Engineering for Multi-Chip Systems☆67Jun 24, 2026Updated last week
- Solution for Team Rayee, ranks 1st place in reconstructing partial textured objects (track 2), and 2nd overall in the SHARP Challenge at …☆21Oct 12, 2022Updated 3 years ago
- ☆32Jun 22, 2025Updated last year
- ☆12Aug 6, 2024Updated last year
- Low-Rank Rescaled Vision Transformer Fine-Tuning: A Residual Design Approach, CVPR 2024☆27Jul 25, 2024Updated last year
- Accepted By The 39th Annual Conference on Neural Information Processing Systems Datasets and Benchmarks Track☆25Nov 17, 2025Updated 7 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Source code of "FlowWalker: A Memory-efficient and High-performance GPU-based Dynamic Graph Random Walk Framework"☆11Oct 23, 2024Updated last year
- Non-Autoregressive Math Word Problem Solver with Unified Tree Structure☆12Jan 13, 2024Updated 2 years ago
- 投资组合评比器,基于python3.8和mysql8.0。由Concyclics和wingholy完成主要编程工作,可实现对于蛋卷基金和且慢基金平台投资组合信息的收集和对比。☆11Jun 18, 2021Updated 5 years ago
- [ACL 2025 Main] (🏆 Outstanding Paper Award) Rethinking the Role of Prompting Strategies in LLM Test-Time Scaling: A Perspective of Proba…☆18Aug 15, 2025Updated 10 months ago
- Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic☆32Feb 18, 2026Updated 4 months ago
- A user-friendly & efficient knowledge distillation framework for LLMs, supporting off-policy, on-policy (OPD), cross-tokenizer, multimoda…☆214Updated this week
- Pipeline for analyzing rare mutations in metagenome-assembled genomes☆10Apr 4, 2025Updated last year