CERT-Lab/lora-sb

Initialization using Update Approximation is a Silver Bullet for Extremely Efficient Low-Rank Fine-Tuning

☆52

Alternatives and similar repositories for lora-sb

Users that are interested in lora-sb are comparing it to the libraries listed below.

CERT-Lab / fed-sb
(TMLR J2C Certification) Fed-SB: A Silver Bullet for Extreme Communication Efficiency and Performance in (Private) Federated LoRA Fine-Tu…
☆27 · Oct 4, 2025 · Updated 9 months ago
CERT-Lab / abba
(ICLR '26) ABBA-Adapters: Efficient and Expressive Fine-Tuning of Foundation Models
☆22 · Sep 25, 2025 · Updated 9 months ago
VILA-Lab / DELT
(CVPR 2025) Official implementation of DELT: A Simple Diversity-driven EarlyLate Training for Dataset Distillation which outperforms SOTA…
☆28 · Aug 23, 2025 · Updated 10 months ago
mwatkins1970 / SAE_Feature_Interpretability_Tool
A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAEs trained by Joseph Bloom o…
☆19 · Oct 4, 2024 · Updated last year
MiuLab / PairDistill
Source code of our paper "PairDistill: Pairwise Relevance Distillation for Dense Retrieval", EMNLP 2024 Main.
☆22 · Nov 28, 2024 · Updated last year
QuixiAI / dolphin-logger
☆107 · Nov 1, 2025 · Updated 8 months ago
CERT-Lab / fedex-lora
(ACL '25 - Oral) FedEx-LoRA: Exact Aggregation for Federated and Efficient Fine-Tuning of Foundation Models
☆36 · Oct 4, 2025 · Updated 9 months ago
kkyuhun94 / dalda
[ECCV'24 Workshops Oral] DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling
☆32 · Feb 6, 2026 · Updated 5 months ago
UtkarshSaxena1 / EigenAttn
☆20 · Oct 13, 2024 · Updated last year
MingLiiii / Layer_Gradient
[ACL'25 Oral] What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective
☆75 · Jun 25, 2025 · Updated last year
Luckfort / CD
[COLING'25] Exploring Concept Depth: How Large Language Models Acquire Knowledge at Different Layers?
☆82 · Jan 22, 2025 · Updated last year
66RING / CritiPrefill
Code repo for "CritiPrefill: A Segment-wise Criticality-based Approach for Prefilling Acceleration in LLMs".
☆17 · Sep 15, 2024 · Updated last year
EsmaeilNarimissa / aws-sft-grpo-budget-llm-finetune
☆19 · May 17, 2025 · Updated last year
BLEACH366 / P2DFlow
P2DFlow: A Protein Ensemble Generative Model with SE(3) Flow Matching
☆43 · May 22, 2025 · Updated last year
git-disl / Virus
Official code for the paper "Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation"
☆56 · Feb 2, 2025 · Updated last year
UKPLab / arxiv2025-inherent-limits-plms
Code repository for the paper "The Inherent Limits of Pretrained LLMs: The Unexpected Convergence of Instruction Tuning and In-Context Le…
☆14 · Jan 16, 2025 · Updated last year
justarter / E2URec
Official code for the paper "Towards Efficient and Effective Unlearning of Large Language Models for Recommendation" (Frontiers of Computer S…
☆38 · Jul 19, 2024 · Updated 2 years ago
UKPLab / 5pils
Code associated with the EMNLP 2024 Main paper: "Image, tell me your story!" Predicting the original meta-context of visual misinformatio…
☆45 · Dec 6, 2025 · Updated 7 months ago
ZBox1005 / CoT-UQ
[ACL 2025] "CoT-UQ: Improving Response-wise Uncertainty Quantification in LLMs with Chain-of-Thought"
☆17 · Apr 3, 2025 · Updated last year
Geaming2002 / Ruler
Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models
☆41 · Sep 30, 2024 · Updated last year
sunnynexus / RetroLLM
[ACL 2025] RetroLLM: Empowering LLMs to Retrieve Fine-grained Evidence within Generation
☆117 · Jan 23, 2025 · Updated last year
ysh-1998 / CoWPiRec
The official implementation of Collaborative Word-based Pre-trained Item Representation for Transferable Recommendation.
☆25 · Jan 30, 2024 · Updated 2 years ago
hetailang / SqueezeAttention
☆37 · Oct 10, 2024 · Updated last year
rosewang2008 / posr
Problem-Oriented Segmentation and Retrieval (EMNLP 2024 Findings)
☆34 · Nov 12, 2024 · Updated last year
shuzhangzhong / HybriMoE-Preview
☆17 · Apr 9, 2025 · Updated last year
maclong01 / DeBiFormer
[ACCV 2024] Official code for "DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention"
☆33 · Jan 8, 2025 · Updated last year
cycloarcane / Gathered-Paper-Resources
In this repo I will upload aggregated resources I get from my daily reading of academic papers.
☆15 · Apr 9, 2026 · Updated 3 months ago
shenao-zhang / reward-augmented-preference
The official implementation of Preference Data Reward-Augmentation.
☆18 · May 1, 2025 · Updated last year
open-compass / Ada-LEval
The official implementation of "Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks"
☆56 · May 22, 2025 · Updated last year
prs-eth / LoRA-Ensemble
LoRA-Ensemble: Efficient Uncertainty Modelling for Self-attention Networks
☆55 · Mar 7, 2026 · Updated 4 months ago
TencentARC / pi-Tuning
Official code for "pi-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation", ICML 2023.
☆33 · Jul 21, 2023 · Updated 3 years ago
rayleizhu / GLMix
[NeurIPS 2024] Official code release for our paper "Revisiting the Integration of Convolution and Attention for Vision Backbone".
☆43 · Jan 21, 2025 · Updated last year
AgnostiqHQ / multi-agent-llm
Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)
☆132 · Feb 10, 2025 · Updated last year
HanLingsgjk / UnifiedGeneralization
Code for Self-Assessed Generation and CVPR2024 PAPER ADFACTORY
☆21 · Jul 28, 2025 · Updated 11 months ago
moucheng2017 / SOP-LVM-ICL-Ensemble
[NeurIPS VLM workshop 2024] In-Context Ensemble Learning from Pseudo Labels Improves Video-Language Models for Low-Level Workflow Underst…
☆23 · Mar 16, 2025 · Updated last year
fferflo / statewide-visual-geolocalization
Statewide Visual Geolocalization in the Wild (ECCV 2024)
☆75 · Dec 2, 2024 · Updated last year
giangdip2410 / HyperRouter
Code for the paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork"
☆33 · Nov 29, 2023 · Updated 2 years ago
Quehry / HelloBench
HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models
☆60 · Nov 26, 2024 · Updated last year
ddw2AIGROUP2CQUPT / PA-LLaVA
A Large Language-Vision Assistant for Pathology Image Understanding (BIBM-2024 & Journal of Artificial Intelligence Review 2025)
☆65 · Jun 18, 2025 · Updated last year