CERT-Lab / lora-sb
Initialization using Update Approximation is a Silver Bullet for Extremely Efficient Low-Rank Fine-Tuning
☆47 Updated 3 months ago
Alternatives and similar repositories for lora-sb
Users who are interested in lora-sb are comparing it to the libraries listed below
- (CVPR 2025) Official implementation to DELT: A Simple Diversity-driven EarlyLate Training for Dataset Distillation which outperforms SOTA… ☆22 Updated last month
- [COLING'25] Exploring Concept Depth: How Large Language Models Acquire Knowledge at Different Layers? ☆78 Updated 4 months ago
- Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps" ☆126 Updated 9 months ago
- What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective ☆64 Updated 3 months ago
- Code for Heima ☆44 Updated last month
- Auto Interpretation Pipeline and many other functionalities for Multimodal SAE Analysis. ☆132 Updated 4 months ago
- [ACL2025] Unsolvable Problem Detection: Robust Understanding Evaluation for Large Multimodal Models ☆77 Updated last week
- [Preprint 2025] Thinkless: LLM Learns When to Think ☆125 Updated this week
- Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging" ☆26 Updated 7 months ago
- ☆37 Updated 7 months ago
- [ICLR 2025] Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision ☆63 Updated 10 months ago
- [NAACL 2025 Oral] Multimodal Needle in a Haystack (MMNeedle): Benchmarking Long-Context Capability of Multimodal Large Language Models ☆44 Updated last month
- The official implementation of "Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks" ☆53 Updated 2 weeks ago
- This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM ☆58 Updated last year
- Multimodal language model benchmark, featuring challenging examples ☆168 Updated 5 months ago
- Code & Dataset for Paper: "Distill Visual Chart Reasoning Ability from LLMs to MLLMs" ☆53 Updated 7 months ago
- Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025 ☆26 Updated last month
- Official Code for paper "Towards Efficient and Effective Unlearning of Large Language Models for Recommendation" (Frontiers of Computer S… ☆37 Updated 10 months ago
- [EMNLP 2024] Official PyTorch implementation code for realizing the technical part of Traversal of Layers (TroL) presenting new propagati… ☆97 Updated 11 months ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains ☆43 Updated 3 weeks ago
- [ACL 2024] Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language Models ☆89 Updated last year
- Matryoshka Multimodal Models ☆107 Updated 4 months ago
- [ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight) ☆85 Updated 7 months ago
- ☆142 Updated last year
- Official implementation of MAIA, A Multimodal Automated Interpretability Agent ☆81 Updated 3 months ago
- PyTorch code for "ADEM-VL: Adaptive and Embedded Fusion for Efficient Vision-Language Tuning" ☆20 Updated 7 months ago
- Repo for "Z1: Efficient Test-time Scaling with Code" ☆59 Updated last month
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models ☆114 Updated last year
- The official implementation of Cross-Task Experience Sharing (COPS) ☆22 Updated 7 months ago
- Code for "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing" ☆15 Updated last month