juyongjiang/KaSA

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/juyongjiang/KaSA)

juyongjiang / KaSA

[ICLR'25] Code for KaSA, an official implementation of "KaSA: Knowledge-Aware Singular-Value Adaptation of Large Language Models"

☆22

Alternatives and similar repositories for KaSA

Users that are interested in KaSA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

juyongjiang / WebGen-R1
View on GitHub
[arXiv'26] Official Code for "WebGen-R1: Incentivizing Large Language Models to Generate Functional and Aesthetic Websites with Reinforce…
☆20Jul 20, 2026Updated last week
juyongjiang / ReflexiCoder
View on GitHub
[ACL'26] Official Code for "ReflexiCoder: Teaching Large Language Models to Self-Reflect on Generated Code and Self-Correct It via Reinfo…
☆22Apr 24, 2026Updated 3 months ago
gccnlp / Light-PEFT
View on GitHub
[ACL 2024 Findings] Light-PEFT: Lightening Parameter-Efficient Fine-Tuning via Early Pruning
☆13Sep 2, 2024Updated last year
juyongjiang / Awesome-ANCE
View on GitHub
Implementation of paper "Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval"
☆17Jan 10, 2022Updated 4 years ago
DSA-MLOPS / main
View on GitHub
☆18Apr 19, 2023Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
DSA-MLOPS / UFUG1601
View on GitHub
☆19Nov 16, 2023Updated 2 years ago
wutaiqiang / MoSLoRA
View on GitHub
☆126Jul 6, 2024Updated 2 years ago
DSA-MLOPS / DSAA6000I
View on GitHub
☆28Jun 9, 2024Updated 2 years ago
carted / processing-text-data
View on GitHub
Presents an optimized Apache Beam pipeline for generating sentence embeddings (runnable on Cloud Dataflow).
☆20Mar 7, 2022Updated 4 years ago
juyongjiang / TrendGCN
View on GitHub
[CIKM 2023] This is the official source code of "TrendGCN: Enhancing the Robustness via Adversarial Learning and Joint Spatial-Temporal E…
☆51Aug 11, 2023Updated 2 years ago
deep-diver / janus
View on GitHub
generate synthetic data for LLM fine-tuning in arbitrary situations within systematic way
☆22Mar 18, 2024Updated 2 years ago
lime-RL / DCPO
View on GitHub
DCPO: Dynamic Adaptive Clipping for RL
☆49Apr 1, 2026Updated 3 months ago
sufenlp / MiLoRA
View on GitHub
[NAACL 2025] MiLoRA: Harnessing Minor Singular Components for Parameter-Efficient LLM Finetuning
☆21May 31, 2025Updated last year
LaVi-Lab / FTTT
View on GitHub
[ACL 2025] Official code for ''Learning to Reason from Feedback at Test-Time''.
☆13May 16, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Clin0212 / HydraLoRA
View on GitHub
[NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning
☆239Dec 3, 2024Updated last year
flash-bon / flash-bon
View on GitHub
(ECCV 2026): Official code for Flash-BoN: Instant Drafts for Inference-Time Scaling in Diffusion Models
☆18Jul 9, 2026Updated 2 weeks ago
sayakpaul / deploy-hf-tf-vision-models
View on GitHub
This repository shows various ways of deploying a vision model (TensorFlow) from 🤗 Transformers.
☆30Aug 22, 2022Updated 3 years ago
JackShDr / InfluentialRS
View on GitHub
Implementations of Influential Recommender System
☆12Oct 29, 2024Updated last year
TannerGilbert / Run-TFLITE-models-on-the-web
View on GitHub
Run TFLITE models on the web
☆13Jan 2, 2022Updated 4 years ago
Infini-AI-Lab / S2FT
View on GitHub
☆19Jan 3, 2025Updated last year
lamm-mit / Cephalo-Phi-3-Vision-MoE
View on GitHub
☆17Jun 9, 2024Updated 2 years ago
deep-diver / Continuous-Adaptation-for-Machine-Learning-System-to-Data-Changes
View on GitHub
https://blog.tensorflow.org/2021/12/continuous-adaptation-for-machine.html
☆30Dec 10, 2021Updated 4 years ago
tianyi-lab / RoMA
View on GitHub
Code for "Routing Manifold Alignment Improves Generalization of Mixture-of-Experts LLMs"
☆19Nov 6, 2025Updated 8 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Evanwu1125 / LiteCoT
View on GitHub
☆17Jun 10, 2025Updated last year
czhang024 / ParallelControl
View on GitHub
ICML-2025 (Spotlight) "From Weight-Based to State-Based Fine-Tuning: Further Memory Reduction on LoRA with Parallel Control"
☆15May 7, 2026Updated 2 months ago
NTDXYG / COTTON
View on GitHub
Data and code for "Chain-of-Thought in Neural Code Generation: From and For Lightweight Language Models", which accepted in TSE.
☆15Jul 3, 2024Updated 2 years ago
Facico / GOAT-PEFT
View on GitHub
[ICML2025] Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment
☆147Nov 9, 2025Updated 8 months ago
ds4an / CoDas4CG
View on GitHub
Contests based Dataset for Code Generation
☆13Dec 11, 2022Updated 3 years ago
guoyang9 / PELA
View on GitHub
PELA: Learning Parameter-Efficient Models with Low-Rank Approximation [CVPR 2024]
☆19Apr 14, 2024Updated 2 years ago
FeiyuZhang98 / IncreLoRA
View on GitHub
☆36Aug 23, 2023Updated 2 years ago
XingruiWang / DynSuperCLEVR
View on GitHub
A video question answering dataset that focuses on the dynamics properties of objects (velocity, acceleration) and their collisions withi…
☆20Apr 23, 2025Updated last year
deep-diver / LLM-Serve
View on GitHub
This repository provides a framework to serve LLM(Large Language Model) based applications such as Chatbot.
☆18Apr 20, 2023Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
mengxiayu / LLMSuperWeight
View on GitHub
Code for studying the super weight in LLM
☆124Dec 3, 2024Updated last year
deep-diver / Vid2Persona
View on GitHub
This project breathes life into video characters by using AI to describe their personality and then chat with you as them.
☆48Mar 12, 2024Updated 2 years ago
deep-diver / hllama
View on GitHub
hllama is a library which aims to provide a set of utility tools for large language models.
☆10Apr 16, 2024Updated 2 years ago
juyongjiang / awesome-multi-modal-recommendation
View on GitHub
A curated list of awesome multi-modal recommendation.
☆10Mar 16, 2022Updated 4 years ago
deep-diver / keras-sd-serving
View on GitHub
showing various ways to serve Keras based stable diffusion
☆111Feb 28, 2023Updated 3 years ago
deep-diver / image_search_with_natural_language
View on GitHub
Application for searching images from natural language queries
☆47Dec 10, 2021Updated 4 years ago
kernelmachine / balanced-kmeans
View on GitHub
☆21Apr 16, 2024Updated 2 years ago