[ICLR'25] Code for KaSA, an official implementation of "KaSA: Knowledge-Aware Singular-Value Adaptation of Large Language Models"
☆22Jan 16, 2025Updated last year
Alternatives and similar repositories for KaSA
Users that are interested in KaSA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ACL 2024 Findings] Light-PEFT: Lightening Parameter-Efficient Fine-Tuning via Early Pruning☆13Sep 2, 2024Updated last year
- [ACL 2025] Official code for ''Learning to Reason from Feedback at Test-Time''.☆13May 16, 2025Updated last year
- [CIKM 2023] This is the official source code of "TrendGCN: Enhancing the Robustness via Adversarial Learning and Joint Spatial-Temporal E…☆51Aug 11, 2023Updated 2 years ago
- CorDA: Context-Oriented Decomposition Adaptation of Large Language Models for task-aware parameter-efficient fine-tuning(NeurIPS 2024)☆56Jan 13, 2025Updated last year
- [NAACL 2025] MiLoRA: Harnessing Minor Singular Components for Parameter-Efficient LLM Finetuning☆20May 31, 2025Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- generate synthetic data for LLM fine-tuning in arbitrary situations within systematic way☆22Mar 18, 2024Updated 2 years ago
- Github repo for ICLR-2025 paper, Fine-tuning Large Language Models with Sparse Matrices☆26Feb 2, 2026Updated 4 months ago
- This repository shows various ways of deploying a vision model (TensorFlow) from 🤗 Transformers.☆30Aug 22, 2022Updated 3 years ago
- Awesome-Low-Rank-Adaptation☆133Oct 13, 2024Updated last year
- ☆21Feb 5, 2024Updated 2 years ago
- Official repository for FLAME-MoE: A Transparent End-to-End Research Platform for Mixture-of-Experts Language Models☆39Sep 19, 2025Updated 8 months ago
- 🎨 把时间还给逻辑,用 AI 绘就你的科研故事☆174Jun 3, 2026Updated 2 weeks ago
- Run TFLITE models on the web☆13Jan 2, 2022Updated 4 years ago
- This repository contains the code and data for the paper "VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception o…☆29Jul 9, 2025Updated 11 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- https://blog.tensorflow.org/2021/12/continuous-adaptation-for-machine.html☆30Dec 10, 2021Updated 4 years ago
- 一个开源数学大模型项目,旨在探索大模型是否具有数学创造能力,以及大模型在前沿数学研究中的潜在能力。☆20Mar 19, 2026Updated 2 months ago
- ☆11Jul 30, 2025Updated 10 months ago
- Code for studying the super weight in LLM☆124Dec 3, 2024Updated last year
- [ICML 2024] When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models☆35Jun 12, 2024Updated 2 years ago
- ☆31Jun 6, 2025Updated last year
- End-to-end pipeline with TFX to train and deploy a BERT model for sentiment analysis.☆43Oct 21, 2023Updated 2 years ago
- This repository provides a framework to serve LLM(Large Language Model) based applications such as Chatbot.☆18Apr 20, 2023Updated 3 years ago
- [ACL 2025] Analyzing LLMs' Multilingual Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations☆19Oct 18, 2025Updated 7 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Application for searching images from natural language queries☆46Dec 10, 2021Updated 4 years ago
- [ICLR 2025] Weighted-Reward Preference Optimization for Implicit Model Fusion☆14Mar 17, 2025Updated last year
- Check local static links and online links fast and in parallel☆13Dec 1, 2020Updated 5 years ago
- DMax: Aggressive Parallel Decoding for dLLMs☆126May 25, 2026Updated 3 weeks ago
- ☆17Jan 19, 2026Updated 4 months ago
- Build compute kernels and load them from the Hub.☆691Updated this week
- UQ: Assessing Language Models on Unsolved Questions☆30Aug 26, 2025Updated 9 months ago
- ☆15Jun 30, 2023Updated 2 years ago
- Code for the paper "Cottention: Linear Transformers With Cosine Attention"☆20Nov 15, 2025Updated 7 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆21Mar 18, 2026Updated 3 months ago
- Source Code & Datasets for "Vertical Federated Principal Component Analysis and Its Kernel Extension on Feature-wise Distributed Data"☆12May 20, 2022Updated 4 years ago
- Code Generation Based High Speed Data Serialization Tool☆12Dec 27, 2022Updated 3 years ago
- [EMNLP 2024] RoLoRA: Fine-tuning Rotated Outlier-free LLMs for Effective Weight-Activation Quantization☆40Sep 24, 2024Updated last year
- This repository contains all code examples for my TensorFlow World talk about "Advanced model deployments with TensorFlow Serving"☆17Dec 8, 2022Updated 3 years ago
- A LLM Multi-Agent Framework toward Ultra Large-Scale Code Generation and Optimization☆18Dec 22, 2024Updated last year
- Notebooks to demonstrate TimmWrapper☆16Jan 16, 2025Updated last year