song-wx/SIFT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/song-wx/SIFT)

song-wx / SIFT

[ICML2024 Spotlight] Fine-Tuning Pre-trained Large Language Models Sparsely

☆24

Alternatives and similar repositories for SIFT

Users that are interested in SIFT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zcli-charlie / BatGPT
View on GitHub
Bidirectional Autoregressive Talker from Generative Pre-trained Transformer
☆39Jul 27, 2023Updated 3 years ago
Infini-AI-Lab / S2FT
View on GitHub
☆19Jan 3, 2025Updated last year
eth-easl / deltazip
View on GitHub
Compression for Foundation Models
☆36Jul 21, 2025Updated last year
lzhangbv / eva
View on GitHub
[ICLR 2023] Eva: Practical Second-order Optimization with Kronecker-vectorized Approximation
☆12Jul 31, 2023Updated 2 years ago
shengliu66 / FractionalReason
View on GitHub
Official github repo for "Fractional Reasoning via Latent Steering Vectors Improves Inference Time Compute"
☆17Jun 30, 2025Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
vectozavr / llm-hessian
View on GitHub
Using PyTorch autograd to compute Hessian of Perplexity for Large Language Models
☆29Apr 17, 2025Updated last year
XuZikang / Awesome-MedIA-Fairness
View on GitHub
A collection of papers in fairness of medical image analysis
☆13Jun 16, 2023Updated 3 years ago
IST-DASLab / MicroAdam
View on GitHub
This repository contains code for the MicroAdam paper.
☆21Dec 14, 2024Updated last year
feizc / Visual-ChatGLM
View on GitHub
Open ChatGLM Eyes to See the World
☆13Mar 30, 2023Updated 3 years ago
TianjinYellow / SPAM-Optimizer
View on GitHub
☆36Mar 12, 2025Updated last year
fmfi-compbio / admm-pruning
View on GitHub
☆30Jul 22, 2024Updated 2 years ago
ElunaMamka / NG-Midiformer
View on GitHub
Official code of "N-Gram Unsupervised Compoundation and Feature Injection for Better Symbolic Music Understanding"
☆14Apr 10, 2024Updated 2 years ago
Ethos-lab / ares
View on GitHub
A System-Oriented Wargame Framework for Adversarial ML
☆10Apr 24, 2023Updated 3 years ago
whunextgen / LLMindCraft
View on GitHub
Shaping Language Models with Cognitive Insights
☆15Feb 29, 2024Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
elsa66666 / MentraSuite
View on GitHub
psychology reasoning llm
☆17Dec 16, 2025Updated 7 months ago
stanfordnlp / multi-distribution-retrieval
View on GitHub
Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval
☆17Jan 16, 2024Updated 2 years ago
webis-de / set-encoder
View on GitHub
Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders
☆19May 23, 2025Updated last year
stzhang-patrick / ArcMMLU
View on GitHub
☆16Feb 2, 2024Updated 2 years ago
Infini-AI-Lab / Kinetics
View on GitHub
Kinetics: Rethinking Test-Time Scaling Laws
☆87Jul 11, 2025Updated last year
lartpang / RunIt
View on GitHub
A simple program scheduler for your code on different devices.
☆12Mar 8, 2026Updated 4 months ago
sanketx / AL-foundation-models
View on GitHub
Active Learning in the era of Foundation Models
☆14Apr 16, 2025Updated last year
ScalingIntelligence / caesar
View on GitHub
Throughput-oriented multi-turn inference engine for KernelBench [ICML '25]
☆24May 27, 2025Updated last year
mzalaya / screenkhorn
View on GitHub
Code for NeurIPS 2019 paper "Screening Sinkhorn Algorithm for Regularized Optimal Transport"
☆10Feb 10, 2020Updated 6 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
ubc-tea / SGPT
View on GitHub
The official implantation of SGPT (CVPR2024)
☆17Jul 15, 2024Updated 2 years ago
megagonlabs / holobench
View on GitHub
🫧 Code for Holistic Reasoning with Long-Context LMs: A Benchmark for Database Operations on Massive Textual Data (Maekawa*, Iso* et al.…
☆12Feb 25, 2025Updated last year
aladinD / SafeMERGE
View on GitHub
Code for SafeMERGE (ICLR 2025).
☆15Apr 1, 2025Updated last year
ScalingIntelligence / CATS
View on GitHub
☆33Nov 11, 2024Updated last year
MedICL-VU / COLosSAL
View on GitHub
Official implementation of COLosSAL [MICCAI 2023]
☆15Jul 22, 2023Updated 3 years ago
dirkiedai / sk-mt
View on GitHub
This is the official code for our paper "Simple and Scalable Nearest Neighbor Machine Translation" (ICLR 2023).
☆15Nov 22, 2023Updated 2 years ago
princeton-nlp / unintentional-unalignment
View on GitHub
[ICLR 2025] Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization
☆32Jan 7, 2026Updated 6 months ago
zhuhanqing / Lightening-Transformer-AE
View on GitHub
Artifact evaluation for HPCA'24 paper Lightening-Transformer: A Dynamically-operated Optically-interconnected Photonic Transformer Accele…
☆11Mar 3, 2024Updated 2 years ago
SteveKGYang / SCCL
View on GitHub
Pytorch code for TAC accepted paper: "Cluster-Level Contrastive Learning for Emotion Recognition in Conversations"
☆26Apr 16, 2023Updated 3 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
MuLabPKU / PiSSA
View on GitHub
PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language Models(NeurIPS 2024 Spotlight)
☆429Jun 30, 2025Updated last year
tanganke / pareto_set_learning
View on GitHub
Code for paper "Towards Efficient Pareto Set Approximation via Weight-Ensembling Mixture of Experts"
☆11Sep 13, 2024Updated last year
SteveKGYang / MetaAligner
View on GitHub
Models, data, and codes for the paper: MetaAligner: Towards Generalizable Multi-Objective Alignment of Language Models
☆24Sep 26, 2024Updated last year
ruz048 / AutoLoRA
View on GitHub
☆10Apr 16, 2024Updated 2 years ago
uclaml / Frank-Wolfe-AdvML
View on GitHub
A Frank-Wolfe Framework for Efficient and Effective Adversarial Attacks (AAAI'20)
☆11Jun 10, 2020Updated 6 years ago
Outsider565 / LoRA-GA
View on GitHub
☆219Nov 25, 2025Updated 8 months ago
riejohnson / cfg-gan
View on GitHub
CFG-GAN: Composite functional gradient learning of generative adversarial models
☆15Jul 9, 2020Updated 6 years ago