song-wx / SIFT
[ICML2024 Spotlight] Fine-Tuning Pre-trained Large Language Models Sparsely
☆24 · Jun 26, 2024 · Updated last year
Alternatives and similar repositories for SIFT
Users who are interested in SIFT are comparing it to the libraries listed below.
- ☆19 · Jan 3, 2025 · Updated last year
- Compression for Foundation Models ☆35 · Jul 21, 2025 · Updated 6 months ago
- BlockRank makes LLMs efficient and scalable for RAG and in-context ranking ☆41 · Dec 12, 2025 · Updated 2 months ago
- Kinetics: Rethinking Test-Time Scaling Laws ☆86 · Jul 11, 2025 · Updated 7 months ago
- Latent Large Language Models ☆19 · Aug 24, 2024 · Updated last year
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders ☆18 · May 23, 2025 · Updated 8 months ago
- Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval ☆15 · Jan 16, 2024 · Updated 2 years ago
- ☆35 · Mar 12, 2025 · Updated 11 months ago
- The code for paper "EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning" ☆37 · Oct 1, 2025 · Updated 4 months ago
- This repository contains code for the MicroAdam paper. ☆22 · Dec 14, 2024 · Updated last year
- Code for "RSQ: Learning from Important Tokens Leads to Better Quantized LLMs" ☆20 · Jun 11, 2025 · Updated 8 months ago
- Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-Ranking ☆25 · Apr 4, 2025 · Updated 10 months ago
- Code to enable layer-level steering in LLMs using sparse auto encoders ☆29 · Sep 18, 2025 · Updated 4 months ago
- Starter template for your ML/AI projects (uv package manager, RestAPI with FastAPI and Dockerfile support) ☆32 · Jan 13, 2025 · Updated last year
- NeurIPS'24 - LLM Safety Landscape ☆39 · Oct 21, 2025 · Updated 3 months ago
- Checkpointable dataset utilities for foundation model training ☆32 · Jan 29, 2024 · Updated 2 years ago
- Implementation for the paper: CMoE: Fast Carving of Mixture-of-Experts for Efficient LLM Inference ☆34 · Mar 6, 2025 · Updated 11 months ago
- ☆30 · Jul 22, 2024 · Updated last year
- ☆31 · Nov 11, 2024 · Updated last year
- Agentic Learning Powered by AWorld ☆88 · Feb 7, 2026 · Updated last week
- ☆59 · Nov 17, 2025 · Updated 2 months ago
- Repository of IPBench ☆19 · Jan 4, 2026 · Updated last month
- Code for "Everybody Prune Now: Structured Pruning of LLMs with only Forward Passes" ☆30 · Mar 28, 2024 · Updated last year
- Build an AI bot in Discord to serve user's personalized reports on what's up in tech ☆28 · Sep 14, 2025 · Updated 5 months ago
- Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024] ☆147 · Sep 20, 2024 · Updated last year
- [ICML 2025 Spotlight] ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference ☆283 · May 1, 2025 · Updated 9 months ago
- Linear Relational Embeddings (LREs) and Linear Relational Concepts (LRCs) for LLMs in PyTorch ☆10 · Aug 7, 2024 · Updated last year
- GBM implementation on Legate ☆14 · Jan 28, 2026 · Updated 2 weeks ago
- AI chaos reasoning persona ☆28 · Updated this week
- Supporting code for "LLMs for your iPhone: Whole-Tensor 4 Bit Quantization" ☆11 · Mar 31, 2024 · Updated last year
- Code for experiments on self-prediction as a way to measure introspection in LLMs ☆16 · Dec 10, 2024 · Updated last year
- my profile readme ☆14 · Updated this week
- This is the class in matlab for convex optimization algorithms ☆10 · Nov 19, 2023 · Updated 2 years ago
- Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity ☆22 · Aug 28, 2025 · Updated 5 months ago
- Reference implementation of Thin and Deep Gaussian Processes (NeurIPS 2023) ☆13 · Nov 25, 2024 · Updated last year
- TSDG: An efficient index graph for graph-based nearest neighbor search ☆10 · Jul 14, 2022 · Updated 3 years ago
- Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization ☆81 · Dec 25, 2025 · Updated last month
- Official code for PLoP ☆17 · Jun 30, 2025 · Updated 7 months ago
- ☆12 · Jul 8, 2024 · Updated last year