[ICML2024 Spotlight] Fine-Tuning Pre-trained Large Language Models Sparsely
☆24 · Jun 26, 2024 · Updated last year
Alternatives and similar repositories for SIFT
Users who are interested in SIFT are comparing it to the libraries listed below.
- ☆19 · Jan 3, 2025 · Updated last year
- BlockRank makes LLMs efficient and scalable for RAG and in-context ranking ☆41 · Dec 12, 2025 · Updated 2 months ago
- Kinetics: Rethinking Test-Time Scaling Laws ☆85 · Jul 11, 2025 · Updated 7 months ago
- [ICLR 2023] Eva: Practical Second-order Optimization with Kronecker-vectorized Approximation ☆12 · Jul 31, 2023 · Updated 2 years ago
- Latent Large Language Models ☆19 · Aug 24, 2024 · Updated last year
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders ☆18 · May 23, 2025 · Updated 9 months ago
- ☆34 · Mar 12, 2025 · Updated 11 months ago
- The code for paper "EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning" ☆37 · Oct 1, 2025 · Updated 5 months ago
- This repository contains code for the MicroAdam paper. ☆21 · Dec 14, 2024 · Updated last year
- Multi-Domain Image-to-Image Translation using StarGAN with Max Sliced Wasserstein Distance. ☆14 · May 17, 2019 · Updated 6 years ago
- Low-Rank Llama Custom Training ☆23 · Mar 27, 2024 · Updated last year
- Code for "RSQ: Learning from Important Tokens Leads to Better Quantized LLMs" ☆21 · Jun 11, 2025 · Updated 8 months ago
- Starter template for your ML/AI projects (uv package manager, RestAPI with FastAPI and Dockerfile support) ☆33 · Jan 13, 2025 · Updated last year
- Code to enable layer-level steering in LLMs using sparse auto encoders ☆31 · Sep 18, 2025 · Updated 5 months ago
- NeurIPS'24 - LLM Safety Landscape ☆39 · Oct 21, 2025 · Updated 4 months ago
- Checkpointable dataset utilities for foundation model training ☆32 · Jan 29, 2024 · Updated 2 years ago
- ☆30 · Jul 22, 2024 · Updated last year
- ☆32 · Nov 11, 2024 · Updated last year
- ☆59 · Nov 17, 2025 · Updated 3 months ago
- Agentic Learning Powered by AWorld ☆90 · Feb 13, 2026 · Updated 3 weeks ago
- Repository of IPBench ☆19 · Jan 4, 2026 · Updated 2 months ago
- code for paper "RankingGPT: Empowering Large Language Models in Text Ranking with Progressive Enhancement" ☆34 · Jan 9, 2024 · Updated 2 years ago
- Build an AI bot in Discord to serve user's personalized reports on what's up in tech ☆28 · Sep 14, 2025 · Updated 5 months ago
- Code for "Everybody Prune Now: Structured Pruning of LLMs with only Forward Passes" ☆30 · Mar 28, 2024 · Updated last year
- [ICML 2025 Spotlight] ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference ☆283 · May 1, 2025 · Updated 10 months ago
- Code for experiments on self-prediction as a way to measure introspection in LLMs ☆16 · Dec 10, 2024 · Updated last year
- Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization ☆81 · Dec 25, 2025 · Updated 2 months ago
- ☆11 · Jul 17, 2023 · Updated 2 years ago
- This is the class in matlab for convex optimization algorithms ☆10 · Nov 19, 2023 · Updated 2 years ago
- GBM implementation on Legate ☆14 · Jan 28, 2026 · Updated last month
- rabitq rust implementation ☆10 · Feb 4, 2026 · Updated last month
- Supporting code for "LLMs for your iPhone: Whole-Tensor 4 Bit Quantization" ☆11 · Mar 31, 2024 · Updated last year
- my profile readme ☆14 · Updated this week
- Official code for PLoP ☆17 · Updated this week
- [NeurIPS 2025] Official code for "Tropical Attention: Neural Algorithmic Reasoning for Combinatorial Algorithms" ☆23 · Oct 23, 2025 · Updated 4 months ago
- Reference implementation of Thin and Deep Gaussian Processes (NeurIPS 2023) ☆14 · Nov 25, 2024 · Updated last year
- Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity ☆22 · Aug 28, 2025 · Updated 6 months ago
- ☆51 · Jun 21, 2025 · Updated 8 months ago
- [NeurIPS Spotlight 2025] Angles Don’t Lie: Unlocking Training-Efficient RL Through the Model’s Own Signals. ☆81 · Sep 26, 2025 · Updated 5 months ago