LoFiT: Localized Fine-tuning on LLM Representations
☆45 · Jan 15, 2025 · Updated last year
Alternatives and similar repositories for lo-fit
Users that are interested in lo-fit are comparing it to the libraries listed below.
- Implementation code for ACL2024: Advancing Parameter Efficiency in Fine-tuning via Representation Editing ☆15 · Apr 20, 2024 · Updated 2 years ago
- ☆10 · Apr 16, 2024 · Updated 2 years ago
- ☆56 · Aug 10, 2024 · Updated last year
- [EMNLP 2024] Quantize LLM to extremely low-bit, and finetune the quantized LLMs ☆15 · Jul 18, 2024 · Updated last year
- Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging" ☆33 · Nov 4, 2024 · Updated last year
- ☆35 · Feb 10, 2025 · Updated last year
- Analyzing LLM Alignment via Token distribution shift ☆18 · Jan 26, 2024 · Updated 2 years ago
- ☆20 · Oct 13, 2024 · Updated last year
- Python library for Adversarial ML Evaluation ☆26 · Jul 14, 2025 · Updated 9 months ago
- [ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibrati… ☆45 · Jun 30, 2024 · Updated last year
- Code and datasets for the EMNLP 2020 paper "Calibration of Pre-trained Transformers" ☆60 · Jun 12, 2023 · Updated 2 years ago
- ☆13 · Sep 8, 2024 · Updated last year
- ☆19 · Jan 3, 2025 · Updated last year
- Official Code Repository for LM-Steer Paper: "Word Embeddings Are Steers for Language Models" (ACL 2024 Outstanding Paper Award) ☆144 · Jul 13, 2025 · Updated 9 months ago
- Inference-Time Intervention: Eliciting Truthful Answers from a Language Model ☆573 · Jan 28, 2025 · Updated last year
- QRHead: Query-Focused Retrieval Heads Improve Long-Context Reasoning and Re-ranking ☆38 · Jan 20, 2026 · Updated 3 months ago
- Code for the NAACL 2024 HCI+NLP Workshop paper "LLMCheckup: Conversational Examination of Large Language Models via Interpretability Tool… ☆13 · Mar 24, 2024 · Updated 2 years ago
- ☆12 · Jul 4, 2024 · Updated last year
- ACL 2022 (findings): A Sentence is Worth 128 Pseudo Tokens: A Semantic-Aware Contrastive Learning Framework for Sentence Embeddings ☆18 · Mar 23, 2022 · Updated 4 years ago
- [NeurIPS25 Spotlight] EMPO, A Fully Unsupervised RLVR Method ☆98 · Nov 24, 2025 · Updated 4 months ago
- Mamba support for transformer lens ☆19 · Sep 17, 2024 · Updated last year
- [ICLR 2025] Official repository for the paper "Influence-Guided Diffusion for Dataset Distillation". ☆15 · Feb 12, 2025 · Updated last year
- ☆240 · Nov 22, 2024 · Updated last year
- PathPiece tokenizer ☆14 · Nov 10, 2024 · Updated last year
- ☆20 · Feb 2, 2026 · Updated 2 months ago
- Multi-dimensional analysis of orthogonal safety directions in LLM alignment ☆22 · Mar 20, 2025 · Updated last year
- ☆14 · Jun 24, 2024 · Updated last year
- ☆26 · Nov 23, 2023 · Updated 2 years ago
- ☆34 · Nov 7, 2024 · Updated last year
- [COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoni… ☆15 · Oct 31, 2025 · Updated 5 months ago
- Stanford NLP Python library for benchmarking the utility of LLM interpretability methods ☆180 · Mar 12, 2026 · Updated last month
- A generalized framework for subspace tuning methods in parameter efficient fine-tuning. ☆181 · Jan 29, 2026 · Updated 2 months ago
- Code for unsupervised entity resolution ☆10 · Apr 26, 2019 · Updated 6 years ago
- Preprint: Asymmetry in Low-Rank Adapters of Foundation Models ☆39 · Feb 27, 2024 · Updated 2 years ago
- ☆253 · Feb 22, 2024 · Updated 2 years ago
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024. ☆103 · Oct 28, 2024 · Updated last year
- A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc. ☆301 · Jan 22, 2026 · Updated 2 months ago
- ☆16 · Oct 26, 2018 · Updated 7 years ago
- [ICLR 2025] ELICIT: LLM Augmentation Via External In-context Capability ☆14 · Mar 11, 2025 · Updated last year