yfqiu-nlp / sea-llmView external linksLinks
Code for the paper "Spectral Editing of Activations for Large Language Model Alignments"
☆29Dec 20, 2024Updated last year
Alternatives and similar repositories for sea-llm
Users that are interested in sea-llm are comparing it to the libraries listed below
Sorting:
- Materials for paper "Are Large Language Models Temporally Grounded?"☆13Nov 16, 2023Updated 2 years ago
- IsoBN: Fine-Tuning BERT with Isotropic Batch Normalization☆12Nov 23, 2021Updated 4 years ago
- ☆12Jul 30, 2025Updated 6 months ago
- [NAACL 24 Oral] LoRETTA: Low-Rank Economic Tensor-Train Adaptation for Ultra-Low-Parameter Fine-Tuning of Large Language Models☆39Jan 9, 2025Updated last year
- Compressible Dynamics in Deep Overparameterized Low-Rank Learning & Adaptation (ICML'24 Oral)☆13Jul 22, 2024Updated last year
- ☆24Jul 25, 2024Updated last year
- ☆23Feb 5, 2026Updated last week
- ☆26Nov 23, 2023Updated 2 years ago
- [ICLR 2023] PyTorch code of Summarization Programs: Interpretable Abstractive Summarization with Neural Modular Trees☆24Jun 19, 2023Updated 2 years ago
- Official implementation for Training Certifiably Robust Neural Networks with Efficient Local Lipschitz Bounds (NeurIPS, 2021).☆25Sep 4, 2022Updated 3 years ago
- Code for paper "Parameter Efficient Multi-task Model Fusion with Partial Linearization"☆25Sep 13, 2024Updated last year
- source code for paper "Riemannian Preconditioned LoRA for Fine-Tuning Foundation Models"☆34Jun 20, 2024Updated last year
- ☆32Feb 13, 2024Updated 2 years ago
- [CVPR23W] "A Pilot Study of Query-Free Adversarial Attack against Stable Diffusion" by Haomin Zhuang, Yihua Zhang and Sijia Liu☆26Aug 27, 2024Updated last year
- ☆26Aug 31, 2023Updated 2 years ago
- Adapting LLaMA Decoder to Vision Transformer☆30May 20, 2024Updated last year
- ☆38Jan 15, 2025Updated last year
- Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models"☆11Apr 15, 2024Updated last year
- Enhanced Explainable Neural Network☆10Dec 25, 2021Updated 4 years ago
- How much is the footprint of a piece of software? This script scans the process statistics for the appearance of a given command name and…☆12Nov 16, 2023Updated 2 years ago
- Code for the paper "SMACE: A New Method for the Interpretability of Composite Decision Systems", ECML 2022☆15Apr 17, 2023Updated 2 years ago
- Interpretating the latent space representations of attention head outputs for LLMs☆36Aug 13, 2024Updated last year
- ☆43Jul 22, 2024Updated last year
- PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024)☆42Jan 18, 2026Updated 3 weeks ago
- ☆41Nov 30, 2023Updated 2 years ago
- ☆10Apr 15, 2022Updated 3 years ago
- ☆28Feb 3, 2026Updated 2 weeks ago
- ☆10Oct 26, 2022Updated 3 years ago
- ☆11Jan 13, 2026Updated last month
- ☆13Feb 4, 2025Updated last year
- SODEN: A Scalable Continuous-Time Survival Model through Ordinary Differential Equation Networks☆14Mar 2, 2023Updated 2 years ago
- ☆10Aug 16, 2023Updated 2 years ago
- Deep Generative Model (Torch)☆11Apr 19, 2016Updated 9 years ago
- Software package for intertemporal pricing optimization under reference effects and consumer heterogeneity estimation. Please see REAMDE.…☆10Mar 7, 2024Updated last year
- This repository reproduces the results in the paper "How expressive are transformers in spectral domain for graphs?"(published in TMLR)☆12Jul 10, 2022Updated 3 years ago
- A Swift implementation of Qwen3-ASR speech recognition model using MLX Swift for Apple Silicon.☆46Updated this week
- ☆11Dec 14, 2022Updated 3 years ago
- [EMNLP 2023] Question Answering as Programming for Solving Time-Sensitive Questions☆12Dec 18, 2023Updated 2 years ago
- Adaptive and Robust Multi-Task Learning☆10May 19, 2024Updated last year