automl / is_mamba_capable_of_icl
☆13Updated 6 months ago
Related projects ⓘ
Alternatives and complementary repositories for is_mamba_capable_of_icl
- ☆25Updated 4 months ago
- Test-time-training on nearest neighbors for large language models☆27Updated 7 months ago
- `dattri` is a PyTorch library for developing, benchmarking, and deploying efficient data attribution algorithms.☆32Updated 2 weeks ago
- Deep Learning & Information Bottleneck☆50Updated last year
- ☆63Updated 2 years ago
- ☆50Updated 6 months ago
- A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity.☆54Updated 2 weeks ago
- ☆44Updated 10 months ago
- Official Repository for The Paper: Safety Alignment Should Be Made More Than Just a Few Tokens Deep☆28Updated 4 months ago
- A modern look at the relationship between sharpness and generalization [ICML 2023]☆43Updated last year
- ☆44Updated last year
- ☆26Updated last year
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".☆87Updated last year
- ☆36Updated 3 months ago
- ☆24Updated 8 months ago
- [ICML 2024] Junk DNA Hypothesis: A Task-Centric Angle of LLM Pre-trained Weights through Sparsity; Lu Yin*, Ajay Jaiswal*, Shiwei Liu, So…☆15Updated 5 months ago
- Code for NeurIPS'23 paper "A Bayesian Approach To Analysing Training Data Attribution In Deep Learning"☆14Updated 10 months ago
- Official Repository for ICML 2023 paper "Can Neural Network Memorization Be Localized?"☆16Updated last year
- A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643☆69Updated last year
- Bayesian low-rank adaptation for large language models☆23Updated 6 months ago
- Towards Understanding Sharpness-Aware Minimization [ICML 2022]☆35Updated 2 years ago
- Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning [ICML 2024]☆15Updated 6 months ago
- ☆15Updated 4 months ago
- ☆33Updated 9 months ago
- Revisiting Efficient Training Algorithms For Transformer-based Language Models (NeurIPS 2023)☆79Updated last year
- Long Context Extension and Generalization in LLMs☆39Updated 2 months ago
- ☆26Updated 3 weeks ago
- ☆58Updated 2 years ago
- This is the repository for "Model Merging by Uncertainty-Based Gradient Matching", ICLR 2024.☆21Updated 6 months ago
- Influence Analysis and Estimation - Survey, Papers, and Taxonomy☆63Updated 8 months ago