neilwen987 / CSR_Adaptive_Rep
Official Code for Paper: Beyond Matryoshka: Revisiting Sparse Coding for Adaptive Representation
☆63 Updated last week
Alternatives and similar repositories for CSR_Adaptive_Rep
Users who are interested in CSR_Adaptive_Rep compare it to the repositories listed below
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812) ☆30 Updated 2 months ago
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization" ☆84 Updated last year
- Using FlexAttention to compute attention with different masking patterns ☆43 Updated 7 months ago
- Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging" ☆26 Updated 6 months ago
- Aioli: A unified optimization framework for language model data mixing ☆25 Updated 3 months ago
- Official Repository for "Hypencoder: Hypernetworks for Information Retrieval" ☆24 Updated 2 months ago
- Official implementation of the ICML 2024 paper RoSA (Robust Adaptation) ☆41 Updated last year
- ☆47 Updated 8 months ago
- Official implementation of "BERTs are Generative In-Context Learners" ☆27 Updated 2 months ago
- Simple and scalable tools for data-driven pretraining data selection. ☆23 Updated 3 months ago
- [NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs ☆84 Updated 5 months ago
- ☆78 Updated 8 months ago
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations" ☆72 Updated 6 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆57 Updated 8 months ago
- [ICLR 2025] Monet: Mixture of Monosemantic Experts for Transformers ☆68 Updated 3 months ago
- One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation ☆39 Updated 7 months ago
- [NeurIPS 2024 Main Track] Code for the paper titled "Instruction Tuning With Loss Over Instructions" ☆36 Updated 11 months ago
- ☆31 Updated last year
- ☆17 Updated 4 months ago
- ☆24 Updated 3 months ago
- Stick-breaking attention ☆53 Updated 2 months ago
- This repo is based on https://github.com/jiaweizzhao/GaLore ☆27 Updated 7 months ago
- ☆72 Updated 3 weeks ago
- Code repository for the paper - "AdANNS: A Framework for Adaptive Semantic Search" ☆64 Updated last year
- Stanford NLP Python library for benchmarking the utility of LLM interpretability methods ☆77 Updated last month
- Code for Zero-Shot Tokenizer Transfer ☆127 Updated 4 months ago
- AnchorAttention: Improved attention for LLMs long-context training ☆207 Updated 3 months ago
- ☆68 Updated 8 months ago
- Exploration of automated dataset selection approaches at large scales. ☆40 Updated 2 months ago
- MEXMA: Token-level objectives improve sentence representations ☆41 Updated 4 months ago