naver-ai / model-stock
Model Stock: All we need is just a few fine-tuned models
☆88Updated last month
Related projects ⓘ
Alternatives and complementary repositories for model-stock
- Patching open-vocabulary models by interpolating weights☆90Updated last year
- [ECCV2024][ICCV2023] Official PyTorch implementation of SeiT++ and SeiT☆51Updated 2 months ago
- [ACL 2024 Findings & ICLR 2024 WS] An Evaluator VLM that is open-source, offers reproducible evaluation, and inexpensive to use. Specific…☆53Updated last month
- [Under Review] Official PyTorch implementation code for realizing the technical part of Phantom of Latent representing equipped with enla…☆45Updated last month
- This is a PyTorch implementation of the paperViP A Differentially Private Foundation Model for Computer Vision☆37Updated last year
- Official code for the ICML 2024 paper "The Entropy Enigma: Success and Failure of Entropy Minimization"☆43Updated 5 months ago
- ☆148Updated 9 months ago
- Official implementation of Hierarchical Context Merging: Better Long Context Understanding for Pre-trained LLMs (ICLR 2024).☆31Updated 3 months ago
- Official implementation of MAIA, A Multimodal Automated Interpretability Agent☆62Updated 2 months ago
- Code for T-MARS data filtering☆35Updated last year
- Code for NOLA, an implementation of "nola: Compressing LoRA using Linear Combination of Random Basis"☆48Updated 2 months ago
- These papers will provide unique insightful concepts that will broaden your perspective on neural networks and deep learning☆45Updated last year
- Simple implementation of muP, based on Spectral Condition for Feature Learning. The implementation is SGD only, dont use it for Adam☆69Updated 3 months ago
- ☆24Updated 2 months ago
- This is the official repository for the paper "Flora: Low-Rank Adapters Are Secretly Gradient Compressors" in ICML 2024.☆69Updated 4 months ago
- Language Quantized AutoEncoders☆94Updated last year
- Code accompanying the paper "Massive Activations in Large Language Models"☆121Updated 8 months ago
- Official implementation for NeurIPS'23 paper "Geodesic Multi-Modal Mixup for Robust Fine-Tuning"☆28Updated last month
- This is the official repository of our paper "What If We Recaption Billions of Web Images with LLaMA-3 ?"☆120Updated 4 months ago
- ☆50Updated 10 months ago
- This is a public repository for Image Clustering Conditioned on Text Criteria (IC|TC)☆79Updated 7 months ago
- Switch EMA: A Free Lunch for Better Flatness and Sharpness☆24Updated 8 months ago
- Language models scale reliably with over-training and on downstream tasks☆94Updated 7 months ago
- Recycling diverse models☆43Updated last year
- Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24]☆32Updated 2 months ago
- ☆62Updated 3 months ago
- Some personal experiments around routing tokens to different autoregressive attention, akin to mixture-of-experts☆108Updated 3 weeks ago
- Official Code Release for "Diagnosing and Rectifying Vision Models using Language" (ICLR 2023)☆32Updated last year
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.☆49Updated last week
- Revisiting Efficient Training Algorithms For Transformer-based Language Models (NeurIPS 2023)☆79Updated last year