naver-ai / model-stock

Model Stock: All we need is just a few fine-tuned models

☆121

Alternatives and similar repositories for model-stock

Users who are interested in model-stock are comparing it to the libraries listed below.

prometheus-eval / prometheus-vision
[ACL 2024 Findings & ICLR 2024 WS] An Evaluator VLM that is open-source, offers reproducible evaluation, and is inexpensive to use. Specific…
☆74 Updated 10 months ago
ByungKwanLee / Phantom
[Technical Report] Official PyTorch implementation code for realizing the technical part of Phantom of Latent representing equipped with …
☆60 Updated 10 months ago
jongwooko / distillm-2
Official PyTorch implementation of DistiLLM-2: A Contrastive Approach Boosts the Distillation of LLMs (ICML 2025 Oral)
☆34 Updated last month
oripress / EntropyEnigma
Official code for the ICML 2024 paper "The Entropy Enigma: Success and Failure of Entropy Minimization"
☆53 Updated last year
alinlab / HOMER
Official implementation of Hierarchical Context Merging: Better Long Context Understanding for Pre-trained LLMs (ICLR 2024).
☆43 Updated last year
locuslab / T-MARS
Code for T-MARS data filtering
☆35 Updated last year
prateeky2806 / ties-merging
☆185 Updated last year
mlfoundations / patching
Patching open-vocabulary models by interpolating weights
☆91 Updated last year
mlfoundations / clip_quality_not_quantity
☆29 Updated 2 years ago
UCDvision / NOLA
Code for NOLA, an implementation of "NOLA: Compressing LoRA using Linear Combination of Random Basis"
☆55 Updated 11 months ago
multimodal-interpretability / maia
Official implementation of MAIA, A Multimodal Automated Interpretability Agent
☆83 Updated last month
ExplainableML / fomo_in_flux
Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24]
☆57 Updated 8 months ago
jihoontack / MAC
Online Adaptation of Language Models with a Memory of Amortized Contexts (NeurIPS 2024)
☆65 Updated last year
haoliuhl / language-quantized-autoencoders
Language Quantized AutoEncoders
☆108 Updated 2 years ago
parameterlab / apricot
Source code of "Calibrating Large Language Models Using Their Generations Only", ACL2024
☆19 Updated 8 months ago
naver-ai / seit
[ECCV2024][ICCV2023] Official PyTorch implementation of SeiT++ and SeiT
☆55 Updated 11 months ago
locuslab / massive-activations
Code accompanying the paper "Massive Activations in Large Language Models"
☆174 Updated last year
ByungKwanLee / TroL
[EMNLP 2024] Official PyTorch implementation code for realizing the technical part of Traversal of Layers (TroL) presenting new propagati…
☆97 Updated last year
dmis-lab / Monet
[ICLR 2025] Monet: Mixture of Monosemantic Experts for Transformers
☆70 Updated last month
minyoungg / LTE
☆68 Updated last year
apple / ml-rpm-bench
☆41 Updated last year
jiasenlu / LL3M
LL3M: Large Language and Multi-Modal Model in Jax
☆72 Updated last year
wang-kee / LiNeS
Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging"
☆30 Updated 9 months ago
katiekang1998 / reasoning_generalization
☆34 Updated 7 months ago
TomerRonen34 / mixed-resolution-vit
☆51 Updated last year
mlfoundations / scaling
Language models scale reliably with over-training and on downstream tasks
☆97 Updated last year
ml-jku / EVA
One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation
☆41 Updated 9 months ago
JeanKaddour / LAWA
Latest Weight Averaging (NeurIPS HITY 2022)
☆31 Updated 2 years ago
Westlake-AI / SEMA
Switch EMA: A Free Lunch for Better Flatness and Sharpness
☆26 Updated last year
ncsoft / offsetbias
Official implementation of "OffsetBias: Leveraging Debiased Data for Tuning Evaluators"
☆24 Updated 11 months ago