jaisidhsingh / pytorch-mixturesLinks

One-stop solutions for Mixture of Experts and Mixture of Depth modules in PyTorch.

☆24

Alternatives and similar repositories for pytorch-mixtures

Users that are interested in pytorch-mixtures are comparing it to the libraries listed below

Sorting:

SriramB-98 / vit-decompose
☆22Updated 5 months ago
facebookresearch / Mixture-of-Transformers
Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models. TMLR 2025.
☆78Updated last month
nreHieW / minARImageGen
Autoregressive Image Generation
☆32Updated 2 weeks ago
fangyuan-ksgk / Mini-LLaVA
A minimal implementation of LLaVA-style VLM with interleaved image & text & video processing ability.
☆93Updated 6 months ago
kpup1710 / CAMEx
[ICLR 2025] CAMEx: Curvature-Aware Merging of Experts
☆20Updated 3 months ago
locuslab / llava-token-compression
☆42Updated 7 months ago
tianyu-z / VCR
Official Repo for the paper: VCR: Visual Caption Restoration. Check arxiv.org/pdf/2406.06462 for details.
☆31Updated 4 months ago
albanie / foundation-models
Video descriptions of research papers relating to foundation models and scaling
☆31Updated 2 years ago
iancovert / locality-alignment
☆50Updated 5 months ago
jaisidhsingh / CoN-CLIP
Implementation of the "Learn No to Say Yes Better" paper.
☆31Updated last month
sehyunkwon / ICTC
This is a public repository for Image Clustering Conditioned on Text Criteria (IC|TC)
☆88Updated last year
Adamdad / neumeta
NeuMeta transforms neural networks by allowing a single model to adapt on the fly to different sizes, generating the right weights when n…
☆43Updated 7 months ago
fkodom / soft-mixture-of-experts
PyTorch implementation of Soft MoE by Google Brain in "From Sparse to Soft Mixtures of Experts" (https://arxiv.org/pdf/2308.00951.pdf)
☆73Updated last year
Suikasxt / PMG
The repository of paper Personalized Multimodal Response Generation with Large Language Models
☆14Updated 11 months ago
krafton-ai / mambaformer-icl
MambaFormer in-context learning experiments and implementation for https://arxiv.org/abs/2402.04248
☆55Updated last year
hanyang1999 / discrete-diffusion-papers
A collection of papers on discrete diffusion models
☆145Updated 2 weeks ago
Optimization-AI / FastCLIP
Distributed Optimization Infra for learning CLIP models
☆26Updated 8 months ago
YuxinWenRick / diffusion_memorization
Official repo for Detecting, Explaining, and Mitigating Memorization in Diffusion Models (ICLR 2024)
☆75Updated last year
chenllliang / DnD-Transformer
[ICLR 2025] Source code for paper "A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegr…
☆76Updated 6 months ago
TIGER-AI-Lab / VIEScore
Visual Instruction-guided Explainable Metric. Code for "Towards Explainable Metrics for Conditional Image Synthesis Evaluation" (ACL 2024…
☆45Updated 7 months ago
lucidrains / infini-transformer-pytorch
Implementation of Infini-Transformer in Pytorch
☆111Updated 5 months ago
TianjinYellow / SPAM-Optimizer
☆33Updated 3 months ago
AnonymousAlethiometer / SGD_SaI
Official PyTorch Implementation for Paper "No More Adam: Learning Rate Scaling at Initialization is All You Need"
☆52Updated 5 months ago
lucidrains / mixture-of-attention
Some personal experiments around routing tokens to different autoregressive attention, akin to mixture-of-experts
☆119Updated 8 months ago
SHI-Labs / OLA-VLM
OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation, arXiv 2024
☆60Updated 4 months ago
Weixin-Liang / Mixture-of-Mamba
☆43Updated 5 months ago
lucidrains / multimodal-dit-pytorch
Implementation of a multimodal diffusion transformer in Pytorch
☆102Updated last year
ExplainableML / fomo_in_flux
Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24]
☆57Updated 6 months ago
neilwen987 / CSR_Adaptive_Rep
Official Code for Paper: Beyond Matryoshka: Revisiting Sparse Coding for Adaptive Representation
☆80Updated 2 weeks ago
multimodal-interpretability / maia
Official implementation of MAIA, A Multimodal Automated Interpretability Agent
☆82Updated last week