One-stop solutions for Mixture of Expert modules in PyTorch.
☆26Feb 10, 2026Updated 3 weeks ago
Alternatives and similar repositories for pytorch-mixtures
Users that are interested in pytorch-mixtures are comparing it to the libraries listed below
Sorting:
- WanJuan-CC是以CommonCrawl为基础,经过数据抽取,规则清洗,去重,安全过滤,质量清洗等步骤得到的高质量数据。☆14Apr 18, 2024Updated last year
- Multi-task Learning Model for Recommender Systems☆15Jul 16, 2021Updated 4 years ago
- [NeurIPS 2024] Search for Efficient LLMs☆16Jan 16, 2025Updated last year
- Code for "ECoFLaP: Efficient Coarse-to-Fine Layer-Wise Pruning for Vision-Language Models" (ICLR 2024)☆20Feb 16, 2024Updated 2 years ago
- OICSR: Out-In-Channel Sparsity Regularization for Compact Deep Neural Networks☆26Jul 2, 2019Updated 6 years ago
- [NIPS2023]Implementation of Foundation Model is Efficient Multimodal Multitask Model Selector☆37Mar 7, 2024Updated 2 years ago
- Supporting code for "LLMs for your iPhone: Whole-Tensor 4 Bit Quantization"☆11Mar 31, 2024Updated last year
- IIT Guwahati's Gold Medal winning solution to DevRev’s Expert Answers in a Flash Improving Domain-Specific QA☆10Jul 26, 2025Updated 7 months ago
- Fastened CROWN: Tightened Neural Network Robustness Certificates☆10Feb 10, 2020Updated 6 years ago
- Mixture of Experts from scratch☆13Apr 12, 2024Updated last year
- [2025 ACL Findings] Measuring What Makes You Unique: Difference-Aware User Modeling for Enhancing LLM Personalization☆25Oct 29, 2025Updated 4 months ago
- Showing full TensorBoard support in Tensorflow for a CNN using MNIST data.☆13Oct 19, 2019Updated 6 years ago
- An example of how the LIME algorithm can be used to provide real-world insight into the decision processes of a 'black-box' machine learn…☆15Feb 19, 2019Updated 7 years ago
- Official PyTorch implementation of CD-MOE☆12Mar 29, 2025Updated 11 months ago
- This repository contains the official code for "Flexible Biometrics Recognition: Bridging the Multimodality Gap through Attention, Alignm…☆11Oct 9, 2024Updated last year
- (ECCV2022) EAGAN: EAGAN: Efficient Two-stage Evolutionary Architecture Search for GANs☆12Sep 15, 2022Updated 3 years ago
- Jax implementation of VIT-VQGAN☆10Jan 25, 2024Updated 2 years ago
- PyTorch implementation of the SIESTA algorithm from our TMLR-2023 paper "SIESTA: Efficient Online Continual Learning with Sleep"☆13Oct 25, 2024Updated last year
- Symbolic Graphics Programming with Large Language Models☆37Sep 14, 2025Updated 5 months ago
- ChineseCLIP using online learning☆13Nov 7, 2022Updated 3 years ago
- An Efficient Dataset Condensation Plugin and Its Application to Continual Learning. NeurIPS, 2023.☆12Nov 29, 2023Updated 2 years ago
- codes for ICML2021 paper iDARTS: Differentiable Architecture Search with Stochastic Implicit Gradients☆10May 27, 2021Updated 4 years ago
- Implementation of a Hierarchical Mamba as described in the paper: "Hierarchical State Space Models for Continuous Sequence-to-Sequence Mo…☆15Nov 11, 2024Updated last year
- The official implementation of paper "Drop-Activation: Implicit Parameter Reduction and Harmonious Regularization".☆10May 30, 2019Updated 6 years ago
- ☆11Oct 8, 2020Updated 5 years ago
- ☆11Jun 2, 2021Updated 4 years ago
- The Structure and Interpretation of Deep Networks Handbook☆14Dec 14, 2024Updated last year
- Fastai+PyTorch implementation of sparse model training methods (SET, SNFS, RigL) + customize-your-own.☆10Oct 20, 2022Updated 3 years ago
- Common template for pytorch project. Easy to extent and modify for new project.☆13Dec 13, 2022Updated 3 years ago
- Nano vLLM☆13Jun 26, 2025Updated 8 months ago
- Integration examples and utilities for VOT toolkit☆11Feb 18, 2026Updated 2 weeks ago
- ☆10Oct 7, 2019Updated 6 years ago
- Vectorgraph Image Painter☆12Mar 24, 2019Updated 6 years ago
- A minimal PyTorch implementation of BERT (Bidirectional Encoder Representations from Transformers)☆12Mar 20, 2023Updated 2 years ago
- A Nonlocal Feature-Driven Exemplar-Based Approach For Image Inpainting☆10Dec 9, 2020Updated 5 years ago
- Code Repository for the NeurIPS 2024 Paper "Toward Efficient Inference for Mixture of Experts".☆19Oct 30, 2024Updated last year
- [KDD 2025] The implementation of "Fine-tuning Multimodal Large Language Models for Product Bundling", KDD'25☆15Sep 20, 2025Updated 5 months ago
- ☆13Nov 26, 2023Updated 2 years ago
- PyTorch implementation of "Learning from Students: Online Contrastive Distillation Network for General Continual Learning" (IJCAI 2022)☆11Dec 29, 2022Updated 3 years ago