jaisidhsingh / pytorch-mixturesView external linksLinks
One-stop solutions for Mixture of Experts and Mixture of Depth modules in PyTorch.
☆26Feb 9, 2026Updated last week
Alternatives and similar repositories for pytorch-mixtures
Users that are interested in pytorch-mixtures are comparing it to the libraries listed below
Sorting:
- [CVPR 2024] DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model☆19Apr 16, 2024Updated last year
- [NeurIPS 2024] Search for Efficient LLMs☆16Jan 16, 2025Updated last year
- OICSR: Out-In-Channel Sparsity Regularization for Compact Deep Neural Networks☆26Jul 2, 2019Updated 6 years ago
- [NIPS2023]Implementation of Foundation Model is Efficient Multimodal Multitask Model Selector☆37Mar 7, 2024Updated last year
- Fastened CROWN: Tightened Neural Network Robustness Certificates☆10Feb 10, 2020Updated 6 years ago
- Supporting code for "LLMs for your iPhone: Whole-Tensor 4 Bit Quantization"☆11Mar 31, 2024Updated last year
- IIT Guwahati's Gold Medal winning solution to DevRev’s Expert Answers in a Flash Improving Domain-Specific QA☆10Jul 26, 2025Updated 6 months ago
- Mixture of Experts from scratch☆13Apr 12, 2024Updated last year
- A Nonlocal Feature-Driven Exemplar-Based Approach For Image Inpainting☆10Dec 9, 2020Updated 5 years ago
- ☆10Oct 7, 2019Updated 6 years ago
- PyTorch implementation of the SIESTA algorithm from our TMLR-2023 paper "SIESTA: Efficient Online Continual Learning with Sleep"☆13Oct 25, 2024Updated last year
- We have implemented Track # 1 for ICME 2024: Spatial Action Localization on Chaotic World dataset. Our mAP on the validation set reaches …☆12Nov 11, 2024Updated last year
- GPU methods for alpha matting, including cutting edge research algorithms by Philip G. Lee.☆12Jan 8, 2014Updated 12 years ago
- codes for ICML2021 paper iDARTS: Differentiable Architecture Search with Stochastic Implicit Gradients☆10May 27, 2021Updated 4 years ago
- Showing full TensorBoard support in Tensorflow for a CNN using MNIST data.☆13Oct 19, 2019Updated 6 years ago
- A minimal PyTorch implementation of BERT (Bidirectional Encoder Representations from Transformers)☆11Mar 20, 2023Updated 2 years ago
- The Structure and Interpretation of Deep Networks Handbook☆14Dec 14, 2024Updated last year
- [2025 ACL Findings] Measuring What Makes You Unique: Difference-Aware User Modeling for Enhancing LLM Personalization☆25Oct 29, 2025Updated 3 months ago
- An Efficient Dataset Condensation Plugin and Its Application to Continual Learning. NeurIPS, 2023.☆12Nov 29, 2023Updated 2 years ago
- Integration examples and utilities for VOT toolkit☆10Apr 18, 2025Updated 9 months ago
- Official PyTorch implementation of CD-MOE☆12Mar 29, 2025Updated 10 months ago
- (ECCV2022) EAGAN: EAGAN: Efficient Two-stage Evolutionary Architecture Search for GANs☆12Sep 15, 2022Updated 3 years ago
- ☆11Nov 29, 2023Updated 2 years ago
- Fastai+PyTorch implementation of sparse model training methods (SET, SNFS, RigL) + customize-your-own.☆10Oct 20, 2022Updated 3 years ago
- Jax implementation of VIT-VQGAN☆10Jan 25, 2024Updated 2 years ago
- This repository contains the official code for "Flexible Biometrics Recognition: Bridging the Multimodality Gap through Attention, Alignm…☆12Oct 9, 2024Updated last year
- ChineseCLIP using online learning☆13Nov 7, 2022Updated 3 years ago
- ☆11Jun 2, 2021Updated 4 years ago
- Vectorgraph Image Painter☆12Mar 24, 2019Updated 6 years ago
- Symbolic Graphics Programming with Large Language Models☆37Sep 14, 2025Updated 5 months ago
- official implementation of Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation☆13Apr 15, 2024Updated last year
- The official implementation of paper "Drop-Activation: Implicit Parameter Reduction and Harmonious Regularization".☆10May 30, 2019Updated 6 years ago
- Nano vLLM☆12Jun 26, 2025Updated 7 months ago
- An example of how the LIME algorithm can be used to provide real-world insight into the decision processes of a 'black-box' machine learn…☆15Feb 19, 2019Updated 6 years ago
- [ACL'22] Training-free Neural Architecture Search for RNNs and Transformers☆14May 26, 2024Updated last year
- This is the official code for the paper "Reconstruct before Query: Continual Missing Modality Learning with Decomposed Prompt Collaborati…☆12Aug 13, 2024Updated last year
- ☆12Sep 1, 2023Updated 2 years ago
- Official code of "NAS acceleration via proxy data", IJCAI21☆10May 29, 2022Updated 3 years ago
- PyTorch implementation of "Learning from Students: Online Contrastive Distillation Network for General Continual Learning" (IJCAI 2022)☆11Dec 29, 2022Updated 3 years ago