fkodom / soft-mixture-of-experts
PyTorch implementation of Soft MoE by Google Brain in "From Sparse to Soft Mixtures of Experts" (https://arxiv.org/pdf/2308.00951.pdf)
☆82 · Oct 5, 2023 · Updated 2 years ago
Alternatives and similar repositories for soft-mixture-of-experts
Users interested in soft-mixture-of-experts are comparing it to the libraries listed below.
- Implementation of Soft MoE, proposed by Brain's Vision team, in Pytorch ☆344 · Apr 2, 2025 · Updated 10 months ago
- GPT-J 6B inference on TensorRT with INT-8 precision ☆11 · Apr 5, 2023 · Updated 2 years ago
- ☆705 · Dec 6, 2025 · Updated 2 months ago
- ☆13 · Jan 27, 2019 · Updated 7 years ago
- Preprocessing of datasets of chemical reactions: standardization, filtering, augmentation, tokenization, etc. ☆15 · Sep 10, 2025 · Updated 5 months ago
- A collection of AWESOME things about mixture-of-experts ☆1,262 · Dec 8, 2024 · Updated last year
- sigma-MoE layer ☆21 · Jan 5, 2024 · Updated 2 years ago
- Map (deep learning) model weights between different model implementations. ☆19 · Jan 30, 2025 · Updated last year
- Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers" ☆38 · Jun 11, 2025 · Updated 8 months ago
- Federated Learning - PyTorch ☆15 · Jun 27, 2021 · Updated 4 years ago
- AdaMoLE: Adaptive Mixture of LoRA Experts ☆38 · Oct 11, 2024 · Updated last year
- Experiments to assess SPADE on different LLM pipelines. ☆17 · Apr 7, 2024 · Updated last year
- Research Paper: "Graph Contrastive Learning as a Versatile Foundation for Advanced scRNA-seq Data Analysis" ☆10 · Nov 20, 2024 · Updated last year
- Pytorch-based adaptive deformable convolution ☆18 · Jun 26, 2021 · Updated 4 years ago
- Included the standard NMS, Rotate NMS, standard SoftNMS, Rotate SoftNMS, Weighted NMS, Rotate Weighted NMS, Weighted-Boxes-Fusion, Rotate … ☆15 · Oct 11, 2021 · Updated 4 years ago
- Tiny ResNet inspired FPN network (<2M params) for Rotated Object Detection using 5-parameter Modulated Rotation Loss ☆18 · Jul 8, 2021 · Updated 4 years ago
- this code shows the train and test of a YOLOV5 convolutional neural network for detection of electronics components ☆26 · Nov 21, 2024 · Updated last year
- RT-DETR family algorithms implemented based on MMDetection, including RT-DETR, RT-DETRv2, RT-DETRv4, DFINE, DEIM and DEIMv2. ☆40 · Feb 1, 2026 · Updated 2 weeks ago
- ☆23 · Aug 2, 2024 · Updated last year
- Anime head detection using faster-rcnn-fpn ☆19 · Nov 12, 2019 · Updated 6 years ago
- ☆20 · Apr 22, 2021 · Updated 4 years ago
- Neuron Activation ☆26 · Nov 21, 2024 · Updated last year
- Zero-shot evaluation on LEXGLUE tasks with GPT-3.5 ☆29 · Mar 11, 2023 · Updated 2 years ago
- ☆34 · Aug 23, 2023 · Updated 2 years ago
- Replication attempt for the Protein Folding Model described in https://www.biorxiv.org/content/10.1101/2021.08.02.454840v1 ☆37 · May 19, 2022 · Updated 3 years ago
- ☆26 · Oct 2, 2023 · Updated 2 years ago
- ☆30 · Sep 28, 2023 · Updated 2 years ago
- ASCEND Chinese-English code-switching dataset ☆30 · Jul 12, 2022 · Updated 3 years ago
- A Pytorch implementation of Sparsely-Gated Mixture of Experts, for massively increasing the parameter count of language models ☆848 · Sep 13, 2023 · Updated 2 years ago
- PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538 ☆1,228 · Apr 19, 2024 · Updated last year
- This project showcases engaging interactions between two AI chatbots. ☆10 · Jan 10, 2024 · Updated 2 years ago
- Kinematic and dynamic models of continuum and articulated soft robots. ☆15 · Nov 22, 2025 · Updated 2 months ago
- Artifact code release for paper "Uniform-Cost Multi-Path Routing for Reconfigurable Data Center Networks" ☆12 · Sep 5, 2024 · Updated last year
- ☆13 · Jan 8, 2024 · Updated 2 years ago
- Continual Resilient (CoRe) Optimizer for PyTorch ☆11 · Jun 10, 2024 · Updated last year
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks (EMNLP'24) ☆146 · Sep 20, 2024 · Updated last year
- Triton-based implementation of Sparse Mixture of Experts. ☆265 · Oct 3, 2025 · Updated 4 months ago
- ☆39 · Jul 30, 2024 · Updated last year
- ☆37 · Aug 4, 2020 · Updated 5 years ago