YeonwooSung / LIMoE-pytorchView external linksLinks
PyTorch implementation of LIMoE
☆52Apr 1, 2024Updated last year
Alternatives and similar repositories for LIMoE-pytorch
Users that are interested in LIMoE-pytorch are comparing it to the libraries listed below
Sorting:
- Implementation of the "the first large-scale multimodal mixture of experts models." from the paper: "Multimodal Contrastive Learning with…☆36Jan 31, 2026Updated 2 weeks ago
- Inverse Scaling in Test-Time Compute☆25Dec 3, 2025Updated 2 months ago
- ☆13Sep 18, 2019Updated 6 years ago
- A collection of AWESOME things about mixture-of-experts☆1,262Dec 8, 2024Updated last year
- code for Learning the Unlearned: Mitigating Feature Suppression in Contrastive Learning☆20Jul 16, 2024Updated last year
- ☆705Dec 6, 2025Updated 2 months ago
- Official Implementation for MoPE (T-MM 2025)☆28Oct 10, 2025Updated 4 months ago
- [ICLR 2023] "Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers" by Tianlong Chen*, Zhenyu Zhang*, Ajay Jaiswal…☆56Feb 28, 2023Updated 2 years ago
- ☆93Apr 3, 2023Updated 2 years ago
- Official Code for ICML 2023 Paper: On the Generalization of Multi-modal Contrastive Learning☆26Nov 15, 2023Updated 2 years ago
- ☆11Jun 3, 2025Updated 8 months ago
- ☆26Jun 14, 2022Updated 3 years ago
- Code repository for "Parameter Efficient Self-supervised Geospatial Domain Adaptation", CVPR 2024☆35Jul 29, 2024Updated last year
- PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538☆1,228Apr 19, 2024Updated last year
- V1: Toward Multimodal Reasoning by Designing Auxiliary Task☆36Apr 14, 2025Updated 10 months ago
- [NeurIPS 2025] What Makes a Reward Model a Good Teacher? An Optimization Perspective☆42Sep 18, 2025Updated 4 months ago
- Implementation of Soft MoE, proposed by Brain's Vision team, in Pytorch☆344Apr 2, 2025Updated 10 months ago
- Implementation of PCA algorithm using Gram-Scmidt modification on NIPALS☆10Jun 13, 2015Updated 10 years ago
- Stochastic Gradient Langevin Dynamics for Bayesian learning☆36Nov 29, 2021Updated 4 years ago
- A Pytorch implementation of Sparsely-Gated Mixture of Experts, for massively increasing the parameter count of language models☆848Sep 13, 2023Updated 2 years ago
- ☆12Dec 9, 2022Updated 3 years ago
- ☆10Nov 15, 2023Updated 2 years ago
- JMLR Cover Letter Template☆10Dec 15, 2021Updated 4 years ago
- Book: Practical Probabilistic Machine Learning in Python☆10Apr 3, 2021Updated 4 years ago
- Official repository for the paper "On the use of Benford's law to detect GAN-generated images", ICPR2020☆13Apr 7, 2021Updated 4 years ago
- Run large scale tensor and coupled matrix-tensor factorization on top of stock Hadoop.☆18Dec 28, 2017Updated 8 years ago
- 从零快速使用Ubuntu,搭建深度学习环境,持续更新中☆10Apr 18, 2023Updated 2 years ago
- ☆10Jul 8, 2021Updated 4 years ago
- ☆14Jul 4, 2022Updated 3 years ago
- Implementation of Pre-text invariant representation learning algorithm in pytorch☆11May 27, 2020Updated 5 years ago
- 2024年第六届全球校园人工智能算法精英大赛AI生成人脸图像鉴别☆15May 30, 2025Updated 8 months ago
- Instance-wise Batch Label Restoration via Gradients In Federated Learning (ICLR 2023)☆11May 18, 2023Updated 2 years ago
- Perception Matters: Exploring Imperceptible and Transferable Anti-forensics for GAN-generated Fake Face Imagery Detection☆11Jan 23, 2023Updated 3 years ago
- Source code and data of our paper "Missing Counter-Evidence Renders NLP Fact-Checking Unrealistic for Misinformation" (https://arxiv.org/…☆10Jun 21, 2023Updated 2 years ago
- Official code for the paper "Adversarial Magnification to Deceive Deepfake Detection through Super Resolution"☆12Jun 26, 2023Updated 2 years ago
- Transformer from scratch with einsum method☆11Jul 8, 2021Updated 4 years ago
- This repository contains the code used to run generate the data splits, run the hyperparameter tunings, and export the results presented …☆13Jul 22, 2022Updated 3 years ago
- OpenAI ROS☆12Mar 7, 2019Updated 6 years ago
- ☆13Oct 25, 2019Updated 6 years ago