PyTorch implementation of LIMoE
☆52Apr 1, 2024Updated last year
Alternatives and similar repositories for LIMoE-pytorch
Users that are interested in LIMoE-pytorch are comparing it to the libraries listed below
Sorting:
- Implementation of the "the first large-scale multimodal mixture of experts models." from the paper: "Multimodal Contrastive Learning with…☆36Jan 31, 2026Updated last month
- ☆11May 17, 2024Updated last year
- Inverse Scaling in Test-Time Compute☆25Dec 3, 2025Updated 3 months ago
- SWIS: Self-Supervised Representation Learning For Writer Independent Offline Signature Verification", ICIP 2022 (Oral)☆11Feb 17, 2023Updated 3 years ago
- ☆13Sep 18, 2019Updated 6 years ago
- Library implementation of "No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations"☆40Oct 31, 2024Updated last year
- A collection of AWESOME things about mixture-of-experts☆1,269Dec 8, 2024Updated last year
- ☆707Dec 6, 2025Updated 3 months ago
- Official Implementation for MoPE (T-MM 2025)☆28Oct 10, 2025Updated 4 months ago
- [ICLR 2023] "Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers" by Tianlong Chen*, Zhenyu Zhang*, Ajay Jaiswal…☆56Feb 28, 2023Updated 3 years ago
- Official Code for ICML 2023 Paper: On the Generalization of Multi-modal Contrastive Learning☆26Nov 15, 2023Updated 2 years ago
- ☆11Jun 3, 2025Updated 9 months ago
- ☆26Jun 14, 2022Updated 3 years ago
- PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538☆1,232Apr 19, 2024Updated last year
- A fast MoE impl for PyTorch☆1,845Feb 10, 2025Updated last year
- ALIGN trained on COYO-dataset☆29Apr 30, 2024Updated last year
- [NeurIPS 2025] What Makes a Reward Model a Good Teacher? An Optimization Perspective☆42Sep 18, 2025Updated 5 months ago
- Stochastic Gradient Langevin Dynamics for Bayesian learning☆36Nov 29, 2021Updated 4 years ago
- Run large scale tensor and coupled matrix-tensor factorization on top of stock Hadoop.☆18Dec 28, 2017Updated 8 years ago
- [ICCV 2025] VisRL: Intention-Driven Visual Perception via Reinforced Reasoning☆45Nov 8, 2025Updated 4 months ago
- Uncovering User Interest from Biased and Noised Watch Time in Video Recommendation. In Recsys23.☆11Jul 18, 2023Updated 2 years ago
- Perception Matters: Exploring Imperceptible and Transferable Anti-forensics for GAN-generated Fake Face Imagery Detection☆11Jan 23, 2023Updated 3 years ago
- Instance-wise Batch Label Restoration via Gradients In Federated Learning (ICLR 2023)☆11May 18, 2023Updated 2 years ago
- ☆10Nov 15, 2023Updated 2 years ago
- Implementation of Pre-text invariant representation learning algorithm in pytorch☆11May 27, 2020Updated 5 years ago
- A large scale inpainting & t2i anime image dataset☆15Oct 18, 2025Updated 4 months ago
- Offical PyTorch implementation of Clover: Towards A Unified Video-Language Alignment and Fusion Model (CVPR2023)☆40Feb 15, 2023Updated 3 years ago
- ☆11Feb 9, 2024Updated 2 years ago
- [ICML 2024] Code for the paper "MoE-RBench: Towards Building Reliable Language Models with Sparse Mixture-of-Experts"☆10Jul 1, 2024Updated last year
- Code for Semi-crowdsourced Clustering with Deep Generative Models☆12Dec 9, 2022Updated 3 years ago
- code for the paper "Adversarial Reinforced Instruction Attacker for Robust Vision-Language Navigation" (TPAMI 2021)☆10Jul 15, 2022Updated 3 years ago
- EasyRLHF aims to provide an easy and minimal interface to train aligned language models, using off-the-shelf solutions and datasets☆10Dec 12, 2023Updated 2 years ago
- Code for COLING 2022 paper "FactMix: Using a Few Labeled In-domain Examples to Generalize to Cross-domain Named Entity Recognition"☆15Jan 15, 2023Updated 3 years ago
- Repository for "CoMix: Comprehensive Benchmark for Multi-Task Comic Understanding"☆16Nov 20, 2024Updated last year
- This repository contains the code used to run generate the data splits, run the hyperparameter tunings, and export the results presented …☆13Jul 22, 2022Updated 3 years ago
- Demo Library for Numato FPGA Boards☆12Mar 4, 2015Updated 11 years ago
- ☆10Aug 10, 2017Updated 8 years ago
- ☆32Jan 30, 2026Updated last month
- Offline Policy Evaluation via Adaptive Weighting with Data from Contextual Bandits☆10Oct 21, 2024Updated last year