One-stop solutions for Mixture of Expert modules in PyTorch.
☆27May 3, 2026Updated last month
Alternatives and similar repositories for pytorch-mixtures
Users that are interested in pytorch-mixtures are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Multi-task Learning Model for Recommender Systems☆15Jul 16, 2021Updated 4 years ago
- OICSR: Out-In-Channel Sparsity Regularization for Compact Deep Neural Networks☆25Jul 2, 2019Updated 6 years ago
- WanJuan-CC是以CommonCrawl为基础,经过数据抽取,规则清洗,去重,安全过滤,质量清洗等步骤得到的高质量数据。☆14Apr 18, 2024Updated 2 years ago
- [CVPR 2024] DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model☆19Apr 16, 2024Updated 2 years ago
- Code accompanying the paper "A contrastive rule for meta-learning"☆13Oct 31, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Some utility functions to help myself (and perhaps others) go faster with ML/AI work☆50Jun 2, 2026Updated last week
- Official implementation for Sparse MetA-Tuning (SMAT)☆17Jul 29, 2025Updated 10 months ago
- [NeurIPS 2024] Search for Efficient LLMs☆17Jan 16, 2025Updated last year
- Mixture of Experts from scratch☆14Apr 12, 2024Updated 2 years ago
- Code for EMNLP2023 paper "MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter".☆12Dec 27, 2023Updated 2 years ago
- Easy wrapper for inserting LoRA layers in CLIP.☆40Jun 16, 2024Updated last year
- ☆10Jun 22, 2022Updated 3 years ago
- Large-Scale Scene Text Dataset for Indic Languages☆20May 19, 2026Updated 3 weeks ago
- A minimal PyTorch implementation of BERT (Bidirectional Encoder Representations from Transformers)☆12Mar 20, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Deep Reinforcement Learning for Dialogue Generation using SEQ2SEQ model☆11Feb 23, 2021Updated 5 years ago
- IIT Guwahati's Gold Medal winning solution to DevRev’s Expert Answers in a Flash Improving Domain-Specific QA☆11Jul 26, 2025Updated 10 months ago
- [KDD 2025] Fine-tuning Multimodal Large Language Models for Product Bundling☆15Sep 20, 2025Updated 8 months ago
- Tiktok is an advanced multimedia recommender system that fuses the generative modality-aware collaborative self-augmentation and contrast…☆14Aug 18, 2023Updated 2 years ago
- [2025 ACL Findings] Measuring What Makes You Unique: Difference-Aware User Modeling for Enhancing LLM Personalization☆25Oct 29, 2025Updated 7 months ago
- The official pytorch implementation of our proposed model MISSL (ICDE-24).☆13Dec 8, 2023Updated 2 years ago
- This is an official implementation for "Learning to Expand Audience via Meta Hybrid Experts and Critics for Recommendation and Advertisin…☆58Feb 22, 2022Updated 4 years ago
- Implementation of Baseline for Scene Text-to-Scene Text Translation☆19Mar 30, 2025Updated last year
- Implementation of 2-simplicial attention proposed by Clift et al. (2019) and the recent attempt to make practical in Fast and Simplex, Ro…☆48Sep 2, 2025Updated 9 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- The repository of paper Personalized Multimodal Response Generation with Large Language Models☆18Jun 28, 2024Updated last year
- This project is my attempt at automating work in Notion.☆17Aug 28, 2025Updated 9 months ago
- [EMNLP 2024] Enhancing High-order Interaction Awareness in LLM-based Recommender Model.☆13Jan 9, 2025Updated last year
- A Pytorch tutorial of Conditional Flow Matching[Lipman22] using MNIST dataset.☆32Aug 26, 2025Updated 9 months ago
- Code for "ECoFLaP: Efficient Coarse-to-Fine Layer-Wise Pruning for Vision-Language Models" (ICLR 2024)☆21Feb 16, 2024Updated 2 years ago
- [KDD'25] Flow Matching for Collaborative Filtering☆23Sep 6, 2025Updated 9 months ago
- 记录推荐系统相关的面试题、优化经验☆38Jun 2, 2025Updated last year
- ☆15Oct 25, 2021Updated 4 years ago
- Official source code for AAAI 2025 paper: CoRA: Collaborative Information Perception by Large Language Model's Weights for Recommendatio…☆18Dec 11, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- 🏆 A ranked list of awesome machine learning Julia libraries.☆25Nov 15, 2021Updated 4 years ago
- Design your Material-UI buttons, add clickable hyperlinks, integrate them in your Streamlit apps! 🎈☆10Jun 17, 2022Updated 3 years ago
- We have implemented Track # 1 for ICME 2024: Spatial Action Localization on Chaotic World dataset. Our mAP on the validation set reaches …☆12Nov 11, 2024Updated last year
- ICME2022 Special Session “Beyond Accuracy: Responsible, Responsive, and Robust Multimedia Retrieval ”☆12Jun 3, 2024Updated 2 years ago
- code for GaVaMoE: Gaussian-Variational Gated Mixture of Experts for Explainable Recommendation☆18Dec 7, 2024Updated last year
- [CVPR 2024] PriViLege: Pre-trained Vision and Language Transformers Are Few-Shot Incremental Learners☆57Sep 5, 2024Updated last year
- [SIGIR2024] BIGCF: Exploring the Individuality and Collectivity of Intents behind Interactions for Graph Collaborative Filtering☆16Aug 1, 2024Updated last year