One-stop solutions for Mixture of Expert modules in PyTorch.
☆28Feb 10, 2026Updated last month
Alternatives and similar repositories for pytorch-mixtures
Users that are interested in pytorch-mixtures are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Multi-task Learning Model for Recommender Systems☆15Jul 16, 2021Updated 4 years ago
- OICSR: Out-In-Channel Sparsity Regularization for Compact Deep Neural Networks☆26Jul 2, 2019Updated 6 years ago
- WanJuan-CC是以CommonCrawl为基础,经过数据抽取,规则清洗,去重,安全过滤,质量清洗等步骤得到的高质量数据。☆13Apr 18, 2024Updated last year
- [CVPR 2024] DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model☆19Apr 16, 2024Updated last year
- Code accompanying the paper "A contrastive rule for meta-learning"☆13Oct 31, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Official implementation for Sparse MetA-Tuning (SMAT)☆17Jul 29, 2025Updated 8 months ago
- [NeurIPS 2024] Search for Efficient LLMs☆16Jan 16, 2025Updated last year
- Mixture of Experts from scratch☆13Apr 12, 2024Updated last year
- Code for EMNLP2023 paper "MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter".☆12Dec 27, 2023Updated 2 years ago
- Official Code for Composite Sketch+Text Queries for Retrieving Objects with Elusive Names and Complex Interactions☆15Dec 27, 2023Updated 2 years ago
- ☆10Jun 22, 2022Updated 3 years ago
- A minimal PyTorch implementation of BERT (Bidirectional Encoder Representations from Transformers)☆12Mar 20, 2023Updated 3 years ago
- Deep Reinforcement Learning for Dialogue Generation using SEQ2SEQ model☆12Feb 23, 2021Updated 5 years ago
- IIT Guwahati's Gold Medal winning solution to DevRev’s Expert Answers in a Flash Improving Domain-Specific QA☆10Jul 26, 2025Updated 8 months ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- [KDD 2025] Fine-tuning Multimodal Large Language Models for Product Bundling☆15Sep 20, 2025Updated 6 months ago
- Tiktok is an advanced multimedia recommender system that fuses the generative modality-aware collaborative self-augmentation and contrast…☆14Aug 18, 2023Updated 2 years ago
- The official pytorch implementation of our proposed model MISSL (ICDE-24).☆13Dec 8, 2023Updated 2 years ago
- The repository of paper Personalized Multimodal Response Generation with Large Language Models☆18Jun 28, 2024Updated last year
- This project is my attempt at automating work in Notion.☆17Aug 28, 2025Updated 7 months ago
- [EMNLP 2024] Enhancing High-order Interaction Awareness in LLM-based Recommender Model.☆13Jan 9, 2025Updated last year
- G-Refer: Graph Retrieval-Augmented Large Language Model for Explainable Recommendation☆20Mar 5, 2025Updated last year
- Code for "ECoFLaP: Efficient Coarse-to-Fine Layer-Wise Pruning for Vision-Language Models" (ICLR 2024)☆20Feb 16, 2024Updated 2 years ago
- ☆29Oct 25, 2025Updated 5 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- This is the official code for the ACL 2025 paper "GRAM: Generative Recommendation via Semantic-aware Multi-granular Late Fusion".☆31Mar 23, 2026Updated 2 weeks ago
- [ACM TOMM'2025] "MMHCL: Multi-Modal Hypergraph Contrastive Learning for Recommendation"☆30Aug 13, 2025Updated 7 months ago
- ☆15Oct 25, 2021Updated 4 years ago
- ICME2022 Special Session “Beyond Accuracy: Responsible, Responsive, and Robust Multimedia Retrieval ”☆12Jun 3, 2024Updated last year
- Neuro-Symbolic AI Toolkit☆116Sep 17, 2025Updated 6 months ago
- code for GaVaMoE: Gaussian-Variational Gated Mixture of Experts for Explainable Recommendation☆18Dec 7, 2024Updated last year
- [CVPR 2024] PriViLege: Pre-trained Vision and Language Transformers Are Few-Shot Incremental Learners☆57Sep 5, 2024Updated last year
- [SIGIR2024] BIGCF: Exploring the Individuality and Collectivity of Intents behind Interactions for Graph Collaborative Filtering☆15Aug 1, 2024Updated last year
- Supporting code for "LLMs for your iPhone: Whole-Tensor 4 Bit Quantization"☆11Mar 31, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [NIPS2023]Implementation of Foundation Model is Efficient Multimodal Multitask Model Selector☆37Mar 7, 2024Updated 2 years ago
- [WSDM'2025] "MixRec: Heterogeneous Graph Collaborative Filtering"☆21Dec 19, 2024Updated last year
- Update: Ignore this repo, check out @lucidrains' implementation https://github.com/lucidrains/musiclm-pytorch☆15Jan 27, 2023Updated 3 years ago
- Official PyTorch implementation of CD-MOE☆12Mar 18, 2026Updated 3 weeks ago
- [CVPR2024] Simple Semantic-Aided Few-Shot Learning☆58Sep 1, 2024Updated last year
- Official code of "Invariant Collaborative Filtering to Popularity Distribution Shift" (2023 WWW)☆21Jul 27, 2023Updated 2 years ago
- Python code which creates a semantic search bot over any available corpus☆17May 22, 2023Updated 2 years ago