One-stop solutions for Mixture of Expert modules in PyTorch.
☆28Feb 10, 2026Updated 2 months ago
Alternatives and similar repositories for pytorch-mixtures
Users that are interested in pytorch-mixtures are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- WanJuan-CC是以CommonCrawl为基础,经过数据抽取,规则清洗,去重,安全过滤,质量清洗等步骤得到的高质量数据。☆13Apr 18, 2024Updated 2 years ago
- [CVPR 2024] DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model☆19Apr 16, 2024Updated 2 years ago
- Source codes and Datasets for EDDA in CIKM'23☆21Aug 8, 2023Updated 2 years ago
- [NeurIPS 2024] Search for Efficient LLMs☆16Jan 16, 2025Updated last year
- ☆10Jun 22, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- IIT Guwahati's Gold Medal winning solution to DevRev’s Expert Answers in a Flash Improving Domain-Specific QA☆11Jul 26, 2025Updated 9 months ago
- [KDD 2025] Fine-tuning Multimodal Large Language Models for Product Bundling☆15Sep 20, 2025Updated 7 months ago
- A straightforward method to reduce your LLM inference API costs and token usage.☆24May 18, 2025Updated 11 months ago
- [2025 ACL Findings] Measuring What Makes You Unique: Difference-Aware User Modeling for Enhancing LLM Personalization☆25Oct 29, 2025Updated 6 months ago
- Contrastive Dialogue Disentanglement via Clustering☆12Apr 26, 2023Updated 3 years ago
- This is an official implementation for "Learning to Expand Audience via Meta Hybrid Experts and Critics for Recommendation and Advertisin…☆58Feb 22, 2022Updated 4 years ago
- This project is my attempt at automating work in Notion.☆17Aug 28, 2025Updated 8 months ago
- [EMNLP 2024] Enhancing High-order Interaction Awareness in LLM-based Recommender Model.☆13Jan 9, 2025Updated last year
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆171Aug 25, 2025Updated 8 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- G-Refer: Graph Retrieval-Augmented Large Language Model for Explainable Recommendation☆20Mar 5, 2025Updated last year
- A Pytorch tutorial of Conditional Flow Matching[Lipman22] using MNIST dataset.☆30Aug 26, 2025Updated 8 months ago
- Code for "ECoFLaP: Efficient Coarse-to-Fine Layer-Wise Pruning for Vision-Language Models" (ICLR 2024)☆21Feb 16, 2024Updated 2 years ago
- [KDD'25] Flow Matching for Collaborative Filtering☆22Sep 6, 2025Updated 7 months ago
- A collection of some awesome public Julia programming language projects.☆22Feb 22, 2024Updated 2 years ago
- [ACM TOMM'2025] "MMHCL: Multi-Modal Hypergraph Contrastive Learning for Recommendation"☆30Aug 13, 2025Updated 8 months ago
- Official source code for AAAI 2025 paper: Augmenting Sequential Recommendation with Balanced Relevance and Diversity☆25Apr 16, 2025Updated last year
- ☆15Oct 25, 2021Updated 4 years ago
- Official source code for AAAI 2025 paper: CoRA: Collaborative Information Perception by Large Language Model's Weights for Recommendatio…☆18Dec 11, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- 🏆 A ranked list of awesome machine learning Julia libraries.☆24Nov 15, 2021Updated 4 years ago
- ICME2022 Special Session “Beyond Accuracy: Responsible, Responsive, and Robust Multimedia Retrieval ”☆12Jun 3, 2024Updated last year
- [WWW'25]The official implementation of Graph Representation Learning via Causal Diffusion for Out-of-Distribution Recommendation☆18Mar 29, 2025Updated last year
- Neuro-Symbolic AI Toolkit☆117Sep 17, 2025Updated 7 months ago
- code for GaVaMoE: Gaussian-Variational Gated Mixture of Experts for Explainable Recommendation☆18Dec 7, 2024Updated last year
- PyTorch Implementation of the Multi-gate Mixture-of-Experts with Exclusivity (MMoEEx)☆34Jul 6, 2021Updated 4 years ago
- [CVPR 2024] PriViLege: Pre-trained Vision and Language Transformers Are Few-Shot Incremental Learners☆57Sep 5, 2024Updated last year
- [SIGIR2024] BIGCF: Exploring the Individuality and Collectivity of Intents behind Interactions for Graph Collaborative Filtering☆15Aug 1, 2024Updated last year
- Supporting code for "LLMs for your iPhone: Whole-Tensor 4 Bit Quantization"☆11Mar 31, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [ICML 2025] DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization☆21May 24, 2025Updated 11 months ago
- Common template for pytorch project. Easy to extent and modify for new project.☆13Dec 13, 2022Updated 3 years ago
- ☆17Dec 17, 2023Updated 2 years ago
- Official PyTorch implementation of CD-MOE☆12Mar 18, 2026Updated last month
- [CVPR2024] Simple Semantic-Aided Few-Shot Learning☆58Sep 1, 2024Updated last year
- Official code of "Invariant Collaborative Filtering to Popularity Distribution Shift" (2023 WWW)☆21Jul 27, 2023Updated 2 years ago
- Implementation of a Hierarchical Mamba as described in the paper: "Hierarchical State Space Models for Continuous Sequence-to-Sequence Mo…☆15Nov 11, 2024Updated last year