One-stop solutions for Mixture of Expert modules in PyTorch.
☆27May 3, 2026Updated 2 weeks ago
Alternatives and similar repositories for pytorch-mixtures
Users that are interested in pytorch-mixtures are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Hierarchical Story Generation based on (https://arxiv.org/abs/1805.04833)☆12May 6, 2020Updated 6 years ago
- This is a PyTorch implementation of a Transformer Decoder based model that plays chess.☆17Mar 15, 2024Updated 2 years ago
- ☆41Jun 14, 2025Updated 11 months ago
- WanJuan-CC是以CommonCrawl为基础,经过数据抽取,规则清洗,去重,安全过滤,质量清洗等步骤得到的高质量数据。☆13Apr 18, 2024Updated 2 years ago
- [CVPR 2024] DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model☆19Apr 16, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Code accompanying the paper "A contrastive rule for meta-learning"☆13Oct 31, 2024Updated last year
- [NeurIPS 2024] Search for Efficient LLMs☆17Jan 16, 2025Updated last year
- Mixture of Experts from scratch☆14Apr 12, 2024Updated 2 years ago
- Code for EMNLP2023 paper "MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter".☆12Dec 27, 2023Updated 2 years ago
- A minimal PyTorch implementation of BERT (Bidirectional Encoder Representations from Transformers)☆12Mar 20, 2023Updated 3 years ago
- Deep Reinforcement Learning for Dialogue Generation using SEQ2SEQ model☆11Feb 23, 2021Updated 5 years ago
- Implementation of Baseline for Scene Text-to-Scene Text Translation☆19Mar 30, 2025Updated last year
- A straightforward method to reduce your LLM inference API costs and token usage.☆24May 18, 2025Updated last year
- Tiktok is an advanced multimedia recommender system that fuses the generative modality-aware collaborative self-augmentation and contrast…☆14Aug 18, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- [2025 ACL Findings] Measuring What Makes You Unique: Difference-Aware User Modeling for Enhancing LLM Personalization☆25Oct 29, 2025Updated 6 months ago
- Contrastive Dialogue Disentanglement via Clustering☆12Apr 26, 2023Updated 3 years ago
- Chrome Extension for Switching Google Account in Web Apps☆24Apr 11, 2024Updated 2 years ago
- The repository of paper Personalized Multimodal Response Generation with Large Language Models☆18Jun 28, 2024Updated last year
- This project is my attempt at automating work in Notion.☆17Aug 28, 2025Updated 8 months ago
- [EMNLP 2024] Enhancing High-order Interaction Awareness in LLM-based Recommender Model.☆13Jan 9, 2025Updated last year
- G-Refer: Graph Retrieval-Augmented Large Language Model for Explainable Recommendation☆20Mar 5, 2025Updated last year
- ICFHR2020☆22Apr 11, 2021Updated 5 years ago
- Code for "ECoFLaP: Efficient Coarse-to-Fine Layer-Wise Pruning for Vision-Language Models" (ICLR 2024)☆21Feb 16, 2024Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A collection of some awesome public Julia programming language projects.☆22Feb 22, 2024Updated 2 years ago
- This is the official code for the ACL 2025 paper "GRAM: Generative Recommendation via Semantic-aware Multi-granular Late Fusion".☆31Mar 23, 2026Updated last month
- Implementation of our paper, "MM-Forecast: A Multimodal Approach to Temporal Event Forecasting with Large Language Models".☆18Apr 16, 2025Updated last year
- Official source code for AAAI 2025 paper: CoRA: Collaborative Information Perception by Large Language Model's Weights for Recommendatio…☆18Dec 11, 2024Updated last year
- 🏆 A ranked list of awesome machine learning Julia libraries.☆24Nov 15, 2021Updated 4 years ago
- ICME2022 Special Session “Beyond Accuracy: Responsible, Responsive, and Robust Multimedia Retrieval ”☆12Jun 3, 2024Updated last year
- [WWW'25]The official implementation of Graph Representation Learning via Causal Diffusion for Out-of-Distribution Recommendation☆19Mar 29, 2025Updated last year
- [CVPR 2024] PriViLege: Pre-trained Vision and Language Transformers Are Few-Shot Incremental Learners☆57Sep 5, 2024Updated last year
- [SIGIR2024] BIGCF: Exploring the Individuality and Collectivity of Intents behind Interactions for Graph Collaborative Filtering☆16Aug 1, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Supporting code for "LLMs for your iPhone: Whole-Tensor 4 Bit Quantization"☆11Mar 31, 2024Updated 2 years ago
- [NIPS2023]Implementation of Foundation Model is Efficient Multimodal Multitask Model Selector☆37Mar 7, 2024Updated 2 years ago
- [ICML 2025] DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization☆21May 24, 2025Updated 11 months ago
- [WSDM'2025] "MixRec: Heterogeneous Graph Collaborative Filtering"☆20Dec 19, 2024Updated last year
- Common template for pytorch project. Easy to extent and modify for new project.☆13Dec 13, 2022Updated 3 years ago
- ☆17Dec 17, 2023Updated 2 years ago
- Official PyTorch implementation of CD-MOE☆12Mar 18, 2026Updated 2 months ago