Mixture of Experts from scratch
☆13Apr 12, 2024Updated last year
Alternatives and similar repositories for mixture-of-experts-from-scratch
Users that are interested in mixture-of-experts-from-scratch are comparing it to the libraries listed below
- UW–Madison Math/CS 714 code examples☆13Dec 4, 2025Updated 3 months ago
- ☆13Jul 23, 2025Updated 7 months ago
- Updated every 15 min — data science & ML jobs for new grads | FAANG & startups | 2026☆30Updated this week
- ☆14Apr 7, 2025Updated 11 months ago
- ☆16Jul 7, 2025Updated 8 months ago
- gpt from 0 -> 1☆11Oct 9, 2025Updated 5 months ago
- Code for the article series on building a Python compiler and interpreter☆11Feb 13, 2025Updated last year
- [2025 ACL Findings] Measuring What Makes You Unique: Difference-Aware User Modeling for Enhancing LLM Personalization☆25Oct 29, 2025Updated 4 months ago
- Python Book Telegram Bot☆11Mar 4, 2024Updated 2 years ago
- Chitrarth: Bridging Vision and Language for a Billion People☆13Feb 12, 2025Updated last year
- A minimal PyTorch implementation of BERT (Bidirectional Encoder Representations from Transformers)☆12Mar 20, 2023Updated 2 years ago
- ☆10Jun 22, 2022Updated 3 years ago
- [KDD 2025] The implementation of "Fine-tuning Multimodal Large Language Models for Product Bundling", KDD'25☆15Sep 20, 2025Updated 5 months ago
- Generating super-resolution images using GANs☆11Mar 29, 2020Updated 5 years ago
- A full codebase for replicating the results of Nougat from downloading arXiv dataset to the final evaluation. It also contains a few fixe…☆11Dec 11, 2023Updated 2 years ago
- Code for EMNLP2023 paper "MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter".☆12Dec 27, 2023Updated 2 years ago
- An implementation of the FuzzyDBSCAN algorithm.☆13Nov 6, 2022Updated 3 years ago
- ☆43Jan 27, 2026Updated last month
- 2024 version of the course for master's students at AI TalantedHub, ITMO.☆11Oct 14, 2025Updated 4 months ago
- ☆14Jul 6, 2022Updated 3 years ago
- GaaB: Effortless blogging with GitHub issues and Next.js☆12Jun 23, 2024Updated last year
- [KDD'25] Flow Matching for Collaborative Filtering☆20Sep 6, 2025Updated 6 months ago
- This is a side project where my friend and I try to generate synthetic data in Bangla from deepseek-r1, so that it can be used for model di…☆11Jun 28, 2025Updated 8 months ago
- A High level PyTorch Training and Utility Library☆12Sep 16, 2025Updated 5 months ago
- Notebooks for generative AI using PyTorch, Huggingface/diffusers, transforms, and implementations of the algorithms from papers☆16Jan 26, 2026Updated last month
- ☆12Jan 10, 2025Updated last year
- The official pytorch implementation of our proposed model MISSL (ICDE-24).☆13Dec 8, 2023Updated 2 years ago
- Adds chat models for chatGLM-130B and the Spark (星火) large model to langchain☆13Jun 30, 2023Updated 2 years ago
- [EMNLP 2024] Enhancing High-order Interaction Awareness in LLM-based Recommender Model.☆13Jan 9, 2025Updated last year
- The open source implementation of the multi grouped query attention by the paper "GQA: Training Generalized Multi-Query Transformer Model…☆15Dec 11, 2023Updated 2 years ago
- Official source code for AAAI 2025 paper: CoRA: Collaborative Information Perception by Large Language Model's Weights for Recommendatio…☆17Dec 11, 2024Updated last year
- G-Refer: Graph Retrieval-Augmented Large Language Model for Explainable Recommendation☆21Mar 5, 2025Updated last year
- This is the official code for the ACL 2025 paper "GRAM: Generative Recommendation via Semantic-aware Multi-granular Late Fusion".☆27Aug 30, 2025Updated 6 months ago
- MLOps platform powered by Kubeflow☆26Updated this week
- everything i know about cuda and triton☆13Jan 28, 2025Updated last year
- Python version of "How to Build an Agent" by Thorsten Ball☆39Oct 30, 2025Updated 4 months ago
- Text normalizer module for Bangla and English: converts digits to textual format, normalizes dates, and extracts dates☆14Feb 25, 2026Updated last week
- A straightforward method to reduce your LLM inference API costs and token usage.☆21May 18, 2025Updated 9 months ago
- Train and fine-tune text-to-speech models for Bengali and many other languages!☆18Apr 2, 2025Updated 11 months ago