antonio-f/mixture-of-experts-from-scratch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/antonio-f/mixture-of-experts-from-scratch)

antonio-f / mixture-of-experts-from-scratch

Mixture of Experts from scratch

☆13

Alternatives and similar repositories for mixture-of-experts-from-scratch

Users that are interested in mixture-of-experts-from-scratch are comparing it to the libraries listed below

Sorting:

rycroft-group / math714
View on GitHub
UW–Madison Math/CS 714 code examples
☆13Dec 4, 2025Updated 3 months ago
AIVIETNAM-Hub / Hybrid-Unified-and-Iterative-A-Novel-Framework-for-Text-based-Person-Anomaly-Retrieval
View on GitHub
☆13Jul 23, 2025Updated 7 months ago
zapplyjobs / New-Grad-Data-Science-Jobs-2026
View on GitHub
Updated every 15 min — data science & ML jobs for new grads | FAANG & startups | 2026
☆30Updated this week
lburakakca / Self-driving-car
View on GitHub
☆14Apr 7, 2025Updated 11 months ago
Atulit23 / ai_from_scratch
View on GitHub
☆16Jul 7, 2025Updated 8 months ago
erthorpabar / pytorch-DecoderOnly-model
View on GitHub
gpt from 0 -> 1
☆11Oct 9, 2025Updated 5 months ago
mathspp / building-a-python-compiler-and-interpreter
View on GitHub
Code for the article series on building a Python compiler and interpreter
☆11Feb 13, 2025Updated last year
SnowCharmQ / DPL
View on GitHub
[2025 ACL Findings] Measuring What Makes You Unique: Difference-Aware User Modeling for Enhancing LLM Personalization
☆25Oct 29, 2025Updated 4 months ago
MaximGudkov / book-bot
View on GitHub
Python Book Telegram Bot
☆11Mar 4, 2024Updated 2 years ago
ola-krutrim / Chitrarth
View on GitHub
Chitrarth: Bridging Vision and Language for a Billion People
☆13Feb 12, 2025Updated last year
barneyhill / minBERT
View on GitHub
A minimal PyTorch implementation of BERT (Bidirectional Encoder Representations from Transformers)
☆12Mar 20, 2023Updated 2 years ago
PythonOT / OTML_course_2022
View on GitHub
☆10Jun 22, 2022Updated 3 years ago
Xiaohao-Liu / Bundle-MLLM
View on GitHub
[KDD 2025] The implementation of "Fine-tuning Multimodal Large Language Models for Product Bundling", KDD'25
☆15Sep 20, 2025Updated 5 months ago
ApurbaSengupta / Super-Resolution-Images
View on GitHub
Generating super-resolution images using GANs
☆11Mar 29, 2020Updated 5 years ago
lilingxi01 / nougat-replication
View on GitHub
A full codebase for replicating the results of Nougat from downloading arXiv dataset to the final evaluation. It also contains a few fixe…
☆11Dec 11, 2023Updated 2 years ago
eltociear / MolCA
View on GitHub
Code for EMNLP2023 paper "MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter".
☆12Dec 27, 2023Updated 2 years ago
schulzch / fuzzy_dbscan
View on GitHub
An implementation of the FuzzyDBSCAN algorithm.
☆13Nov 6, 2022Updated 3 years ago
idoatad / TensorLens
View on GitHub
☆43Jan 27, 2026Updated last month
Skyfallk / 2024_processing_and_generating_images_course
View on GitHub
Версия курса для магистрантов AI TalantedHub в ИТМО за 2024.
☆11Oct 14, 2025Updated 4 months ago
liyunfei0411 / labelimg-master
View on GitHub
☆14Jul 6, 2022Updated 3 years ago
kane50613 / github-as-a-blog
View on GitHub
GaaB: Effortless blogging with GitHub issues and Next.js
☆12Jun 23, 2024Updated last year
chengkai-liu / FlowCF
View on GitHub
[KDD'25] Flow Matching for Collaborative Filtering
☆20Sep 6, 2025Updated 6 months ago
KillerShoaib / DeepSeek-r1-Bangla-Reasoning-Data
View on GitHub
This is a side project where me and my friend try to generate synthetic data in bangla from deepseek-r1. So that can be used for model di…
☆11Jun 28, 2025Updated 8 months ago
SaihanTaki / Torchmate
View on GitHub
A High level PyTorch Training and Utility Library
☆12Sep 16, 2025Updated 5 months ago
Aisuko / generative-ai
View on GitHub
The notebooks for generative AI by using PyTorch, Huggingface/diffusers, transforms. And the implementing of the algorithms in paper
☆16Jan 26, 2026Updated last month
Ranking-VMR / SPR
View on GitHub
☆12Jan 10, 2025Updated last year
qianlima-lab / MISSL
View on GitHub
The official pytorch implementation of our proposed model MISSL (ICDE-24).
☆13Dec 8, 2023Updated 2 years ago
FredGoo / langchain-chinese-chat-models
View on GitHub
为langchain添加chatGLM-130B，星火大模型的chat models
☆13Jun 30, 2023Updated 2 years ago
WangXFng / ELMRec
View on GitHub
[EMNLP 2024] Enhancing High-order Interaction Awareness in LLM-based Recommender Model.
☆13Jan 9, 2025Updated last year
kyegomez / MGQA
View on GitHub
The open source implementation of the multi grouped query attention by the paper "GQA: Training Generalized Multi-Query Transformer Model…
☆15Dec 11, 2023Updated 2 years ago
VanillaCreamer / CoRA
View on GitHub
Official source code for AAAI 2025 paper: CoRA: Collaborative Information Perception by Large Language Model's Weights for Recommendatio…
☆17Dec 11, 2024Updated last year
Yuhan1i / G-Refer
View on GitHub
G-Refer: Graph Retrieval-Augmented Large Language Model for Explainable Recommendation
☆21Mar 5, 2025Updated last year
skleee / GRAM
View on GitHub
This is the official code for the ACL 2025 paper "GRAM: Generative Recommendation via Semantic-aware Multi-granular Late Fusion".
☆27Aug 30, 2025Updated 6 months ago
dohuyduc2002 / Credit-scoring-system-with-Kubeflow
View on GitHub
MLOps platform powered by Kubeflow
☆26Updated this week
aniket-mish / cuda
View on GitHub
everything i know about cuda and triton
☆13Jan 28, 2025Updated last year
daveebbelaar / single-file-ai-agent-tutorial
View on GitHub
Python version of "How to Build an Agent" by Thorsten Ball
☆39Oct 30, 2025Updated 4 months ago
saiful9379 / pybangla
View on GitHub
Text Normalizer module use for Bangla as well as English digit convert to textual format, Normalize Date and Extract Date
☆14Feb 25, 2026Updated last week
FareedKhan-dev / save-llm-api-cost
View on GitHub
A straightforward method to reduce your LLM inference API costs and token usage.
☆21May 18, 2025Updated 9 months ago
asiff00 / Training-TTS
View on GitHub
Train and finutune text-to-speech models for Bengali and many other languages!
☆18Apr 2, 2025Updated 11 months ago