antonio-f / mixture-of-experts-from-scratchView external linksLinks
Mixture of Experts from scratch
☆13Apr 12, 2024Updated last year
Alternatives and similar repositories for mixture-of-experts-from-scratch
Users that are interested in mixture-of-experts-from-scratch are comparing it to the libraries listed below
Sorting:
- UW–Madison Math/CS 714 code examples☆13Dec 4, 2025Updated 2 months ago
- ☆13Jul 23, 2025Updated 6 months ago
- ☆14Apr 7, 2025Updated 10 months ago
- A minimal PyTorch implementation of BERT (Bidirectional Encoder Representations from Transformers)☆11Mar 20, 2023Updated 2 years ago
- [2025 ACL Findings] Measuring What Makes You Unique: Difference-Aware User Modeling for Enhancing LLM Personalization☆25Oct 29, 2025Updated 3 months ago
- Chitrarth: Bridging Vision and Language for a Billion People☆13Feb 12, 2025Updated last year
- ☆16Jul 7, 2025Updated 7 months ago
- Code for the article series on building a Python compiler and interpreter☆11Feb 13, 2025Updated last year
- gpt from 0 -> 1☆11Oct 9, 2025Updated 4 months ago
- Python Book Telegram Bot☆11Mar 4, 2024Updated last year
- A full codebase for replicating the results of Nougat from downloading arXiv dataset to the final evaluation. It also contains a few fixe…☆11Dec 11, 2023Updated 2 years ago
- [KDD 2025] The implementation of "Fine-tuning Multimodal Large Language Models for Product Bundling", KDD'25☆15Sep 20, 2025Updated 4 months ago
- ☆42Jan 27, 2026Updated 3 weeks ago
- Версия курса для магистрантов AI TalantedHub в ИТМО за 2024.☆11Oct 14, 2025Updated 4 months ago
- ☆14Jul 6, 2022Updated 3 years ago
- ☆10Jun 22, 2022Updated 3 years ago
- An implementation of the FuzzyDBSCAN algorithm.☆13Nov 6, 2022Updated 3 years ago
- Generating super-resolution images using GANs☆11Mar 29, 2020Updated 5 years ago
- Code for EMNLP2023 paper "MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter".☆12Dec 27, 2023Updated 2 years ago
- GaaB: Effortless blogging with GitHub issues and Next.js☆12Jun 23, 2024Updated last year
- 为langchain添加chatGLM-130B,星火大模型的chat models☆13Jun 30, 2023Updated 2 years ago
- The notebooks for generative AI by using PyTorch, Huggingface/diffusers, transforms. And the implementing of the algorithms in paper☆16Jan 26, 2026Updated 3 weeks ago
- This is a side project where me and my friend try to generate synthetic data in bangla from deepseek-r1. So that can be used for model di…☆11Jun 28, 2025Updated 7 months ago
- [KDD'25] Flow Matching for Collaborative Filtering☆19Sep 6, 2025Updated 5 months ago
- G-Refer: Graph Retrieval-Augmented Large Language Model for Explainable Recommendation☆20Mar 5, 2025Updated 11 months ago
- MLOps platform powered by Kubeflow☆26Sep 23, 2025Updated 4 months ago
- The official pytorch implementation of our proposed model MISSL (ICDE-24).☆13Dec 8, 2023Updated 2 years ago
- [NeurIPS 2025 Spotlight] Implementation of "KLASS: KL-Guided Fast Inference in Masked Diffusion Models"☆22Jan 3, 2026Updated last month
- The open source implementation of the multi grouped query attention by the paper "GQA: Training Generalized Multi-Query Transformer Model…☆15Dec 11, 2023Updated 2 years ago
- This is the official code for the ACL 2025 paper "GRAM: Generative Recommendation via Semantic-aware Multi-granular Late Fusion".☆27Aug 30, 2025Updated 5 months ago
- A High level PyTorch Training and Utility Library☆12Sep 16, 2025Updated 5 months ago
- Official source code for AAAI 2025 paper: CoRA: Collaborative Information Perception by Large Language Model's Weights for Recommendatio…☆17Dec 11, 2024Updated last year
- CLI tools for managing a second brain + Lobster workflow pipelines☆33Feb 9, 2026Updated last week
- ☆12Jan 10, 2025Updated last year
- [EMNLP 2024] Enhancing High-order Interaction Awareness in LLM-based Recommender Model.☆13Jan 9, 2025Updated last year
- everything i know about cuda and triton☆13Jan 28, 2025Updated last year
- ☆15Jun 10, 2024Updated last year
- A Hybrid Self-Cross Attention Network For Remote Sensing Change Detection☆14May 13, 2025Updated 9 months ago
- Text Normalizer module use for Bangla as well as English digit convert to textual format, Normalize Date and Extract Date☆14Dec 31, 2025Updated last month