some mixture of experts architecture implementations
☆26Mar 22, 2024Updated last year
Alternatives and similar repositories for MoE
Users that are interested in MoE are comparing it to the libraries listed below
Sorting:
- Virtual Adversarial Training (VAT) techniques in PyTorch☆17Jul 19, 2022Updated 3 years ago
- ☆15Apr 26, 2022Updated 3 years ago
- [ICCV 2025] FiVE-Bench: A Fine-grained Video Editing Benchmark for Evaluating Emerging Diffusion and Rectified Flow Models☆19Aug 26, 2025Updated 6 months ago
- nanoGPT-like codebase for LLM training☆116Nov 7, 2025Updated 4 months ago
- Pytorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models☆28Mar 22, 2024Updated last year
- No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models (ICLR 2022)☆29Feb 9, 2022Updated 4 years ago
- Codebase for fine-tuning Llama2 70B to generate math test questions and answers.☆11Aug 30, 2024Updated last year
- Optimize GEMM with tensorcore step by step☆36Dec 17, 2023Updated 2 years ago
- The official repo of continuous speculative decoding☆31Mar 28, 2025Updated 11 months ago
- Concurrency library☆17Oct 13, 2024Updated last year
- Repo for paper "CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models".☆12Oct 14, 2024Updated last year
- Automatically take good care of your preemptible TPUs☆37May 15, 2023Updated 2 years ago
- ☆11Dec 23, 2024Updated last year
- Original VinVL visual backbone with simplified APIs to easily extract features, boxes, object detections, in a few lines of Python code.☆11Nov 27, 2022Updated 3 years ago
- CANdle - a library for using USB-FDCAN dongle and communicating with md80 drives☆15Sep 15, 2025Updated 5 months ago
- Develop C++/CUDA extensions with PyTorch like Python scripts☆10Updated this week
- An active inference model of Lacanian psychoanalysis☆15Jun 7, 2025Updated 9 months ago
- ☆11Feb 28, 2022Updated 4 years ago
- Material parsers and other tools, scripts Initially developed for Grobid Superconductor☆13Feb 21, 2025Updated last year
- Python Inference Script(PyIS)☆19Aug 30, 2022Updated 3 years ago
- ☆10Apr 7, 2024Updated last year
- [AAAI2024] An official pytorch implement of the paper: Vision-Language Pre-training with Object Contrastive Learning for 3D Scene Underst…☆13Dec 8, 2024Updated last year
- Models for packages and the resources they contain.☆14Mar 10, 2024Updated last year
- TiC: Exploring Vision Transformer in Convolution☆11Oct 24, 2023Updated 2 years ago
- CRISPR, faster, better – The Crackling method for whole-genome target detection☆10Jan 11, 2024Updated 2 years ago
- R package for metabolic enzyme enrichment anaylsis☆13Oct 24, 2025Updated 4 months ago
- Label shift estimation for transfer difficulty with Familiarity.☆10Feb 4, 2025Updated last year
- Position Coupling: Improving Length Generalization of Arithmetic Transformers Using Task Structure (NeurIPS 2024) + Arithmetic Transfor…☆14Oct 26, 2025Updated 4 months ago
- SSL Video Representation Learning project☆14Jul 8, 2025Updated 7 months ago
- EoRA: Fine-tuning-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation☆27Jul 30, 2025Updated 7 months ago
- ☆11Apr 6, 2024Updated last year
- ☆11Jan 3, 2024Updated 2 years ago
- A dependency injection library for python, aimed for the least amount of magic.☆12Feb 23, 2022Updated 4 years ago
- ☆13Nov 27, 2025Updated 3 months ago
- This repo is for CaesarNeRF: Calibrated Semantic Representation for Few-Shot Generalizable Neural Rendering.☆14Mar 6, 2024Updated 2 years ago
- Code for EMNLP 2023 paper: DALE: Generative Data Augmentation for Low-Resource Legal NLP☆10Oct 27, 2023Updated 2 years ago
- ☆11Jan 19, 2025Updated last year
- ☆13Jun 11, 2024Updated last year
- Artifact for TOSEM Submission: GiantRepair☆13Jun 26, 2024Updated last year