swiss-ai / MoEView external linksLinks
some mixture of experts architecture implementations
☆25Mar 22, 2024Updated last year
Alternatives and similar repositories for MoE
Users that are interested in MoE are comparing it to the libraries listed below
Sorting:
- Virtual Adversarial Training (VAT) techniques in PyTorch☆17Jul 19, 2022Updated 3 years ago
- ☆15Apr 26, 2022Updated 3 years ago
- nanoGPT-like codebase for LLM training☆113Nov 7, 2025Updated 3 months ago
- Pytorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models☆28Mar 22, 2024Updated last year
- No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models (ICLR 2022)☆29Feb 9, 2022Updated 4 years ago
- Codebase for fine-tuning Llama2 70B to generate math test questions and answers.☆11Aug 30, 2024Updated last year
- The official repo of continuous speculative decoding☆31Mar 28, 2025Updated 10 months ago
- Repo for paper "CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models".☆12Oct 14, 2024Updated last year
- Automatically take good care of your preemptible TPUs☆37May 15, 2023Updated 2 years ago
- ☆11Dec 23, 2024Updated last year
- Concurrency library☆16Oct 13, 2024Updated last year
- CANdle - a library for using USB-FDCAN dongle and communicating with md80 drives☆14Sep 15, 2025Updated 5 months ago
- ☆10Apr 7, 2024Updated last year
- ☆11Feb 28, 2022Updated 3 years ago
- Python Inference Script(PyIS)☆19Aug 30, 2022Updated 3 years ago
- [AAAI2024] An official pytorch implement of the paper: Vision-Language Pre-training with Object Contrastive Learning for 3D Scene Underst…☆13Dec 8, 2024Updated last year
- Original VinVL visual backbone with simplified APIs to easily extract features, boxes, object detections, in a few lines of Python code.☆11Nov 27, 2022Updated 3 years ago
- Develop C++/CUDA extensions with PyTorch like Python scripts☆10Jan 7, 2026Updated last month
- Models for packages and the resources they contain.☆14Mar 10, 2024Updated last year
- An active inference model of Lacanian psychoanalysis☆15Jun 7, 2025Updated 8 months ago
- Material parsers and other tools, scripts Initially developed for Grobid Superconductor☆13Feb 21, 2025Updated 11 months ago
- ☆11Jan 3, 2024Updated 2 years ago
- [ICLR 2023] ReScore: Boosting Causal Discovery via Adaptive Sample Reweighting☆11Mar 11, 2023Updated 2 years ago
- DICE: Detecting In-distribution Data Contamination with LLM's Internal State☆11Sep 21, 2024Updated last year
- f-PO: Generalizing Preference Optimization with f-divergence Minimization☆13Apr 2, 2025Updated 10 months ago
- ☆13Nov 27, 2025Updated 2 months ago
- This library implements functions and classes for mesh registration, data augmentation, and data normalisation.☆11Oct 7, 2024Updated last year
- [KDD24-ADS] R-Eval: A Unified Toolkit for Evaluating Domain Knowledge of Retrieval Augmented Large Language Models☆11Apr 9, 2024Updated last year
- [NeurIPS 2024] CHASE: Learning Convex Hull Adaptive Shift for Skeleton-based Multi-Entity Action Recognition☆16Nov 12, 2025Updated 3 months ago
- ☆12Dec 20, 2024Updated last year
- ☆15Dec 8, 2024Updated last year
- JSON RPC v2.0 Sans I/O☆11Updated this week
- Paper has been accepted in ACM MM 2024.☆13Jul 4, 2025Updated 7 months ago
- R package for metabolic enzyme enrichment anaylsis☆13Oct 24, 2025Updated 3 months ago
- ☆12Jun 11, 2024Updated last year
- 🧩 Design-Information-Modeling for Kit-of-Parts 🏘️☆16Updated this week
- Interactive, GPU accelerated computation graphs☆12Nov 21, 2024Updated last year
- 3D geoms for plotnine (grammar of graphics in Python)☆12Aug 5, 2022Updated 3 years ago
- Smallest ellipse covering a finite set of points☆14Jan 3, 2025Updated last year