PyTorch implementation of Mixture of Experts (MoE)
☆53Feb 11, 2021Updated 5 years ago
Alternatives and similar repositories for Pytorch_mixture-of-experts
Users that are interested in Pytorch_mixture-of-experts are comparing it to the libraries listed below.
- AI-based search done right☆20Dec 25, 2025Updated 5 months ago
- PyTorch implementation of GLOM☆23Apr 1, 2022Updated 4 years ago
- An HFT market simulation using high-speed, efficient C++ and concurrent/parallel programming☆19Nov 3, 2023Updated 2 years ago
- PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538☆1,245Apr 19, 2024Updated 2 years ago
- PyTorch implementation of LIMoE☆52Apr 1, 2024Updated 2 years ago
- PyTorch implementation of Soft MoE by Google Brain in "From Sparse to Soft Mixtures of Experts" (https://arxiv.org/pdf/2308.00951.pdf)☆83Oct 5, 2023Updated 2 years ago
- Code to support my Master's thesis☆22Sep 10, 2023Updated 2 years ago
- This is a simple torch implementation of the high performance Multi-Query Attention☆16Aug 23, 2023Updated 2 years ago
- A collection of AWESOME things about mixture-of-experts☆1,280Dec 8, 2024Updated last year
- Awesome Automatic Speech Recognition (ASR) paper collection☆22Sep 4, 2020Updated 5 years ago
- Zero-shot evaluation on LexGLUE tasks with GPT-3.5☆29Mar 11, 2023Updated 3 years ago
- ☆30Aug 7, 2022Updated 3 years ago
- Spatial Spectral Machine Learning☆14Oct 15, 2025Updated 7 months ago
- Calculating FLOPs of Pre-trained Models in NLP☆18Mar 29, 2021Updated 5 years ago
- Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta☆15Nov 11, 2024Updated last year
- qup: a Single-Node Job Scheduler with NVIDIA GPU support☆18Jan 10, 2023Updated 3 years ago
- ☆20Oct 19, 2023Updated 2 years ago
- ☆22Aug 27, 2023Updated 2 years ago
- ChatGPT solutions for the MLE interview☆14Dec 9, 2022Updated 3 years ago
- Inverse Scaling in Test-Time Compute☆25Dec 3, 2025Updated 6 months ago
- ☆10Jul 12, 2019Updated 6 years ago
- Seq2seq using LSTM with attention from Luong et al☆10Oct 2, 2018Updated 7 years ago
- ☆39Nov 28, 2024Updated last year
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- Code for the DataPipes article☆15Jun 14, 2022Updated 3 years ago
- A pipeline that uses API calls to convert unstructured data into structured training data, agnostic of the source☆32Sep 22, 2024Updated last year
- Extended Annotations of DeepFashion Images for Fine-grained Recognition☆14May 28, 2019Updated 7 years ago
- Joint Modelling Histology and Molecular Markers for Glioma Classification☆12Jun 4, 2025Updated last year
- Code for generating noisy speech data from clean data using deep learning methods☆16Jul 12, 2021Updated 4 years ago
- Examples of using Grobid in a third-party Java project.☆20Jun 14, 2023Updated 2 years ago
- ☆95Apr 3, 2023Updated 3 years ago
- The official evaluation suite and dynamic data release for MixEval.☆11Sep 23, 2024Updated last year
- AC automation on Redis.☆39Jun 14, 2020Updated 5 years ago
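- A roadmap for machine learning, deep learning, generative adversarial networks (GAN), graph neural networks (GNN), NLP, and big data, with extensive source code (Python, PyTorch) to help readers absorb the fundamentals, get through interviews, and go from beginner to qualified…☆10Feb 25, 2020Updated 6 years ago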
- Official PyTorch implementation of CD-MOE☆12Mar 18, 2026Updated 2 months ago
- Simple wrapper for Erlang logger formatters that adds colours to the messages☆15Aug 11, 2020Updated 5 years ago
- The History of Speech Recognition to the Year 2030☆13Aug 14, 2021Updated 4 years ago
- An ahead-of-time compiler from Erlang (intermediate language) to LLVM IR and a runtime library for linking against it☆15Sep 18, 2018Updated 7 years ago
- Rebar3 plugin to build Rust crates (unmaintained). See https://github.com/filmor/rebar3_rust/tree/update for more recent work.☆10Sep 26, 2018Updated 7 years ago