Visualize expert firing frequencies across sentences in the Mixtral MoE model
☆18Dec 22, 2023Updated 2 years ago
Alternatives and similar repositories for mixtral-vis-moe
Users that are interested in mixtral-vis-moe are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- GlotEval: a unified evaluation toolkit designed to benchmark multilingual Large Language Models (LLMs) in a language-specific way☆18Nov 4, 2025Updated 7 months ago
- LOLA: Large and Open Source Multilingual Language Model☆11Apr 8, 2026Updated 2 months ago
- Tutorial Exercises and Code for GPU Communications Tutorial at HOT Interconnects 2025☆32Oct 22, 2025Updated 8 months ago
- A minimal proof-of-concept for a Vite backend integration with Flask.☆18Sep 11, 2024Updated last year
- ReportParse is a unified NLP analyzer for corporate sustainability reports☆21Sep 18, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Code Repository for the NeurIPS 2024 Paper "Toward Efficient Inference for Mixture of Experts".☆19Oct 30, 2024Updated last year
- Word acquisition in neural language models (TACL 2022).☆21Jan 30, 2025Updated last year
- Official repository for FLAME-MoE: A Transparent End-to-End Research Platform for Mixture-of-Experts Language Models☆41Sep 19, 2025Updated 9 months ago
- Adaptive Message Quantization and Parallelization for Distributed Full-graph GNN Training☆24Mar 1, 2024Updated 2 years ago
- Test and benchmark your Rust library on mobile devices with ease.☆13Jul 17, 2023Updated 2 years ago
- Explore Inter-layer Expert Affinity in MoE Model Inference☆16May 6, 2024Updated 2 years ago
- A simple web server written in Lua☆17Sep 24, 2022Updated 3 years ago
- MEXMA: Token-level objectives improve sentence representations☆43Jan 6, 2025Updated last year
- [Suspended] Modern, customizable AI character frontend for enthusiasts (inspired by SillyTavern)☆10Nov 8, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Simplify Google Gemini 1.5 Pro's authentication☆15Apr 11, 2024Updated 2 years ago
- ☆13Apr 17, 2024Updated 2 years ago
- SGLang is a fast serving framework for large language models and vision language models.☆32Updated this week
- Sample Codes using NVSHMEM on Multi-GPU☆30Jan 22, 2023Updated 3 years ago
- ☆32Nov 14, 2024Updated last year
- MoE-Visualizer is a tool designed to visualize the selection of experts in Mixture-of-Experts (MoE) models.☆16Apr 8, 2025Updated last year
- Official website for the TRON (Token Reduced Object Notation) format☆43Nov 29, 2025Updated 6 months ago
- Coord: A Unified Interface for All Models☆18Jun 19, 2026Updated last week
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆32Sep 19, 2025Updated 9 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A Mac OS menubar application that allows drag-and-drop file uploading to an S3 bucket with a presigned URL copied to the clipboard.☆20Nov 12, 2021Updated 4 years ago
- 南京大学本科毕业论文 Word 模板☆12May 14, 2020Updated 6 years ago
- Lightweight tool to identify Data Contamination in LLMs evaluation☆52Mar 8, 2024Updated 2 years ago
- ☆32Jan 16, 2025Updated last year
- ☆92Apr 2, 2022Updated 4 years ago
- Demo of fine-tuning QA models for answering FAQ of cloud providers documentation☆11Jun 20, 2026Updated last week
- [ICLR 2025] 🚀 CodeMMLU Evaluator: A framework for evaluating LM models on CodeMMLU MCQs benchmark.☆30Apr 21, 2025Updated last year
- Automates the creation of full-text (sound and text) ebooks in epub/epub3/daisy format, the webserver/client creates smil files to sync a…☆10Nov 12, 2021Updated 4 years ago
- applications of https://github.com/PrefectHQ/marvin☆13Jan 15, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆20Jan 4, 2026Updated 5 months ago
- A swarm of LLM agents that will help you test, document, and productionize your code!☆19Jun 20, 2026Updated last week
- Data and experiments with world population densities for comparison to addresses☆12Mar 15, 2017Updated 9 years ago
- ☆13Jul 23, 2024Updated last year
- Analysis related to article on FOIA Online Database.☆11Feb 2, 2017Updated 9 years ago
- Datasette plugin for streaming SQLite database backups to S3, using Litestream!☆20Jan 20, 2026Updated 5 months ago
- 2023 한국어 AI 경진대회☆10Oct 30, 2023Updated 2 years ago