Visualize expert firing frequencies across sentences in the Mixtral MoE model
☆18Dec 22, 2023Updated 2 years ago
Alternatives and similar repositories for mixtral-vis-moe
Users that are interested in mixtral-vis-moe are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- GlotEval: a unified evaluation toolkit designed to benchmark multilingual Large Language Models (LLMs) in a language-specific way☆18Nov 4, 2025Updated 6 months ago
- LOLA: Large and Open Source Multilingual Language Model☆11Apr 8, 2026Updated last month
- High-performance GEMM implementation optimized for NVIDIA H100 GPUs, leveraging Hopper architecture's TMA, WGMMA, and Thread Block Cluste…☆10Dec 4, 2024Updated last year
- Tutorial Exercises and Code for GPU Communications Tutorial at HOT Interconnects 2025☆32Oct 22, 2025Updated 6 months ago
- A minimal proof-of-concept for a Vite backend integration with Flask.☆18Sep 11, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ReportParse is a unified NLP analyzer for corporate sustainability reports☆21Sep 18, 2024Updated last year
- Code Repository for the NeurIPS 2024 Paper "Toward Efficient Inference for Mixture of Experts".☆19Oct 30, 2024Updated last year
- Research on the usage of Jupyter notebooks☆19Sep 12, 2019Updated 6 years ago
- The unofficial CLI of Amazon S3 Vectors (Preview) in Rust☆17Jul 19, 2025Updated 10 months ago
- Word acquisition in neural language models (TACL 2022).☆21Jan 30, 2025Updated last year
- Official repository for FLAME-MoE: A Transparent End-to-End Research Platform for Mixture-of-Experts Language Models☆38Sep 19, 2025Updated 8 months ago
- Klimatkollen's data pipeline and API for processing company sustainability reports☆23Updated this week
- Test and benchmark your Rust library on mobile devices with ease.☆13Jul 17, 2023Updated 2 years ago
- Explore Inter-layer Expert Affinity in MoE Model Inference☆16May 6, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A simple web server written in Lua☆16Sep 24, 2022Updated 3 years ago
- MEXMA: Token-level objectives improve sentence representations☆43Jan 6, 2025Updated last year
- Simplify Google Gemini 1.5 Pro's authentication☆15Apr 11, 2024Updated 2 years ago
- ☆13Apr 17, 2024Updated 2 years ago
- SGLang is a fast serving framework for large language models and vision language models.☆32Updated this week
- Sample Codes using NVSHMEM on Multi-GPU☆30Jan 22, 2023Updated 3 years ago
- Build A Simple Web App With Sveltekit and Appwrite☆11Apr 3, 2023Updated 3 years ago
- LLM Benchmark☆43May 24, 2025Updated 11 months ago
- ☆31Nov 14, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Fetches transcripts from YouTube videos, including private ones with granted access, and optionally downloads the videos. Does not suppor…☆17Apr 17, 2024Updated 2 years ago
- Official website for the TRON (Token Reduced Object Notation) format☆39Nov 29, 2025Updated 5 months ago
- Coord: A Unified Interface for All Models☆18Feb 2, 2026Updated 3 months ago
- A Cloudflare Worker for proxying and caching images, with optional rate limiting and a convenient setup process.☆21Mar 30, 2026Updated last month
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆32Sep 19, 2025Updated 8 months ago
- An innovative method expediting LLMs via streamlined semi-autoregressive generation and draft verification.☆28Apr 15, 2025Updated last year
- A Mac OS menubar application that allows drag-and-drop file uploading to an S3 bucket with a presigned URL copied to the clipboard.☆20Nov 12, 2021Updated 4 years ago
- Highly concurrent and fast content processing for Mighty Inference Server☆10Feb 6, 2023Updated 3 years ago
- Documenting various metrics available for open source projects☆14Jan 4, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A Redis-compatible in-memory database server written in Rust with MLua-based Lua 5.1 scripting☆18Nov 28, 2025Updated 5 months ago
- ☆32Jan 16, 2025Updated last year
- Undergraduate Course Explorer for KAIST☆14Feb 28, 2023Updated 3 years ago
- An AI regulatory assistant to pre-check your documentation before FDA or MDR submission.☆13Jul 31, 2024Updated last year
- Demo of fine-tuning QA models for answering FAQ of cloud providers documentation☆11Mar 7, 2023Updated 3 years ago
- [ICLR 2025] 🚀 CodeMMLU Evaluator: A framework for evaluating LM models on CodeMMLU MCQs benchmark.☆29Apr 21, 2025Updated last year
- applications of https://github.com/PrefectHQ/marvin☆13Jan 15, 2024Updated 2 years ago