Visualize expert firing frequencies across sentences in the Mixtral MoE model
☆18Dec 22, 2023Updated 2 years ago
Alternatives and similar repositories for mixtral-vis-moe
Users that are interested in mixtral-vis-moe are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- High-performance GEMM implementation optimized for NVIDIA H100 GPUs, leveraging Hopper architecture's TMA, WGMMA, and Thread Block Cluste…☆10Dec 4, 2024Updated last year
- Tutorial Exercises and Code for GPU Communications Tutorial at HOT Interconnects 2025☆31Oct 22, 2025Updated 5 months ago
- ReportParse is a unified NLP analyzer for corporate sustainability reports☆20Sep 18, 2024Updated last year
- The unofficial CLI of Amazon S3 Vectors (Preview) in Rust☆17Jul 19, 2025Updated 9 months ago
- Word acquisition in neural language models (TACL 2022).☆20Jan 30, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆10Oct 26, 2016Updated 9 years ago
- Proof of concept prototype to perform distributed training using BVLC/caffe, based on a parameter server implementation using MPI. Data p…☆13May 7, 2015Updated 10 years ago
- Adaptive Message Quantization and Parallelization for Distributed Full-graph GNN Training☆24Mar 1, 2024Updated 2 years ago
- SGLang is a fast serving framework for large language models and vision language models.☆30Updated this week
- Explore Inter-layer Expert Affinity in MoE Model Inference☆16May 6, 2024Updated last year
- 抓取Here地图的三维建筑物模型☆12Jun 29, 2017Updated 8 years ago
- A simple web server written in Lua☆16Sep 24, 2022Updated 3 years ago
- MEXMA: Token-level objectives improve sentence representations☆43Jan 6, 2025Updated last year
- notes on reading tensorflow source code☆13Aug 18, 2018Updated 7 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- LLM Benchmark☆42May 24, 2025Updated 10 months ago
- ☆31Nov 14, 2024Updated last year
- 监控六个主流数字货币交易所的上币公告:Gate Bybit Bitget KuCoin Binance OKX☆37Aug 6, 2025Updated 8 months ago
- MoE-Visualizer is a tool designed to visualize the selection of experts in Mixture-of-Experts (MoE) models.☆16Apr 8, 2025Updated last year
- A Cloudflare Worker for proxying and caching images, with optional rate limiting and a convenient setup process.☆20Mar 30, 2026Updated 3 weeks ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆32Sep 19, 2025Updated 7 months ago
- An innovative method expediting LLMs via streamlined semi-autoregressive generation and draft verification.☆28Apr 15, 2025Updated last year
- Scala Tutorial☆15Dec 19, 2018Updated 7 years ago
- Highly concurrent and fast content processing for Mighty Inference Server☆10Feb 6, 2023Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Main xCurator monorepo☆10Dec 20, 2023Updated 2 years ago
- ☆11Mar 27, 2023Updated 3 years ago
- Documenting various metrics available for open source projects☆14Jan 4, 2024Updated 2 years ago
- ☆31Apr 14, 2023Updated 3 years ago
- A Redis-compatible in-memory database server written in Rust with MLua-based Lua 5.1 scripting☆18Nov 28, 2025Updated 4 months ago
- A small demo about three & cannon.User can toll the dice by click the dice.☆12Oct 12, 2024Updated last year
- ☆32Jan 16, 2025Updated last year
- JSON Directed Acyclic Graph for IPLD☆23Apr 2, 2026Updated 2 weeks ago
- ☆16Nov 29, 2014Updated 11 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆12Nov 15, 2018Updated 7 years ago
- Python implementation of Medoidshift and Quickshift algorithms☆15Feb 5, 2015Updated 11 years ago
- ☆19Jan 4, 2026Updated 3 months ago
- ☆13Jul 23, 2024Updated last year
- pixel-mosaic converts images into pixel art and preserves features while downscaling☆29Mar 7, 2026Updated last month
- Analysis related to article on FOIA Online Database.☆11Feb 2, 2017Updated 9 years ago
- Datasette plugin for streaming SQLite database backups to S3, using Litestream!☆19Jan 20, 2026Updated 3 months ago