guqiqi/Samoyeds

Samoyeds: Accelerating MoE Models with Structured Sparsity Leveraging Sparse Tensor Cores (EuroSys'25)

☆16

Alternatives and similar repositories for Samoyeds

Users interested in Samoyeds are comparing it to the libraries listed below.

HPMLL / DTC-SpMM_ASPLOS24
☆47 · Jun 19, 2024 · Updated 2 years ago
spcl / smat
Code for High Performance Unstructured SpMM Computation Using Tensor Cores
☆35 · Nov 3, 2024 · Updated last year
mi150 / VaLoRA
☆11 · May 19, 2025 · Updated last year
xxyux / SpInfer
SpInfer: Leveraging Low-Level Sparsity for Efficient Large Language Model Inference on GPUs
☆68 · Mar 25, 2025 · Updated last year
UDC-GAC / venom
A Vectorized N:M Format for Unleashing the Power of Sparse Tensor Cores
☆62 · Nov 24, 2023 · Updated 2 years ago
wudu98 / autoGEMM
☆16 · Dec 5, 2024 · Updated last year
SuperScientificSoftwareLaboratory / DASP
Source code of the SC '23 paper: "DASP: Specific Dense Matrix Multiply-Accumulate Units Accelerated General Sparse Matrix-Vector Multipli…
☆29 · Jun 18, 2024 · Updated 2 years ago
eth-easl / deltazip
Compression for Foundation Models
☆36 · Jul 21, 2025 · Updated last year
aws-samples / end-2-end-3d-ml
This repository features Amazon SageMaker Ground Truth and explains how to ingest raw 3D point cloud data, label it, train a 3D object de…
☆13 · Jun 23, 2022 · Updated 4 years ago
escalab / RTSpMSpM
☆25 · Apr 13, 2025 · Updated last year
29DCH / AI_ML_DataAnalysis_DataVisualization_Classic-Examples
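Several classic examples of AI, ML, DA, DV, and related topics, including traffic jam simulation (NagelSchreckenberg), the Monte Carlo queuing problem (Monte Carlo Queuing Problem), face recognition (RecognitionFace), and image inference with a genetic algorithm (IconGenetic)
☆10 · Oct 14, 2018 · Updated 7 years ago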
LinkAnonymous / BESA
☆12 · Oct 9, 2023 · Updated 2 years ago
Jiawei888 / FPGA-CNN-Accelerator
The goal of this design is to use the PYNQ-Z2 development board to design a general convolution neural network accelerator. And through r…
☆11 · Sep 30, 2020 · Updated 5 years ago
xxcclong / GNN-Computing
Artifact for PPoPP20 "Understanding and Bridging the Gaps in Current GNN Performance Optimizations"
☆42 · Nov 16, 2021 · Updated 4 years ago
maym86 / ros_lidar_camera
Test of lidar camera calibration with ROS
☆10 · Jun 25, 2019 · Updated 7 years ago
shawnricecake / search-llm
[NeurIPS 2024] Search for Efficient LLMs
☆16 · Jan 16, 2025 · Updated last year
ZhW-loop / UniCoMo
☆13 · Sep 19, 2024 · Updated last year
LKJacky / Differentiable-Model-Scaling
This is the official repo for "Differentiable Model Scaling using Differentiable Topk"
☆12 · May 16, 2024 · Updated 2 years ago
NetX-lab / Ayo
[ASPLOS'25] Towards End-to-End Optimization of LLM-based Applications with Ayo
☆75 · Mar 11, 2026 · Updated 4 months ago
myhsia / litetable
A LaTeX template that provides a beautifully designed class schedule with colorful course blocks.
☆14 · Jul 9, 2026 · Updated 2 weeks ago
tsinghua-ideal / Syno
Source code repository for ASPLOS '25 paper "Syno: Structured Synthesis for Neural Operators"
☆15 · Aug 31, 2025 · Updated 10 months ago
Serkonosand / ParallelProgramming2021
Lab assignments for Zheng Qilong's 2021 Parallel Programming course at USTC (University of Science and Technology of China)
☆11 · Jan 15, 2022 · Updated 4 years ago
PanZaifeng / FastTree-Artifact
☆32 · Mar 24, 2025 · Updated last year
uwsampl / SparseTIR
SparseTIR: Sparse Tensor Compiler for Deep Learning
☆145 · Mar 31, 2023 · Updated 3 years ago
MachineLearningSystem / 25ASPLOS-Medusa
Medusa: Accelerating Serverless LLM Inference with Materialization [ASPLOS'25]
☆12 · Nov 8, 2024 · Updated last year
aws-neuron / neuronx-distributed-training
☆13 · Dec 20, 2025 · Updated 7 months ago
mnicely / nvml_examples
Examples showing how to utilize the NVML library for GPU monitoring
☆29 · May 31, 2022 · Updated 4 years ago
PeterTheSparrow / SE3353-Architecture-of-Enterprise-Applications-2023Autumn-SJTU-notes
Notes for the course Architecture of Enterprise Applications (SE3353), School of Software, Shanghai Jiao Tong University
☆11 · Feb 2, 2024 · Updated 2 years ago
Deep-Learning-Profiling-Tools / fasten
☆14 · Apr 24, 2024 · Updated 2 years ago
zhang677 / PCL-lite
[ICML 2025] Adaptive Self-improvement LLM Agentic System for ML Library Development
☆17 · Jan 6, 2026 · Updated 6 months ago
Hyaloid / AccSpMM
Official implementation of Acc-SpMM: Accelerating General-purpose Sparse Matrix-Matrix Multiplication with GPU Tensor Cores.
☆17 · Nov 13, 2025 · Updated 8 months ago
scai-tech / NeuSight
☆83 · Jun 23, 2025 · Updated last year
Okabe-Rintarou-0 / Shuiyuan-Client
API client for the Shuiyuan community
☆16 · Dec 11, 2023 · Updated 2 years ago
zhaiyi000 / tlp
☆43 · Apr 25, 2024 · Updated 2 years ago
HPMLL / SpInfer_EuroSys25
☆35 · Apr 2, 2025 · Updated last year
YukeWang96 / TC-GNN_ATC23
Artifact for USENIX ATC'23: TC-GNN: Bridging Sparse GNN Computation and Dense Tensor Cores on GPUs.
☆58 · Oct 16, 2023 · Updated 2 years ago
AlibabaResearch / flash-llm
Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity
☆246 · Sep 24, 2023 · Updated 2 years ago
spcl / crosspipe
Official implementation of CrossPipe: Towards Optimal Pipeline Schedules for Cross-Datacenter Training (ATC '25), built on top of Megatro…
☆17 · Jul 6, 2025 · Updated last year
XXXVincent / MonoDepth2
Mono depth on nuscenes dataset
☆20 · Feb 25, 2021 · Updated 5 years ago