GetUpEarlier/minit

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/GetUpEarlier/minit)

GetUpEarlier / minit

☆26

Alternatives and similar repositories for minit

Users that are interested in minit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

dcaox / MIT6.5940
View on GitHub
模型加速/模型压缩（已完成所有Lab）
☆11Dec 24, 2023Updated 2 years ago
HLRJ / Cpu0_For_LLVM17
View on GitHub
给llvm17.0.6添加一个新后端Cpu0
☆12Apr 22, 2024Updated 2 years ago
zhangkai0425 / SGEMM-HPC
View on GitHub
Implementation and optimization of matrix multiplication on single CPU (HPC-THU-2023-Autumn)
☆18Feb 27, 2024Updated 2 years ago
xiaoyu1998 / llvm-cpu0
View on GitHub
LLVM Backend tutorial Cpu0
☆26Nov 5, 2023Updated 2 years ago
chips-compilers-mlsys-21 / chips-compilers-mlsys-21.github.io
View on GitHub
☆11Apr 5, 2021Updated 5 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
TiledTensor / TiledCUDA
View on GitHub
We invite you to visit and follow our new repository at https://github.com/microsoft/TileFusion. TiledCUDA is a highly efficient kernel …
☆192Jan 28, 2025Updated last year
awslabs / optimizing-multitask-training-through-dynamic-pipelines
View on GitHub
Official repository for the paper DynaPipe: Optimizing Multi-task Training through Dynamic Pipelines
☆19Dec 8, 2023Updated 2 years ago
LLMServe / hydraserve
View on GitHub
☆20May 11, 2026Updated 2 months ago
chuyiyao / Q-MAT
View on GitHub
☆12Feb 7, 2018Updated 8 years ago
spcl / crosspipe
View on GitHub
Official implementation of CrossPipe: Towards Optimal Pipeline Schedules for Cross-Datacenter Training (ATC '25), built on top of Megatro…
☆17Jul 6, 2025Updated last year
rinsa318 / normal2depth
View on GitHub
Estimate depth from surface normal.
☆12Aug 14, 2020Updated 5 years ago
yuantangliang / softmaxfocalloss
View on GitHub
the loss function in Aritcal ‘Focal Loss for Dense Object Detection‘’
☆17Sep 20, 2017Updated 8 years ago
AlexReimann / depth_calibration
View on GitHub
Calibration of depth sensors, e.g. Kinect, Asus Xtion
☆13Apr 26, 2019Updated 7 years ago
adamgallas / MIT_Bluespec_RISCV_Tutorial
View on GitHub
☆27Jan 22, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ChandlerGuan / mercury_artifact
View on GitHub
☆27Oct 1, 2025Updated 9 months ago
vdcores / vdcores
View on GitHub
Virtual Decoupled Cores: Composable Programming Framework and Runtime for Async GPUs
☆19Updated this week
caiwanxianhust / FasterLLaMA
View on GitHub
使用 CUDA C++ 实现的 llama 模型推理框架
☆64Nov 8, 2024Updated last year
alexshuang / write-your-own-ai-compiler
View on GitHub
《自己动手写AI编译器》
☆40Oct 19, 2024Updated last year
Gyumeijie / an-embedded-c-interpreter
View on GitHub
a very simple interpreter for c, inspired by c4, but it is embedded
☆12Jul 6, 2018Updated 8 years ago
Qwesh157 / conv_op_optimization
View on GitHub
This project is about convolution operator optimization on GPU, include GEMM based (Implicit GEMM) convolution.
☆44Sep 29, 2025Updated 9 months ago
eth-cscs / Tiled-MM
View on GitHub
Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.
☆33Apr 2, 2025Updated last year
zhiqwang / shufaCV
View on GitHub
☆26May 22, 2023Updated 3 years ago
Bruce-Lee-LY / decoding_attention
View on GitHub
Decoding Attention is specially optimized for MHA, MQA, GQA and MLA using CUDA core for the decoding stage of LLM inference.
☆47Jun 11, 2025Updated last year
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
ShaYeBuHui01 / flash_attention_inference
View on GitHub
Performance of the C++ interface of flash attention and flash attention v2 in large language model (LLM) inference scenarios.
☆15Aug 31, 2023Updated 2 years ago
partic2 / pwart
View on GitHub
A lightweight WebAssembly JIT compiler and runtime , powered by sljit. PR and issue are welcome.
☆20May 9, 2026Updated 2 months ago
ganler / nnsmith-asplos-artifact
View on GitHub
https://nnsmith-asplos.rtfd.io Artifact of "NNSmith: Generating Diverse and Valid Test Cases for Deep Learning Compilers" ASPLOS'23
☆11Mar 29, 2023Updated 3 years ago
harleyszhang / lite_llama
View on GitHub
A light llama-like llm inference framework based on the triton kernel.
☆188Jan 5, 2026Updated 6 months ago
RT-Thread-packages / jerryscript
View on GitHub
JerryScript port for RT-Thread
☆21Nov 18, 2021Updated 4 years ago
cornell-zhang / allo-pldi24-artifact
View on GitHub
Artifact evaluation of PLDI'24 paper "Allo: A Programming Model for Composable Accelerator Design"
☆35Apr 11, 2024Updated 2 years ago
leepoly / sm-profiler
View on GitHub
☆82Feb 5, 2026Updated 5 months ago
UNITES-Lab / HEXA-MoE
View on GitHub
Official code for the paper "HEXA-MoE: Efficient and Heterogeneous-Aware MoE Acceleration with Zero Computation Redundancy"
☆15Mar 6, 2025Updated last year
yui0 / ugemm
View on GitHub
GEMM
☆10Aug 26, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
WangGarrison / CPP-Learning-Notes
View on GitHub
C++学习笔记
☆14Sep 19, 2021Updated 4 years ago
byrzhm / hadoop-docker-cluster
View on GitHub
hadoop 的 docker 集群配置
☆10Jun 8, 2024Updated 2 years ago
PFCCLab / Camp
View on GitHub
飞桨护航计划集训营
☆20May 25, 2026Updated last month
ziyeshanwai / python-laplacian-deformation
View on GitHub
a python version of laplacian deformation
☆22Mar 10, 2020Updated 6 years ago
hy172574895 / EasyCompleteYou
View on GitHub
A code-completion engine with easy for Vim.
☆10Jan 20, 2021Updated 5 years ago
gjzkeyframe / KFAVDemo-Android
View on GitHub
Android 音视频工程示例。
☆12Feb 8, 2023Updated 3 years ago
XiaoSongXS / CUDA-Optimization-Guide
View on GitHub
Xiao's CUDA Optimization Guide [NO LONGER ADDING NEW CONTENT]
☆328Nov 8, 2022Updated 3 years ago