a simple general program language
☆100Feb 2, 2026Updated 2 months ago
Alternatives and similar repositories for prajna
Users that are interested in prajna are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- a tensor computing compiler based tile programming for gpu, cpu or tpu☆45Feb 2, 2026Updated 2 months ago
- GPTQ inference TVM kernel☆40Apr 25, 2024Updated last year
- ☆17Jan 1, 2024Updated 2 years ago
- A toolkit for developers to simplify the transformation of nn.Module instances. It's now corresponding to Pytorch.fx.☆13Apr 7, 2023Updated 3 years ago
- ☆11Dec 26, 2025Updated 3 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆12Sep 1, 2023Updated 2 years ago
- A simple neural network inference framework☆25Aug 1, 2023Updated 2 years ago
- IntLLaMA: A fast and light quantization solution for LLaMA☆18Jul 21, 2023Updated 2 years ago
- ☆15Apr 15, 2022Updated 3 years ago
- A standalone GEMM kernel for fp16 activation and quantized weight, extracted from FasterTransformer☆96Feb 20, 2026Updated last month
- MSLK (Meta Superintelligence Labs Kernels) is a collection of PyTorch GPU operator libraries that are designed and optimized for GenAI tr…☆94Updated this week
- Conversions to MLIR EmitC☆135Dec 12, 2024Updated last year
- MIAOW2.0 FPGA implementable design☆12Oct 18, 2017Updated 8 years ago
- ☆40Feb 28, 2020Updated 6 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- a c++/cuda template library for tensor lazy evaluation☆165May 8, 2023Updated 2 years ago
- Performance of the C++ interface of flash attention and flash attention v2 in large language model (LLM) inference scenarios.☆44Feb 27, 2025Updated last year
- 自建 chisel 工程模板☆14Jul 19, 2023Updated 2 years ago
- Free resource for the book AI Compiler Development Guide☆49Dec 22, 2022Updated 3 years ago
- Source for kusionstack.io☆18Oct 13, 2025Updated 5 months ago
- PolyLib official git.☆11Jan 27, 2026Updated 2 months ago
- This is a demo how to write a high performance convolution run on apple silicon☆56Feb 8, 2022Updated 4 years ago
- This repository contains companion software for the Colfax Research paper "Categorical Foundations for CuTe Layouts".☆121Sep 24, 2025Updated 6 months ago
- triton for dsa☆60Apr 2, 2026Updated last week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆41Mar 31, 2022Updated 4 years ago
- Musings in GEMM (General Matrix Multiplication)☆14Dec 14, 2025Updated 3 months ago
- 记录阅读各类paper的想法笔记 (关注体系结构,机器学习系统,深度学习,计算机视觉)☆25Oct 25, 2019Updated 6 years ago
- The LLVM Project is a collection of modular and reusable compiler and toolchain technologies. Note: the repository does not accept github…☆21Dec 3, 2020Updated 5 years ago
- Implement Flash Attention using Cute.☆105Dec 17, 2024Updated last year
- SpInfer: Leveraging Low-Level Sparsity for Efficient Large Language Model Inference on GPUs☆63Mar 25, 2025Updated last year
- ncnn和pnnx格式编辑器☆138Oct 7, 2024Updated last year
- Debug print operator for cudagraph debugging☆14Aug 2, 2024Updated last year
- A LR(1) parser generator targeting C++17.☆13Jul 8, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Benchmark tests supporting the TiledCUDA library.☆18Nov 19, 2024Updated last year
- A super tiny RISC-V emulator that is able to run xv6.☆76Aug 16, 2022Updated 3 years ago
- ☆29Oct 6, 2021Updated 4 years ago
- A model compilation solution for various hardware☆469Aug 20, 2025Updated 7 months ago
- handy cli tool to convert your speech to clipboard text☆15Mar 18, 2026Updated 3 weeks ago
- Several optimization methods of half-precision general matrix multiplication (HGEMM) using tensor core with WMMA API and MMA PTX instruct…☆535Sep 8, 2024Updated last year
- Transformers components but in Triton☆34May 9, 2025Updated 11 months ago