A simple general-purpose programming language
☆99Feb 2, 2026Updated 5 months ago
Alternatives and similar repositories for prajna
Users that are interested in prajna are comparing it to the libraries listed below.
- A tensor computing compiler based on tile programming for GPU, CPU, or TPU☆45Feb 2, 2026Updated 5 months ago
- GPTQ inference TVM kernel☆41Apr 25, 2024Updated 2 years ago
- ☆17Jan 1, 2024Updated 2 years ago
- A toolkit for developers to simplify the transformation of nn.Module instances. It's now corresponding to Pytorch.fx.☆13Apr 7, 2023Updated 3 years ago
- ☆11Dec 26, 2025Updated 6 months ago
- ☆20Aug 11, 2022Updated 3 years ago
- ☆12Sep 1, 2023Updated 2 years ago
- A simple neural network inference framework☆25Aug 1, 2023Updated 2 years ago
- IntLLaMA: A fast and light quantization solution for LLaMA☆19Jul 21, 2023Updated 2 years ago
- ☆15Apr 15, 2022Updated 4 years ago
- A standalone GEMM kernel for fp16 activation and quantized weight, extracted from FasterTransformer☆96Jun 21, 2026Updated last week
- MSLK (Meta Superintelligence Labs Kernels) is a collection of PyTorch GPU operator libraries that are designed and optimized for GenAI tr…☆114Jun 27, 2026Updated last week
- Conversions to MLIR EmitC☆135Dec 12, 2024Updated last year
- MIAOW2.0 FPGA implementable design☆12Oct 18, 2017Updated 8 years ago
- ☆19Jun 25, 2026Updated last week
- ☆40Feb 28, 2020Updated 6 years ago
- a c++/cuda template library for tensor lazy evaluation☆164May 8, 2023Updated 3 years ago
- Performance of the C++ interface of flash attention and flash attention v2 in large language model (LLM) inference scenarios.☆45Feb 27, 2025Updated last year
- .NET 8 Hack project submission, using AI. Use this app to chat with AI to organize files interactively. Adjust until satisfied, then sele…☆13Nov 29, 2023Updated 2 years ago
- OneFlow->ONNX☆42Apr 19, 2023Updated 3 years ago
- PolyLib official git.☆12Jun 27, 2026Updated last week
- A demo of how to write a high-performance convolution that runs on Apple silicon☆56Feb 8, 2022Updated 4 years ago
- ☆41Mar 31, 2022Updated 4 years ago
- Self-built Chisel project template☆15Jul 19, 2023Updated 2 years ago
- triton for dsa☆66Jun 18, 2026Updated 2 weeks ago
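- Notes and thoughts from reading various papers (focused on computer architecture, machine learning systems, deep learning, and computer vision)☆25Oct 25, 2019Updated 6 years ago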
- The LLVM Project is a collection of modular and reusable compiler and toolchain technologies. Note: the repository does not accept github…☆21Dec 3, 2020Updated 5 years ago
- Implement Flash Attention using Cute.☆108Dec 17, 2024Updated last year
- This repository contains companion software for the Colfax Research paper "Categorical Foundations for CuTe Layouts".☆137Sep 24, 2025Updated 9 months ago
- Editor for the ncnn and pnnx formats☆147Apr 21, 2026Updated 2 months ago
- Debug print operator for cudagraph debugging☆18Aug 2, 2024Updated last year
- SpInfer: Leveraging Low-Level Sparsity for Efficient Large Language Model Inference on GPUs☆68Mar 25, 2025Updated last year
- A LR(1) parser generator targeting C++17.☆13Jul 8, 2020Updated 5 years ago
- Benchmark tests supporting the TiledCUDA library.☆19Nov 19, 2024Updated last year
- ☆29Oct 6, 2021Updated 4 years ago
- A super tiny RISC-V emulator that is able to run xv6.☆77Aug 16, 2022Updated 3 years ago
- A model compilation solution for various hardware☆471Aug 20, 2025Updated 10 months ago
- Subscribes to image messages published by Loomo and processes them☆10Oct 22, 2017Updated 8 years ago
- Several optimization methods of half-precision general matrix multiplication (HGEMM) using tensor core with WMMA API and MMA PTX instruct…☆551Sep 8, 2024Updated last year