☆24Apr 10, 2026Updated this week
Alternatives and similar repositories for torchair
Users that are interested in torchair are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Triton adapter for Ascend. Mirror of https://gitcode.com/ascend/triton-ascend☆117Apr 8, 2026Updated last week
- See vLLM official support: https://github.com/vllm-project/vllm-ascend☆11Feb 5, 2025Updated last year
- ☆38Aug 7, 2025Updated 8 months ago
- TiledLower is a Dataflow Analysis and Codegen Framework written in Rust.☆13Nov 23, 2024Updated last year
- Sparse kernels for GNNs based on TVM☆17Nov 18, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Example of using pytorch's open device registration API☆31Oct 14, 2022Updated 3 years ago
- ☆65Apr 26, 2025Updated 11 months ago
- Ascend PyTorch adapter (torch_npu). Mirror of https://gitcode.com/Ascend/pytorch☆499Updated this week
- ⚡️Write HGEMM from scratch using Tensor Cores with WMMA, MMA and CuTe API, Achieve Peak⚡️ Performance.☆150May 10, 2025Updated 11 months ago
- ☆140Aug 18, 2025Updated 7 months ago
- DLSlime: Flexible & Efficient Heterogeneous Transfer Toolkit☆95Apr 6, 2026Updated last week
- A tool to simulate Ethereum 2.0 execution☆13Mar 13, 2020Updated 6 years ago
- 💀 Query Homebrew's analytics from the command-line (deprecated)☆17Feb 9, 2025Updated last year
- Rust开发一个玩具语言☆12Sep 19, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆11Aug 4, 2022Updated 3 years ago
- ☆87Jan 23, 2025Updated last year
- A common lisp DSL for writing zero knowledge circuits☆18Oct 19, 2022Updated 3 years ago
- ☆14Aug 18, 2025Updated 7 months ago
- A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches☆15Jun 21, 2019Updated 6 years ago
- ☆13Sep 3, 2018Updated 7 years ago
- 做Web3世界的巴别塔|看完你就懂什么是Web3|面向萌新的Web3“白皮书” |@Web3-Club☆16Aug 29, 2024Updated last year
- QuTLASS: CUTLASS-Powered Quantized BLAS for Deep Learning☆175Nov 11, 2025Updated 5 months ago
- LLM prompt patterns☆15Sep 21, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆16Feb 24, 2026Updated last month
- ☆18Mar 4, 2025Updated last year
- ☆11Mar 9, 2022Updated 4 years ago
- Implement Flash Attention using Cute.☆105Dec 17, 2024Updated last year
- ☆119May 19, 2025Updated 10 months ago
- A sample verifier for a toy language built on top of Boogie☆25Nov 24, 2022Updated 3 years ago
- ⚡Harry Potter books and audiobooks☆12Oct 1, 2020Updated 5 years ago
- Helpful kernel tutorials and examples for tile-based GPU programming☆700Updated this week
- Autonomous GPU Kernel Generation & Optimization via Deep Agents☆362Updated this week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆20Sep 28, 2024Updated last year
- ☆15Feb 1, 2016Updated 10 years ago
- Agent skills for vLLM☆59Apr 3, 2026Updated last week
- A simulation framework for modeling efficiency of Graph Neural Network Dataflows☆24Feb 14, 2025Updated last year
- Implementation of Hyena Hierarchy in JAX☆10Apr 30, 2023Updated 2 years ago
- Depict GPU memory footprint during DNN training of PyTorch☆11Nov 17, 2022Updated 3 years ago
- Persistent dense gemm for Hopper in `CuTeDSL`☆15Aug 9, 2025Updated 8 months ago