A torch compile backend for multi-targets
☆46Feb 27, 2026Updated this week
Alternatives and similar repositories for xpu_graph
Users that are interested in xpu_graph are comparing it to the libraries listed below
Sorting:
- Reranking for Multi-objective Optimized Recommender Systems☆11Aug 3, 2023Updated 2 years ago
- SPBench: A Framework for Benchmarking Stream Processing Applications☆11Dec 16, 2025Updated 2 months ago
- Work related to vectorizing strategies for arbitrary FHE programs☆10Sep 5, 2025Updated 5 months ago
- libFastMesh - Optimized Finite Volume Computational Aeroacoustics (CAA) Code☆13Mar 28, 2024Updated last year
- [NeurIPS'25 Spotlight] Adaptive Attention Sparsity with Hierarchical Top-p Pruning☆87Nov 29, 2025Updated 3 months ago
- A lightweight design for computation-communication overlap.☆223Jan 20, 2026Updated last month
- ☆63Jul 14, 2025Updated 7 months ago
- Shared Middle-Layer for Triton Compilation☆329Dec 5, 2025Updated 2 months ago
- This repo contains the benchmarks for Enzyme on GPU's☆11Feb 22, 2026Updated last week
- Code for "Adaptive Self-improvement LLM Agentic System for ML Library Development" (ICML 2025)☆15Jan 6, 2026Updated last month
- ☆12Apr 30, 2024Updated last year
- ☆18Sep 27, 2022Updated 3 years ago
- A curated list for Efficient Large Language Models☆11Mar 25, 2024Updated last year
- A simple pseudo-spectral solver for the Direct Numerical Simulation (DNS) of the 3D Taylor-Green Vortex in the Julia programming language☆10Jun 6, 2022Updated 3 years ago
- ☆11Dec 9, 2025Updated 2 months ago
- ☆11Nov 13, 2020Updated 5 years ago
- Benchmarking LLMs on Typst☆19May 26, 2025Updated 9 months ago
- Expected edit distance implementation using OpenFst tools☆11May 13, 2015Updated 10 years ago
- Low-latency live streaming PoC☆11Jul 30, 2019Updated 6 years ago
- Compiler for Dynamic Neural Networks☆45Nov 13, 2023Updated 2 years ago
- Exploring Machine Learning methods and workflows in a simplified weather model☆19Jun 6, 2024Updated last year
- Reference GPU test harness for the "Accelerating MSM on GPU" challenge of ZPrize☆12Jul 7, 2022Updated 3 years ago
- Torch 7 + Android port of Neural style algorithm☆10May 10, 2016Updated 9 years ago
- KsanaDiT: High-Performance DiT (Diffusion Transformer) Inference Framework for Video & Image Generation☆36Feb 6, 2026Updated 3 weeks ago
- Python Script to Open SJTU Dormitory Smart Lock☆10Sep 12, 2022Updated 3 years ago
- Segmenting a given document using recursive xy-cut algorithm.☆12Oct 9, 2018Updated 7 years ago
- ☆24Jan 12, 2016Updated 10 years ago
- Sequence to sequence model for Arabic punctuation prediction.☆12Feb 13, 2020Updated 6 years ago
- The official repository of the Eesen project☆12Jun 20, 2018Updated 7 years ago
- ☆41Dec 10, 2024Updated last year
- Utility scripts for PyTorch (e.g. Make Perfetto show some disappearing kernels, Memory profiler that understands more low-level allocatio…☆88Sep 11, 2025Updated 5 months ago
- Sample pytorch implementation of Covariant Compositional Networks☆13Feb 17, 2018Updated 8 years ago
- Repository for answers for exercises in Programming Massively Parallel Processors book☆16Aug 10, 2024Updated last year
- YouTube-Based Multimodal Recipe Recommender☆14Jul 11, 2024Updated last year
- Convolution implementation with on the fly Toeplitz matrix generation.☆11Dec 17, 2016Updated 9 years ago
- A Compiler from "Mx* language" (A C++ & Java like language) to RV32I Assembly, with optimizations on LLVM IR. SJTU CS2966 Project.☆12Feb 12, 2023Updated 3 years ago
- Code to accompany the paper "Learning Grimaces By Watching TV" and FaceValue dataset☆12Aug 4, 2018Updated 7 years ago
- ☆18Mar 4, 2025Updated last year
- Generates random utf-8 strings for fuzz t�sting character encoding probl�ms☆11Aug 21, 2015Updated 10 years ago