☆23Apr 25, 2023Updated 3 years ago
Alternatives and similar repositories for oneflow-xrt
Users that are interested in oneflow-xrt are comparing it to the libraries listed below.
- ☆17Jan 1, 2024Updated 2 years ago
- OneFlow Serving☆20Apr 10, 2025Updated last year
- ☆11Dec 26, 2025Updated 6 months ago
- A toolkit for developers to simplify the transformation of nn.Module instances. It now corresponds to Pytorch.fx.☆13Apr 7, 2023Updated 3 years ago
- auto deploy neovim like chxuan/vimplus☆12Apr 22, 2025Updated last year
- OneFlow->ONNX☆42Apr 19, 2023Updated 3 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆17Jun 3, 2024Updated 2 years ago
- Datasets, Transforms and Models specific to Computer Vision☆91Nov 17, 2023Updated 2 years ago
- Models and examples built with OneFlow☆100Oct 16, 2024Updated last year
- https://start.oneflow.org/oneflow-yolo-doc☆23Mar 14, 2023Updated 3 years ago
- A more efficient yolov5 with oneflow backend 🎉🎉🎉☆215Jul 10, 2025Updated 11 months ago
- ☆33Feb 3, 2025Updated last year
- LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training☆403Jul 31, 2025Updated 11 months ago
- A GPU performance profiling tool for PyTorch models☆22Jul 5, 2022Updated 3 years ago
- DeepLearning Framework Performance Profiling Toolkit☆292Mar 28, 2022Updated 4 years ago
- ☆48Mar 5, 2024Updated 2 years ago
- An unofficial cuda assembler, for all generations of SASS, hopefully :)☆85Mar 20, 2023Updated 3 years ago
- A quick way to benchmark your CUDA compiler on a Linux environment☆27Mar 16, 2011Updated 15 years ago
- Based on RetinaFace and CenterFace, with modifications. The framework depends on PyTorch.☆31Jul 23, 2020Updated 5 years ago
- ☆14Mar 26, 2020Updated 6 years ago
- Toolkit for launching and observing MaxText training on Slurm-managed GPU clusters☆28Jun 18, 2026Updated last week
- An external memory allocator example for PyTorch.☆16Aug 10, 2025Updated 10 months ago
- Notes and thoughts from reading papers about PersonReID.☆10Jan 10, 2019Updated 7 years ago
- CVFusion is an open-source deep learning compiler to fuse the OpenCV operators.☆33Aug 31, 2022Updated 3 years ago
- oneflow documentation☆69Jun 26, 2024Updated 2 years ago
- ☆126Dec 15, 2023Updated 2 years ago
- add tensorflow ops to ncnn☆30Jan 13, 2019Updated 7 years ago
- ROI_Align☆14Mar 11, 2020Updated 6 years ago
- IntLLaMA: A fast and light quantization solution for LLaMA☆18Jul 21, 2023Updated 2 years ago
- Ahead of Time (AOT) Triton Math Library☆99Jun 16, 2026Updated 2 weeks ago
- AlgorithmNote is a knowledge-sharing GitHub page with three main parts: algorithms, engineering, and basic knowledge.☆13Feb 17, 2015Updated 11 years ago
- Official repository for "IPDPS' 24 QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices".☆20Feb 23, 2024Updated 2 years ago
- deepx_core is a foundational library focused on tensor computation and deep learning☆379Apr 15, 2025Updated last year
- BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.☆929Dec 30, 2024Updated last year
- TVMScript kernel for deformable attention☆25Dec 15, 2021Updated 4 years ago
- A Next.js version of Claude Artifacts, inspired by llamacoder☆26Sep 26, 2024Updated last year
- This sample shows how to use the oneAPI Video Processing Library (oneVPL) to perform a single and multi-source video decode and preproces…☆15Jun 15, 2023Updated 3 years ago
- ☆12Dec 16, 2021Updated 4 years ago
- FP8 flash attention implemented on the Ada architecture using the cutlass repository☆82Aug 12, 2024Updated last year