☆17Jan 1, 2024Updated 2 years ago
Alternatives and similar repositories for oneflow-lite
Users that are interested in oneflow-lite are comparing it to the libraries listed below
- A toolkit for developers to simplify the transformation of nn.Module instances. It now corresponds to torch.fx in PyTorch.☆13Apr 7, 2023Updated 2 years ago
- ☆23Apr 25, 2023Updated 2 years ago
- OneFlow->ONNX☆43Apr 19, 2023Updated 2 years ago
- ☆11Dec 26, 2025Updated 2 months ago
- OneFlow Serving☆20Apr 10, 2025Updated 11 months ago
- ☆12Mar 13, 2023Updated 3 years ago
- Datasets, Transforms and Models specific to Computer Vision☆91Nov 17, 2023Updated 2 years ago
- ☆13Mar 27, 2023Updated 2 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆17Jun 3, 2024Updated last year
- Inference Llama 2 in one file of pure Go☆16Jul 25, 2023Updated 2 years ago
- Akinasan team(秋名山车队)'s code base for the 0th Taichi Hackathon.☆19Dec 4, 2022Updated 3 years ago
- ☆23Jan 3, 2024Updated 2 years ago
- Large Language Model Onnx Inference Framework☆34Nov 25, 2025Updated 3 months ago
- ☆25Aug 27, 2021Updated 4 years ago
- handy cli tool to convert your speech to clipboard text☆15Updated this week
- ggml study notes; ggml is an inference framework for machine learning☆17Mar 24, 2024Updated last year
- a single-header math library☆17Nov 7, 2025Updated 4 months ago
- ☆14Mar 26, 2020Updated 5 years ago
- Self-trained Large Language Models based on Meta LLaMa☆29Aug 11, 2023Updated 2 years ago
- It is an advanced medical CT image analysis system that uses a multi-agent collaborative framework and the latest AI technology to automa…☆18Apr 23, 2025Updated 10 months ago
- An external memory allocator example for PyTorch.☆16Aug 10, 2025Updated 7 months ago
- CVFusion is an open-source deep learning compiler to fuse the OpenCV operators.☆33Aug 31, 2022Updated 3 years ago
- ☆16Mar 24, 2025Updated 11 months ago
- A Triton JIT runtime and ffi provider in C++☆32Updated this week
- Tengine 管子 ("Pipe") is a helper tool for quickly producing demos☆12Jul 15, 2021Updated 4 years ago
- ☆42Nov 29, 2022Updated 3 years ago
- A more efficient yolov5 with oneflow backend 🎉🎉🎉☆216Jul 10, 2025Updated 8 months ago
- ☆11Feb 5, 2026Updated last month
- Based on RetinaFace and CenterFace, with modifications. The framework depends on PyTorch.☆31Jul 23, 2020Updated 5 years ago
- GPTQ inference TVM kernel☆40Apr 25, 2024Updated last year
- Ahead of Time (AOT) Triton Math Library☆94Updated this week
- IntLLaMA: A fast and light quantization solution for LLaMA☆18Jul 21, 2023Updated 2 years ago
- A model serving framework for various research and production scenarios. Seamlessly built upon the PyTorch and HuggingFace ecosystem.☆23Oct 11, 2024Updated last year
- ☆10Nov 8, 2021Updated 4 years ago
- Getting Started with Triton: A Tutorial for Python Beginners☆45Oct 21, 2025Updated 5 months ago
- Simple C++ FFmpeg video encoder. Raw data to mp4 (h264) file.☆21Jan 24, 2021Updated 5 years ago
- Official repository for "IPDPS '24 QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices".☆20Feb 23, 2024Updated 2 years ago
- TVMScript kernel for deformable attention☆25Dec 15, 2021Updated 4 years ago
- A benchmark and playground for Completely Fair Scheduling in Go☆11Feb 12, 2022Updated 4 years ago