☆30Feb 12, 2026Updated last month
Alternatives and similar repositories for llmsys_code_examples
Users that are interested in llmsys_code_examples are comparing it to the libraries listed below.
- A minimal implementation of vllm.☆70Jul 27, 2024Updated last year
- A system to improve compatibility between different Django versions, and make upgrading dependencies less painful.☆13Apr 10, 2025Updated 11 months ago
- LLVM optimization passes (DCE, LICM), compilers and stuff☆14Dec 10, 2020Updated 5 years ago
- An application that displays a map and graphs showing solar irradiance forecasts in solar farms in Georgia using data from the National S…☆10Oct 15, 2021Updated 4 years ago
- Computer vision: courseware and study notes for Lu Peng's course at Beijing University of Posts and Telecommunications☆13Aug 3, 2021Updated 4 years ago
- LLM Inference via Triton (Flexible & Modular): Focused on Kernel Optimization using CUBIN binaries, Starting from gpt-oss Model☆79Updated this week
- Beginner Workshops for Georgia Tech's The Agency☆11Nov 16, 2021Updated 4 years ago
- SJTU-SE Advanced Data Structures☆14Jun 8, 2023Updated 2 years ago
- Subject of the hackathon 42☆12Nov 9, 2022Updated 3 years ago
- Tools for running experiments on RL agents in procgen environments☆20Apr 5, 2024Updated last year
- Triton for OpenCL backend, and use mlir-translate to get source OpenCL code☆25Aug 27, 2025Updated 6 months ago
- 🎉My Collections of CUDA Kernels~☆11Jun 25, 2024Updated last year
- Simple PyTorch graph capturing.☆21May 31, 2023Updated 2 years ago
- Adding a new Cpu0 backend to LLVM 17.0.6☆12Apr 22, 2024Updated last year
- ☆15Jun 22, 2025Updated 9 months ago
- Summary of all repositories for my public contents, mostly Python, in Jupyter Notebooks, PDFs, Markdowns, and more!☆11Aug 24, 2021Updated 4 years ago
- A Streaming-Native Serving Engine for TTS/STS Models☆60Feb 22, 2026Updated last month
- LLVM/MLIR based compiler instrumentation of AMD GPU kernels☆20Jul 13, 2025Updated 8 months ago
- Inference Llama 2 with a model compiled to native code by TorchInductor☆14Feb 8, 2024Updated 2 years ago
- ☆13Jul 2, 2025Updated 8 months ago
- Generate Versal system design from ONNX model. AI engine kernels. Sub-microsecond speeds for autoencoders.☆17Dec 29, 2024Updated last year
- ☆15Mar 26, 2025Updated last year
- Collection of scripts to build PyTorch and the domain libraries from source.☆14Feb 4, 2026Updated last month
- An alternative Vivado custom design example (to fully Vitis) for the User Logic Partition targeting VCK5000☆13Jul 16, 2024Updated last year
- IREE C++ Template☆17Jul 30, 2024Updated last year
- Repo for: When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment☆38Jun 5, 2023Updated 2 years ago
- Tutorials of Extending and importing TVM with CMAKE Include dependency.☆16Oct 11, 2024Updated last year
- ☆27Apr 7, 2025Updated 11 months ago
- This is a cross-chip platform collection of operators and a unified neural network library.☆16Nov 3, 2023Updated 2 years ago
- Image classification based on MATLAB and bag of words☆10Mar 15, 2017Updated 9 years ago
- LLM implementation one matrix multiplication at a time☆13Aug 8, 2024Updated last year
- ☆25Jan 7, 2026Updated 2 months ago
- Generator for MLIR files from known front-ends☆16Oct 31, 2023Updated 2 years ago
- Coursera Machine Learning Engineering for Production Specialization Course☆26May 4, 2023Updated 2 years ago
- A series of high-performance GEMM (General Matrix Multiply) implementations Iteratively optimised for H100 GPUs in Pure CUDA.☆73Feb 18, 2026Updated last month
- The source files of considerveganism.com☆36Feb 26, 2022Updated 4 years ago
- RP - FO Project S2T1, DSAI HUST☆14Jun 7, 2022Updated 3 years ago
- deep learning framework from scratch☆33Apr 18, 2022Updated 3 years ago
- A fast full-system simulator of Tenstorrent hardware☆44Updated this week