☆32Mar 31, 2026Updated last month
Alternatives and similar repositories for llmsys_code_examples
Users that are interested in llmsys_code_examples are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A system to improve compatibility between different Django versions, and make upgrading dependencies less painful.☆13Apr 13, 2026Updated 3 weeks ago
- ☆13May 7, 2023Updated 2 years ago
- making the official triton tutorials actually comprehensible☆152Aug 25, 2025Updated 8 months ago
- An MLIR-based compiler from C/C++ to AMD-Xilinx Versal AIE☆17Aug 5, 2022Updated 3 years ago
- Welcome to OptML! This repository is designed for those new to MLIR and machine learning-based optimizations. As a compiler enthusiast, I…☆20Sep 16, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- My GitHub Repo for UIUC ECE408 Applied Parallel Programming, mainly focus on CUDA programming and algorithm implementation.☆28Jan 16, 2024Updated 2 years ago
- Beginner Workshops for Georgia Tech's The Agency☆11Nov 16, 2021Updated 4 years ago
- LOUDS-trie implementation example (C++)☆15Nov 27, 2019Updated 6 years ago
- LLM Inference via Triton (Flexible & Modular): Focused on Kernel Optimization using CUBIN binaries, Starting from gpt-oss Model☆113Apr 28, 2026Updated last week
- Tools for running experiments on RL agents in procgen environments☆20Apr 5, 2024Updated 2 years ago
- Triton for OpenCL backend, and use mlir-translate to get source OpenCL code☆27Aug 27, 2025Updated 8 months ago
- A community-driven pypto implementation☆63Updated this week
- Xilinx Modifications to Halide☆13May 3, 2021Updated 5 years ago
- Scala/Play + Vue.js web application providing online Risk, produced for CS 2340 with Professor Simpkins☆14Dec 2, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ACM MobiCom'23 - Reconfiguration Android smartphones to support 192kHz sampling rates acoustic sensing☆19Aug 24, 2023Updated 2 years ago
- LLVM/MLIR based compiler instrumentation of AMD GPU kernels☆20Jul 13, 2025Updated 9 months ago
- tutorials about polyhedral compilation.☆61Feb 9, 2026Updated 2 months ago
- ☆13Jul 2, 2025Updated 10 months ago
- ☆10May 18, 2024Updated last year
- Reinforcement Learning Replications is a set of Pytorch implementations of reinforcement learning algorithms.☆24Apr 4, 2026Updated last month
- Generate versal system design from ONNX model. AI engine kernels. Sub-microsecond speeds for autoencoders.☆17Dec 29, 2024Updated last year
- Collection of scripts to build PyTorch and the domain libraries from source.☆14Apr 1, 2026Updated last month
- An alternative Vivado custom design example (to fully Vitis) for the User Logic Partition targeting VCK5000☆13Jul 16, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Masked Convolutional Flow☆60Apr 9, 2020Updated 6 years ago
- Low-latency ASR using SpeechBrain StreamingASR and torchaudio StreamReader.☆18Apr 19, 2025Updated last year
- 63k Chinese sentences with simplified, traditional, pinyin and english translation for offline use☆21Mar 17, 2021Updated 5 years ago
- CS294 AI Systems Class Website☆18Apr 25, 2022Updated 4 years ago
- Allow torch tensor memory to be released and resumed later☆241Apr 20, 2026Updated 2 weeks ago
- Repo for: When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment☆38Jun 5, 2023Updated 2 years ago
- ☆29Apr 7, 2025Updated last year
- Simple python library for generating your own perfetto traces for your application. Can be used for both app instrumentation and custom …☆26Jun 22, 2025Updated 10 months ago
- ☆22Apr 10, 2026Updated 3 weeks ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- LLM implementation one matrix multiplication at a time☆13Aug 8, 2024Updated last year
- Measuring the situational awareness of language models☆41Feb 12, 2024Updated 2 years ago
- MIT 6.172 Performance Engineering of Software Systems☆16Dec 30, 2021Updated 4 years ago
- DGEMM on KNL, achieve 75% MKL☆19May 19, 2022Updated 3 years ago
- Utilities for constructing a large dataset of LLVM IR☆25Jun 2, 2025Updated 11 months ago
- deep learning framework from scratch☆33Apr 18, 2022Updated 4 years ago
- ☆29Apr 4, 2024Updated 2 years ago