☆31Mar 31, 2026Updated 2 weeks ago
Alternatives and similar repositories for llmsys_code_examples
Users that are interested in llmsys_code_examples are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A minimal implementation of vllm.☆71Jul 27, 2024Updated last year
- ☆14Feb 2, 2025Updated last year
- A system to improve compatibility between different Django versions, and make upgrading dependencies less painful.☆13Updated this week
- ☆11Oct 24, 2022Updated 3 years ago
- A text-based game where language models learn to lie and to detect lies.☆12Oct 4, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- An MLIR-based compiler from C/C++ to AMD-Xilinx Versal AIE☆17Aug 5, 2022Updated 3 years ago
- ☆65Apr 26, 2025Updated 11 months ago
- 计算机视觉 北京邮电大学 鲁鹏 课件与学习笔记☆13Aug 3, 2021Updated 4 years ago
- LLM Inference via Triton (Flexible & Modular): Focused on Kernel Optimization using CUBIN binaries, Starting from gpt-oss Model☆81Updated this week
- My GitHub Repo for UIUC ECE408 Applied Parallel Programming, mainly focus on CUDA programming and algorithm implementation.☆28Jan 16, 2024Updated 2 years ago
- SJTU-SE高级数据结构☆14Jun 8, 2023Updated 2 years ago
- Subject of the hackathon 42☆12Nov 9, 2022Updated 3 years ago
- Triton for OpenCL backend, and use mlir-translate to get source OpenCL code☆26Aug 27, 2025Updated 7 months ago
- 🎉My Collections of CUDA Kernels~☆11Jun 25, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- hadoop 的 docker 集群配置☆10Jun 8, 2024Updated last year
- 给llvm17.0.6添加一个新后端Cpu0☆12Apr 22, 2024Updated last year
- ☆15Jun 22, 2025Updated 9 months ago
- Xilinx Modifications to Halide☆13May 3, 2021Updated 4 years ago
- ☆20Feb 17, 2023Updated 3 years ago
- Zero-shot clinical trial matching with LLMs☆17Mar 1, 2025Updated last year
- Inference Llama 2 with a model compiled to native code by TorchInductor☆14Feb 8, 2024Updated 2 years ago
- ☆13Jul 2, 2025Updated 9 months ago
- Region-level profiling for CUDA kernels with trace, NVBit, CUPTI, and an interactive Explorer.☆103Mar 27, 2026Updated 2 weeks ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- An alternative Vivado custom design example (to fully Vitis) for the User Logic Partition targeting VCK5000☆13Jul 16, 2024Updated last year
- Advanced implementation of DeepSeek-R1 featuring Group Relative Policy Optimization (GRPO) for mathematical reasoning AI. Integrates safe…☆13Jan 29, 2025Updated last year
- IREE C++ Template☆17Jul 30, 2024Updated last year
- Allow torch tensor memory to be released and resumed later☆233Mar 10, 2026Updated last month
- CS294 AI Systems Class Website☆18Apr 25, 2022Updated 3 years ago
- Repo for: When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment☆38Jun 5, 2023Updated 2 years ago
- ☆29Apr 7, 2025Updated last year
- Simple python library for generating your own perfetto traces for your application. Can be used for both app instrumentation and custom …☆25Jun 22, 2025Updated 9 months ago
- MERN Stack Bootcamp☆18Jul 28, 2019Updated 6 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.☆13May 29, 2024Updated last year
- 基于matlab和bag of words的图像分类☆10Mar 15, 2017Updated 9 years ago
- LLM implementation one matrix multiplication at a time☆13Aug 8, 2024Updated last year
- ☆25Jan 7, 2026Updated 3 months ago
- 将《GRE再要你命3000词》中的例句按照规则提取改编,辅助单词记忆☆13Oct 19, 2018Updated 7 years ago
- Measuring the situational awareness of language models☆40Feb 12, 2024Updated 2 years ago
- Implementing SPMD control flow in LLVM using reconverging CFGs - Vectorizing Divergent Control-Flow for SIMD Applications☆18Apr 11, 2019Updated 7 years ago