A lightweight inference framework for large models
☆22May 26, 2025Updated 10 months ago
Alternatives and similar repositories for lite_lang
Users that are interested in lite_lang are comparing it to the libraries listed below.
- A light llama-like llm inference framework based on the triton kernel.☆179Jan 5, 2026Updated 3 months ago
- Flash Attention in ~100 lines of CUDA (forward pass only)☆12Jun 10, 2024Updated last year
- paper-read-notes☆13Sep 26, 2024Updated last year
- learn TensorRT from scratch🥰☆18Sep 29, 2024Updated last year
- A collection of saved code snippets☆13Jun 6, 2023Updated 2 years ago
- HunyuanDiT with TensorRT and libtorch☆18May 22, 2024Updated last year
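- Inference for GOT-OCR2.0 using mnn-llm☆14Oct 2, 2024Updated last year
- High-performance, high-accuracy license plate recognition for mainland China, Hong Kong and Macau, Taiwan, and South Korea (South Korea LPR); open source (ncnn port)☆41Nov 5, 2025Updated 5 months ago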
- ☆14Mar 8, 2025Updated last year
- Triton for OpenCL backend, and use mlir-translate to get source OpenCL code☆25Aug 27, 2025Updated 7 months ago
- Implementation of a histogram equalization program using CUDA. Histogram equalization is a technique for adjusting image intensities to e…☆13Jan 3, 2021Updated 5 years ago
- Inference Llama 2 in one file of pure Cuda☆17Aug 20, 2023Updated 2 years ago
- Nuclei AI Library Optimized For RISC-V Vector☆15Oct 15, 2025Updated 5 months ago
- "Hardware, Software, and Compilers! Oh My!" tutorial files☆16Jan 25, 2020Updated 6 years ago
- Optimize softmax in triton in many cases☆23Sep 6, 2024Updated last year
- Improve the performance of atoi()☆13Jan 23, 2016Updated 10 years ago
- Compare different hardware platforms via the Roofline Model for LLM inference tasks.☆118Mar 13, 2024Updated 2 years ago
- A learning project for getting newcomers started with a WASM JIT compiler☆14Feb 28, 2026Updated last month
- Awesome code, projects, books, etc. related to CUDA☆32Mar 30, 2026Updated last week
- A one-page-only CGraph-API-like DAG project.☆26Feb 11, 2025Updated last year
- ☆20Dec 29, 2023Updated 2 years ago
- A Minimalistic Auto-Diff Optimization Framework for Teaching and Understanding Pytorch☆27Mar 12, 2026Updated 3 weeks ago
- Adds a new Cpu0 backend to LLVM 17.0.6☆12Apr 22, 2024Updated last year
- Translations of the SiFive All Aboard article series☆11Nov 26, 2021Updated 4 years ago
- Graph model execution API for Candle☆17Jul 27, 2025Updated 8 months ago
- GitHub for AI4PD 2023 Workshop in Chile☆12Oct 12, 2023Updated 2 years ago
- ☆26Aug 15, 2023Updated 2 years ago
- A demo code for implementation of differentiable thermodynamic modeling in JAX.☆10Sep 18, 2021Updated 4 years ago
- Use time-splits for Materials Project entries for generative modeling benchmarking.☆12Mar 12, 2026Updated 3 weeks ago
- A complete (FP optional), portable implementation of stdio including printf, scanf, etc. No malloc() or static buffers.☆18Apr 16, 2025Updated 11 months ago
- An interface between the Materials Project software suite and the Schrodinger Python API, designed to allow for high-throughput execution…☆13Apr 8, 2024Updated 2 years ago
- A Rust-based, SenseVoiceSmall☆30Mar 9, 2026Updated last month
- Penn CIS 5650 (GPU Programming and Architecture) Final Project☆44Dec 11, 2023Updated 2 years ago
- Llama3 Streaming Chat Sample☆22Apr 24, 2024Updated last year
- Running LLaMA 3 with Rust.☆10May 21, 2024Updated last year
- Spring 2024 - Data Science and Machine Learning in Chemical Engineering☆12Feb 14, 2024Updated 2 years ago
- Deep Learning tools For Biology☆10Apr 18, 2022Updated 3 years ago
- ☆15Mar 30, 2024Updated 2 years ago
- A Rust crate offering similar functionality to the Python transformers package using Candle.☆14Nov 19, 2024Updated last year