JittorInfer is a high-performance C++ inference framework designed for large language models on Huawei's Ascend AI processor.
☆80Mar 2, 2026Updated 3 weeks ago
Alternatives and similar repositories for JittorInfer
Users that are interested in JittorInfer are comparing it to the libraries listed below.
- An experimental parallel training platform☆56Mar 25, 2024Updated 2 years ago
- A Low-Overhead tool for Floating-Point Exception Detection in NVIDIA GPUs☆13Dec 17, 2024Updated last year
- How to use node-local MPI rank IDs to manually map MPI ranks to GPUs☆14Apr 22, 2020Updated 5 years ago
- LLM inference in C/C++☆11Updated this week
- Using OpenVINO to speed up inference of PaddleOCR-VL model☆25Mar 2, 2026Updated 3 weeks ago
- An implementation of the Latent Skill Embedding model☆10Feb 19, 2016Updated 10 years ago
- PerFlow-AI is a programmable performance analysis, modeling, and prediction tool for AI systems.☆29Mar 12, 2026Updated 2 weeks ago
- ☆37Jan 10, 2026Updated 2 months ago
- [Archived] For the latest updates and community contribution, please visit: https://github.com/Ascend/TransferQueue or https://gitcode.co…☆13Jan 16, 2026Updated 2 months ago
- ☆44Sep 8, 2025Updated 6 months ago
- Library with JIT (Just-in-time) compilation support to optimize performance of small and medium matrix multiplication☆14Apr 27, 2021Updated 4 years ago
- MultiPaxos and Disk Paxos in TLA+ and PlusCal☆13Jan 23, 2023Updated 3 years ago
- Ask question to your PDF☆10Jun 11, 2023Updated 2 years ago
- TiledKernel is a code generation library based on macro kernels and memory hierarchy graph data structure.☆19May 12, 2024Updated last year
- A configurable general purpose graphics processing unit for☆12May 18, 2019Updated 6 years ago
- Toy RISC-V LLVM backend☆31Aug 15, 2022Updated 3 years ago
- A field theory inspired xAct package for Mathematica☆17Jul 28, 2016Updated 9 years ago
- Prototype of MegaScale-Infer: Serving Mixture-of-Experts at Scale with Disaggregated Expert Parallelism☆27Apr 4, 2025Updated 11 months ago
- distributed transaction processor☆16Updated this week
- ⚡Harry Potter books and audiobooks☆12Oct 1, 2020Updated 5 years ago
- Specifications and safety proofs in different tools of a simple concurrent algorithm☆24May 24, 2020Updated 5 years ago
- ☆13Jun 17, 2021Updated 4 years ago
- ☆15Feb 1, 2016Updated 10 years ago
- Fix empty address in teslamate☆26Mar 9, 2026Updated 2 weeks ago
- High-performance distributed data shuffling (all-to-all) library for MoE training and inference☆116Mar 7, 2026Updated 2 weeks ago
- A curated list of bookmarks, packages, tutorials, videos and other cool resources done in Chapel language.☆21Apr 12, 2021Updated 4 years ago
- Source code analysis of Impala, PostgreSQL, Citus and Postgres-XL☆13Jan 16, 2017Updated 9 years ago
- Ascend TileLang adapter☆236Mar 20, 2026Updated last week
- SGLang kernel library for NPU☆108Mar 18, 2026Updated last week
- A Presto plugin that supports reading CSV files from the local filesystem.☆10Jul 27, 2018Updated 7 years ago
- Source for Demystifying GPU Microarchitecture through Microbenchmarking☆18May 29, 2023Updated 2 years ago
- 📝The official repository of "Rethinking Cross-Generator Image Forgery Detection through DINOv3"☆21Dec 2, 2025Updated 3 months ago
- Official website of the book: http://themlbook.com/☆13Feb 10, 2019Updated 7 years ago
- A TLA+ formalization of the algorithm described in "Paxos Made Simple"☆21Jan 28, 2025Updated last year
- Documentation project☆13Updated this week
- ☆56Jul 7, 2025Updated 8 months ago
- MultiArchKernelBench: A Multi-Platform Benchmark for Kernel Generation☆46Updated this week
- A Trino connector to access git repository contents☆18Feb 9, 2026Updated last month
- Code for "StreamingTalker: Audio-driven 3D Facial Animation with Autoregressive Diffusion Model", AAAI2026 Oral☆48Jan 16, 2026Updated 2 months ago