cuJSON: A Highly Parallel JSON Parser for GPUs
☆40 · Dec 12, 2025 · Updated 2 months ago
Alternatives and similar repositories for cuJSON
Users who are interested in cuJSON are comparing it to the libraries listed below.
- ☆19 · Nov 21, 2022 · Updated 3 years ago
- Compiler plugin for performance analysis of HIP applications ☆13 · Apr 7, 2025 · Updated 10 months ago
- Triton for OpenCL backend, and use mlir-translate to get source OpenCL code ☆24 · Aug 27, 2025 · Updated 6 months ago
- ☆14 · Apr 24, 2024 · Updated last year
- ☆16 · Sep 12, 2023 · Updated 2 years ago
- GenDP: A Dynamic Programming Framework for Genome Sequencing Analysis ☆17 · Jan 12, 2024 · Updated 2 years ago
- ☆65 · Apr 26, 2025 · Updated 10 months ago
- Distributed Deep Graph Learning Framework for Dynamic Graphs ☆19 · Mar 25, 2024 · Updated last year
- Tutorials of Extending and importing TVM with CMAKE Include dependency. ☆16 · Oct 11, 2024 · Updated last year
- TiledLower is a Dataflow Analysis and Codegen Framework written in Rust. ☆14 · Nov 23, 2024 · Updated last year
- Debug print operator for cudagraph debugging ☆14 · Aug 2, 2024 · Updated last year
- ☆31 · Oct 21, 2025 · Updated 4 months ago
- a size profiler for cuda binary ☆72 · Jan 15, 2026 · Updated last month
- Handwritten GEMM using Intel AMX (Advanced Matrix Extension) ☆17 · Jan 11, 2025 · Updated last year
- A Triton JIT runtime and ffi provider in C++ ☆31 · Updated this week
- AgentHub is the only SDK you need to connect to state-of-the-art LLMs (GPT-5.2/Claude 4.5/Gemini 3). ☆52 · Updated this week
- ANT-ACE: Advanced Compiler Ecosystem for Fully Homomorphic Encryption and Domain Specific Computing ☆56 · Updated this week
- ☆39 · Dec 14, 2025 · Updated 2 months ago
- A Framework for Graph Sampling and Random Walk on GPUs. ☆38 · Feb 3, 2025 · Updated last year
- Automatic differentiation for Triton Kernels ☆29 · Aug 12, 2025 · Updated 6 months ago
- FractalTensor is a programming framework that introduces a novel approach to organizing data in deep neural networks (DNNs) as a list of … ☆32 · Dec 21, 2024 · Updated last year
- ☆40 · Aug 18, 2019 · Updated 6 years ago
- GVProf: A Value Profiler for GPU-based Clusters ☆53 · Mar 24, 2024 · Updated last year
- TileFusion is an experimental C++ macro kernel template library that elevates the abstraction level in CUDA C for tile processing. ☆106 · Jun 28, 2025 · Updated 8 months ago
- A practical way of learning Swizzle ☆37 · Feb 3, 2025 · Updated last year
- ☆17 · Feb 26, 2020 · Updated 6 years ago
- JPStream: JSONPath Stream Processing in Parallel ☆25 · Nov 15, 2022 · Updated 3 years ago
- Canvas: End-to-End Kernel Architecture Search in Neural Networks ☆27 · Nov 18, 2024 · Updated last year
- Transforming Graphs for Efficient Irregular Graph Processing on GPUs ☆50 · Nov 15, 2022 · Updated 3 years ago
- Cheddar: A Swift Fully Homomorphic Encryption (FHE) GPU Library ☆48 · Jan 14, 2026 · Updated last month
- A memory profiler for NVIDIA GPUs to explore memory inefficiencies in GPU-accelerated applications. ☆27 · Oct 13, 2024 · Updated last year
- ☆42 · Nov 1, 2025 · Updated 3 months ago
- ☆25 · Feb 20, 2024 · Updated 2 years ago
- A llama model inference framework implemented in CUDA C++ ☆64 · Nov 8, 2024 · Updated last year
- We invite you to visit and follow our new repository at https://github.com/microsoft/TileFusion. TiledCUDA is a highly efficient kernel … ☆193 · Jan 28, 2025 · Updated last year
- A pattern-based algorithmic autotuner for graph processing on GPUs. ☆32 · Jun 25, 2025 · Updated 8 months ago
- Artifact evaluation of PLDI'24 paper "Allo: A Programming Model for Composable Accelerator Design" ☆33 · Apr 11, 2024 · Updated last year
- ARIES: An Agile MLIR-Based Compilation Flow for Reconfigurable Devices with AI Engines (FPGA 2025 Best Paper Nominee) ☆59 · Updated this week
- LonestarGPU: Irregular algorithms parallelized for GPUs ☆38 · Nov 11, 2019 · Updated 6 years ago