High-performance CUDA kernels for low-latency, real-time financial inference, optimized for both consumer and datacenter GPUs.
☆19Jul 25, 2025Updated 10 months ago
Alternatives and similar repositories for cuda_latency_benchmark
Users who are interested in cuda_latency_benchmark are comparing it to the libraries listed below.
- Keyboard-first dotfiles for terminal-centric development with tmux, Neovim, and coding agents.☆25May 20, 2026Updated last week
- My personal dotfiles with automated macOS setup. Features smart installation scripts, Bats testing (bash), performance monitoring, and 2…☆11Apr 24, 2026Updated last month
- Backpack Attachments is a FiveM resource for attaching weapons and items to players' backs. It supports customizable attachment points, h…☆10Nov 14, 2024Updated last year
- The SEAL-CPU backend is a Reference backend engine for HEBench which is a shared library that implements the required functions specified…☆11Mar 3, 2023Updated 3 years ago
- Demo repository for article "Express server, Handlebars & Critical Path Performance Optimization"☆13Jan 12, 2017Updated 9 years ago
- A lightweight React hook that automatically manages fade overlays for scrollable containers. Provides smooth gradient transitions at the …☆12Aug 11, 2025Updated 9 months ago
- Repository for running LLMs efficiently on Mac silicon (M1, M2, M3). Features Jupyter notebook for Meta-Llama-3 setup using MLX framework…☆11May 4, 2024Updated 2 years ago
- Stable Magisk modules for performance and efficient battery usage on rooted Android devices.☆27May 2, 2026Updated 3 weeks ago
- A SystemVerilog-based simulation and design of a Last Level Cache (LLC) implementing the MESI protocol, featuring Pseudo-LRU replacement,…☆16Mar 8, 2026Updated 2 months ago
- Principles and Methodologies for Serial Performance Optimization (OSDI' 25)☆29Jun 5, 2025Updated 11 months ago
- Sekai Viewer but built with Next, optimized for performance☆11Jan 20, 2023Updated 3 years ago
- Materials for a workshop on JVM performance optimization☆14Jan 27, 2024Updated 2 years ago
- An implementation of the Pregel graph processing system on the Spark cluster computing framework. Merged into Spark; please see:☆11Apr 9, 2011Updated 15 years ago
- High-performance technical indicators library for financial analysis, optimized with Numba☆16Oct 13, 2025Updated 7 months ago
- A batched implementation for efficient Qwen2.5-VL inference.☆25Jul 16, 2025Updated 10 months ago
- Tomasulo Simulator written in React as the project for Computer Architecture course, Spring 2019, Tsinghua University☆11Jun 9, 2019Updated 6 years ago
- An experimental python library to compile and analyze the cost of any desired composite simulation in real or imaginary time, and with or…☆10Feb 9, 2024Updated 2 years ago
- 10K+ req/s batch API client for LLM endpoints — Rust, async, load-balanced☆19Feb 21, 2026Updated 3 months ago
- r3conwhale aims to develop a multifunctional recon chain for web applications, intelligently interpreting collected data, and optimizing …☆14Jul 3, 2024Updated last year
- 💰 Save money on AI API costs! 76% token reduction, Auto-Fix token limits, Universal AI compatibility. Cline • Copilot • Claude • Curs…☆31Jun 18, 2025Updated 11 months ago
- ☆10Nov 22, 2022Updated 3 years ago
- machine learning model performance metrics & charts with confidence intervals, optimized with numba to be fast☆16Dec 15, 2021Updated 4 years ago
- This project is a real-time Wav2Lip implementation that I am actively optimizing to enhance the precision and performance of audio-to-lip…☆11Dec 6, 2023Updated 2 years ago
- ☆11Apr 2, 2021Updated 5 years ago
- MySQL Memory Calculator estimates maximum MySQL memory usage based on key configuration settings. Featuring real-time calculations and vi…☆18Dec 17, 2024Updated last year
- Implementation of the SHA-3 family using AVX/AVX2 instructions.☆14Oct 5, 2018Updated 7 years ago
- A powerful image optimization tool that reduces file sizes while maintaining quality. It supports compression, resizing, and format conve…☆10Sep 21, 2024Updated last year
- High performance brainfuck transpiler/interpreter for Lua with FFI support. Very fast implementation with multiple optimization passes.☆13Jan 24, 2026Updated 4 months ago
- Proxima is a Node.js and Express API that provides basic CRUD functionality for managing data resources. It supports GET, POST, PATCH, an…☆11Jun 8, 2023Updated 2 years ago
- Wen? Now! A library to simplify your Web3 data fetching.☆20Jun 30, 2022Updated 3 years ago
- Efficient 3bit/4bit quantization of LLaMA models☆18May 18, 2023Updated 3 years ago
- A magisk module that optimizes your device's memory performance through persistent zRAM + Swapfile optimization with VM tweaks.☆15Jun 1, 2025Updated 11 months ago
- Fork of kingoflolz/mesh-transformer-jax with memory usage optimizations and support for GPT-Neo, GPT-NeoX, BLOOM, OPT and fairseq dense L…☆22Nov 14, 2022Updated 3 years ago
- A high-performance ULID (Universally Unique Lexicographically Sortable Identifier) generator using WebAssembly, up to 40x faster than tra…☆20May 11, 2024Updated 2 years ago
- optimize TCP settings and download speeds of applications on Windows systems for improved network performance☆11Apr 29, 2025Updated last year
- DATAVIEW is a big data workflow management system. It uses Dropbox as the data cloud and Amazon EC2 as the compute cloud. Current researc…☆11Jun 11, 2022Updated 3 years ago
- Demo theme with various front-end performance optimization tricks applied☆15Sep 18, 2017Updated 8 years ago
- Repo for Li, Kafka, Gao et al 2019 "Clustering discretization methods for generation of material performance databases in machine learni…☆14May 27, 2019Updated 7 years ago
- Explore the world of electric vehicle battery optimization, where I simulate and fine-tune charging strategies based on temperature and S…☆11Sep 29, 2023Updated 2 years ago