☆49 · Dec 20, 2019 · Updated 6 years ago
Alternatives and similar repositories for CUDA_study
Users that are interested in CUDA_study are comparing it to the libraries listed below.
- HALO: Hadamard-Assisted Low-Precision Optimization and Training method for finetuning LLMs. The official implementation of https://arx… ☆29 · Feb 17, 2025 · Updated last year
- CUDA code with exact k-NN algorithm for multiple GPU system. ☆12 · Jul 5, 2024 · Updated last year
- A Vector Caching Scheme for Streaming FPGA SpMV Accelerators ☆10 · Sep 7, 2015 · Updated 10 years ago
- ☆10 · Dec 19, 2023 · Updated 2 years ago
- DenseShuffleNet for Semantic Segmentation using Caffe for Cityscapes and Mapillary Vistas Dataset ☆10 · Mar 21, 2018 · Updated 8 years ago
- This simulator models multi core systems, intended primarily for studies on main memory management techniques. It models a trace-based ou… ☆12 · Jan 18, 2016 · Updated 10 years ago
- [ICML 2025] SparseLoRA: Accelerating LLM Fine-Tuning with Contextual Sparsity ☆75 · Mar 10, 2026 · Updated 2 months ago
- Fork of gem5 with support for manycore architectures. Includes models and scripts to evaluate a software-defined-vector architecture. ☆13 · Oct 14, 2021 · Updated 4 years ago
- Roll the dice, in elixir ☆14 · Dec 13, 2016 · Updated 9 years ago
- An experimental project for paddle python IR. ☆15 · Dec 4, 2023 · Updated 2 years ago
- arm-neon ☆93 · Aug 2, 2024 · Updated last year
- Extended globbing in modern C++ ☆13 · Dec 24, 2025 · Updated 4 months ago
- ☆12 · Oct 19, 2014 · Updated 11 years ago
- ☆134 · Feb 17, 2026 · Updated 2 months ago
- OpenGraph is an open-source graph processing benchmarking suite written in pure C/OpenMP. Integrated with Sniper simulator. ☆11 · Apr 27, 2024 · Updated 2 years ago
- ☆14 · Nov 6, 2019 · Updated 6 years ago
- Very basic implementation of SPM for gem5 simulator (legacy gem5 version) ☆12 · Feb 18, 2020 · Updated 6 years ago
- A selective knowledge distillation algorithm for efficient speculative decoders ☆39 · Nov 27, 2025 · Updated 5 months ago
- An Automatic Synthesis Tool for PIM-based CNN Accelerators. ☆16 · Feb 29, 2024 · Updated 2 years ago
- Speed of Light Analysis for ML Model Runtime ☆65 · Apr 13, 2026 · Updated last month
- RIMD: Efficient and Flexible Deformation Representation for Data-Driven Surface Modeling (Siggraph 2016) ☆11 · Mar 28, 2020 · Updated 6 years ago
- ☆2,742 · Jan 16, 2024 · Updated 2 years ago
- ☆16 · Jun 7, 2020 · Updated 5 years ago
- Nebula: Deep Neural Network Benchmarks in C++ ☆13 · Jan 2, 2025 · Updated last year
- Code for "Adaptive Self-improvement LLM Agentic System for ML Library Development" (ICML 2025) ☆16 · Jan 6, 2026 · Updated 4 months ago
- Mirror of http://gitlab.hpcrl.cse.ohio-state.edu/chong/ppopp19_ae, refactoring for understanding ☆17 · Oct 20, 2021 · Updated 4 years ago
- Official repository Flash Local Linear Attention ☆23 · Apr 23, 2026 · Updated 3 weeks ago
- Unofficial implementation for SOLOv2 instance segmentation ☆15 · Jun 13, 2020 · Updated 5 years ago
- Run ethash opencl kernel on Xilinx's Alveo U50 ☆17 · Mar 4, 2021 · Updated 5 years ago
- pypcd for python3 changes ☆14 · Jun 27, 2019 · Updated 6 years ago
- ☆14 · Jun 30, 2021 · Updated 4 years ago
- ☆21 · Mar 22, 2021 · Updated 5 years ago
- HPC Game Platform ☆11 · Apr 20, 2023 · Updated 3 years ago
- DatasetResearch: Benchmarking Agent Systems for Demand-Driven Dataset Discovery ☆20 · Sep 24, 2025 · Updated 7 months ago
- a demo for openmp, by Jidor ☆13 · Mar 25, 2019 · Updated 7 years ago
- Vocabulary Parallelism ☆26 · Mar 10, 2025 · Updated last year
- Spike with a coherence supported cache model ☆14 · Jul 9, 2024 · Updated last year
- ☆39 · Jun 3, 2018 · Updated 7 years ago
- Automatically Update circult-eda-mlsys-tinyml Papers Daily using Github Actions (Update Every 8th hours) ☆10 · Updated this week