CSshengxy/MEC

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/CSshengxy/MEC)

CSshengxy / MEC

ICML2017 MEC: Memory-efficient Convolution for Deep Neural Network C++实现(非官方)

☆17

Alternatives and similar repositories for MEC

Users that are interested in MEC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

gplhegde / convolution-flavors
View on GitHub
Implementation of convolution layer in different flavors
☆68Oct 8, 2017Updated 8 years ago
HydraQYH / hp_rms_norm
View on GitHub
High performance RMSNorm Implement by using SM Core Storage(Registers and Shared Memory)
☆30Jan 22, 2026Updated 6 months ago
mrzhuzhe / riven
View on GitHub
CPU Memory Compiler and Parallel programing
☆26Nov 18, 2024Updated last year
MatanHamilis / one_stencil
View on GitHub
Multiple 1-stencil implementations using nvidia cuda.
☆12Dec 2, 2017Updated 8 years ago
Xilinx / SDFEC-PYNQ
View on GitHub
A PYNQ overlay demonstrating the Xilinx RFSoC SD-FEC
☆13Jun 29, 2022Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
xuqiantong / CUDA-Winograd
View on GitHub
Fast CUDA Kernels for ResNet Inference.
☆183May 26, 2019Updated 7 years ago
3dem / externprior
View on GitHub
RELION external reconstruct functionality
☆12Sep 11, 2020Updated 5 years ago
njuhope / cuda_sgemm
View on GitHub
☆121Apr 11, 2024Updated 2 years ago
lixiuhong / implicit_gemm_convolution
View on GitHub
☆14May 28, 2019Updated 7 years ago
lixiuhong / batched_gemm
View on GitHub
☆40Feb 28, 2020Updated 6 years ago
DaisukeMiyamoto / aws-parallelcluster-relion
View on GitHub
example set up for Relion on AWS ParallelCluster for CryoEM
☆13May 21, 2022Updated 4 years ago
gpgpu-sim / pytorch-gpgpu-sim
View on GitHub
Modified version of PyTorch able to work with changes to GPGPU-Sim
☆58Nov 18, 2022Updated 3 years ago
benlwk / Tensorcrypto
View on GitHub
☆16Feb 27, 2022Updated 4 years ago
hova88 / CUDA-MatMul-Practice
View on GitHub
☆19Jan 4, 2024Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
owensgroup / GpuBTree
View on GitHub
Code for paper "Engineering a High-Performance GPU B-Tree" accepted to PPoPP 2019
☆57Jun 27, 2022Updated 4 years ago
csehydrogen / Winograd-OpenCL
View on GitHub
Winograd-based convolution implementation in OpenCL
☆29Jan 22, 2017Updated 9 years ago
EuanPyle / relion4_tomo_robot
View on GitHub
Automated workflow for preparing tilt series data for RELION 4.0.
☆13Dec 17, 2023Updated 2 years ago
streamer-AP / HRT19D-detection
View on GitHub
无人车感知组的技术文章，教程
☆18Jan 17, 2019Updated 7 years ago
ddjiajun / SSVD
View on GitHub
Implementation of "Single Shot Video Object Detector"
☆23Mar 25, 2020Updated 6 years ago
GindaChen / FlexFlashAttention3
View on GitHub
FlexAttention w/ FlashAttention3 Support
☆27Oct 5, 2024Updated last year
Robinatp / YOLO_Tensorflow
View on GitHub
YOLO( You Only Look Once,including YOLOv1,YOLOv2,YOLOv3) using tensorflow ,including train/detected and export pb script. Convert darkne…
☆25Aug 30, 2018Updated 7 years ago
msnh2012 / XNet
View on GitHub
Simple CuDNN wrapper
☆29Nov 29, 2015Updated 10 years ago
stganser / polyite
View on GitHub
Polyite: Iterative Schedule Optimization for Parallelization in the Polyhedron Model
☆12Jan 19, 2020Updated 6 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
louisliuwei / pynq-dpu
View on GitHub
Migrate Xilinx edge AI solution to PYNQ
☆17Nov 3, 2020Updated 5 years ago
XiaoMi / nnlib
View on GitHub
Fork of https://source.codeaurora.org/quic/hexagon_nn/nnlib
☆59Apr 10, 2023Updated 3 years ago
daadaada / turingas
View on GitHub
Assembler for NVIDIA Volta and Turing GPUs
☆246Jan 13, 2022Updated 4 years ago
lightbulb128 / Pencil
View on GitHub
☆17Oct 16, 2024Updated last year
hebench / reference-seal-backend
View on GitHub
The SEAL-CPU backend is a Reference backend engine for HEBench which is a shared library that implements the required functions specified…
☆11Mar 3, 2023Updated 3 years ago
marsiau / PYNQ-RTL-SDR
View on GitHub
A FPGA accelerated SDR receiver using PYNQ-Z2 board and RTL-SDR
☆23Oct 22, 2019Updated 6 years ago
vkrasnov / vpmadd
View on GitHub
Multiplication using AVX512 and AVX512IFMA instructions
☆25Nov 9, 2015Updated 10 years ago
xupsh / pynq-supported-board-file
View on GitHub
☆24Nov 30, 2018Updated 7 years ago
MegEngine / cutlass
View on GitHub
CUDA Templates for Linear Algebra Subroutines
☆101Apr 25, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
DPCEKY / systolic-array
View on GitHub
HLS implemented systolic array structure
☆41Nov 13, 2017Updated 8 years ago
dengzelu / semantic-segmentation-pytorch
View on GitHub
semantic segmentation using pytorch
☆11Dec 1, 2017Updated 8 years ago
weishengying / cutlass_flash_atten_fp8
View on GitHub
使用 cutlass 仓库在 ada 架构上实现 fp8 的 flash attention
☆82Aug 12, 2024Updated last year
d-li14 / mobilenext.pytorch
View on GitHub
Rethinking Bottleneck Structure for Efficient Mobile Network Design
☆12Jul 22, 2020Updated 6 years ago
wgq18 / img_defog
View on GitHub
一种基于FPGA平台的实时视频去雾系统项目代码，其中bit流文件可以直接下载到PYNQ-Z2开发板上，通过usb和hdmi设备输入有雾视频，将去雾后的视频输出到显示屏上。c++源代码部分是我们的去雾IP核的源代码。
☆20Nov 24, 2019Updated 6 years ago
VLSIDA / OpenCache
View on GitHub
An open-source custom cache generator.
☆37Mar 14, 2024Updated 2 years ago
OpenPPL / CuAssembler
View on GitHub
An unofficial cuda assembler, for all generations of SASS, hopefully ：）
☆85Mar 20, 2023Updated 3 years ago