Forward and backward Attention DNN operators implementationed by LibTorch, cuDNN, and Eigen.
☆31Jun 6, 2023Updated 2 years ago
Alternatives and similar repositories for eigenMHA
Users that are interested in eigenMHA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Overlapping Schwarz Domain Decomposition Finite Element Algorithm in both Matlab and serial/parallel C++☆18Mar 1, 2022Updated 4 years ago
- CUDA 8-bit Tensor Core Matrix Multiplication based on m16n16k16 WMMA API☆37Sep 15, 2023Updated 2 years ago
- ☆163Sep 15, 2023Updated 2 years ago
- Auto-differentiation library for C++☆12Jan 16, 2022Updated 4 years ago
- Transparent Cudnn / Cublas / Eigen usage for the deep learning training using MNIST dataset.☆18Sep 3, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Aligntune : A Modular Toolkit for Post Training Alignment of LLMs☆36Apr 29, 2026Updated 3 weeks ago
- Fast GPU based tensor core reductions☆13Jan 13, 2023Updated 3 years ago
- Official implementation of paper "Self-Supervised Noise Modeling and Sparsity Guided Cryo-ET Image Denoising" published on Ultramicroscop…☆16Sep 10, 2024Updated last year
- ☆13Nov 25, 2022Updated 3 years ago
- ☆121Apr 11, 2024Updated 2 years ago
- opencv调用jetson/rk3588 mpp硬解码,重写了open与read函数,支持h264/h265☆14Nov 27, 2025Updated 5 months ago
- ffmpeg+cuvid+tensorrt+multicamera☆12Dec 31, 2024Updated last year
- Replication of "Taming the Factor Zoo: A Test of New Factors (Feng, Giglio, and Xiu, 2020, JF)"☆10Mar 4, 2024Updated 2 years ago
- JCudnn - Java bindings for cuDNN☆31Nov 16, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A java implementation of Bert Tokenizer.☆30Jan 4, 2022Updated 4 years ago
- Pure Java Llama2 inference with optional multi-GPU CUDA implementation☆13Sep 2, 2023Updated 2 years ago
- fpv vehicle powered by esp32 cam☆10Aug 9, 2022Updated 3 years ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆57May 4, 2026Updated 2 weeks ago
- Scalable radix top-k selection on GPUs.☆23Jan 27, 2025Updated last year
- cuASR: CUDA Algebra for Semirings☆47Aug 22, 2022Updated 3 years ago
- A development version of the numerically exact variant of MIP solver SCIP☆11Mar 22, 2023Updated 3 years ago
- AutodiffEngine☆13Apr 1, 2019Updated 7 years ago
- ☆16Aug 18, 2015Updated 10 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Customized Openvslam for IR and RGB images☆16Oct 13, 2020Updated 5 years ago
- web app for designing and milling simple circuit boards☆14May 7, 2018Updated 8 years ago
- Transformer framework for edge computing based on C++.☆130Nov 11, 2024Updated last year
- ☆51Sep 5, 2020Updated 5 years ago
- English Georgian Dictionary for iPhone☆20Apr 19, 2018Updated 8 years ago
- Image Restoration via Multi-domain Learning☆27May 25, 2025Updated 11 months ago
- Learning-aided 3D mapping☆10May 12, 2025Updated last year
- Assembler and Decompiler for NVIDIA (Maxwell Pascal Volta Turing Ampere) GPUs.☆95Feb 23, 2023Updated 3 years ago
- TensorFlow implementation of the Dissimilarity Mixture Autoencoder: https://arxiv.org/abs/2006.08177☆13Dec 8, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Lessons Learned from GPU Experiments with Aparapi☆13Apr 17, 2016Updated 10 years ago
- This repository contains the results and code for the MLPerf™ Training v2.1 benchmark.☆15Aug 9, 2023Updated 2 years ago
- Digital Audio Effects in JavaScript☆11Updated this week
- ☆18Jan 4, 2024Updated 2 years ago
- Implemented the depth reconstruction part from the paper semi dense visual odometry from a monocular camera https://vision.in.tum.de/memb…☆12Dec 28, 2017Updated 8 years ago
- ZePolA - A Parametric Equalizer with Interactive Poles and Zeros Control for Digital Signal Processing Education☆26Dec 19, 2025Updated 5 months ago
- Opensource Light Weight Hotel Enterprise Resource Planning System☆14Feb 5, 2021Updated 5 years ago