Forward and backward Attention DNN operators implementationed by LibTorch, cuDNN, and Eigen.
☆31Jun 6, 2023Updated 3 years ago
Alternatives and similar repositories for eigenMHA
Users that are interested in eigenMHA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- CUDA 8-bit Tensor Core Matrix Multiplication based on m16n16k16 WMMA API☆37Sep 15, 2023Updated 2 years ago
- Transparent Cudnn / Cublas / Eigen usage for the deep learning training using MNIST dataset.☆18Sep 3, 2020Updated 5 years ago
- Aligntune : A Modular Toolkit for Post Training Alignment of LLMs☆37Jun 26, 2026Updated last week
- Fast GPU based tensor core reductions☆12Jan 13, 2023Updated 3 years ago
- Speech synthesis using LPC☆25Jun 5, 2021Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆121Apr 11, 2024Updated 2 years ago
- opencv调用jetson/rk3588 mpp硬解码,重写了open与read函数,支持h264/h265☆14Nov 27, 2025Updated 7 months ago
- ffmpeg+cuvid+tensorrt+multicamera☆12Dec 31, 2024Updated last year
- JCudnn - Java bindings for cuDNN☆31Nov 16, 2024Updated last year
- A java implementation of Bert Tokenizer.☆30Jan 4, 2022Updated 4 years ago
- Pure Java Llama2 inference with optional multi-GPU CUDA implementation☆13Sep 2, 2023Updated 2 years ago
- fpv vehicle powered by esp32 cam☆10Aug 9, 2022Updated 3 years ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆57May 28, 2026Updated last month
- Scalable radix top-k selection on GPUs.☆23Jan 27, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A development version of the numerically exact variant of MIP solver SCIP☆11Mar 22, 2023Updated 3 years ago
- antkillerfarm's crazy magic☆18Oct 3, 2024Updated last year
- web app for designing and milling simple circuit boards☆14May 7, 2018Updated 8 years ago
- Transformer framework for edge computing based on C++.☆130Jun 21, 2026Updated last week
- English Georgian Dictionary for iPhone☆20Apr 19, 2018Updated 8 years ago
- An exact real arithmetic (aka constructive reals) for OCaml☆13Jun 14, 2024Updated 2 years ago
- Image Restoration via Multi-domain Learning☆31May 25, 2025Updated last year
- TensorFlow implementation of the Dissimilarity Mixture Autoencoder: https://arxiv.org/abs/2006.08177☆13Dec 8, 2022Updated 3 years ago
- ☆18Jan 4, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Implemented the depth reconstruction part from the paper semi dense visual odometry from a monocular camera https://vision.in.tum.de/memb…☆12Dec 28, 2017Updated 8 years ago
- Source and solution codes for Professional CUDA C Programming book.☆15Aug 20, 2020Updated 5 years ago
- A toy text-to-image model trained from scratch.☆19Jun 9, 2025Updated last year
- DeepSparkHub selects hundreds of application algorithms and models, covering various fields of AI and general-purpose computing, to suppo…☆70May 28, 2026Updated last month
- Opensource Light Weight Hotel Enterprise Resource Planning System☆14Feb 5, 2021Updated 5 years ago
- ZePolA - A Parametric Equalizer with Interactive Poles and Zeros Control for Digital Signal Processing Education☆25Dec 19, 2025Updated 6 months ago
- A single header-only C++ library for automatic / algorithmic differentiation.☆16Nov 29, 2022Updated 3 years ago
- Code for the paper:<LARNet:Lie Algebra Residual Network for Profile Face Recognition>(ICML2021)☆10Aug 19, 2021Updated 4 years ago
- cuDNN Frontend is NVIDIA's modern, open-source entry point to the cuDNN library and a growing collection of high-performance open-source …☆861Updated this week
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆14Jun 8, 2023Updated 3 years ago
- Code for the paper "Interpreting video features: A comparison of 3D Convolutional networks and Convolutional LSTM networks"☆11Dec 14, 2020Updated 5 years ago
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.☆12Dec 24, 2022Updated 3 years ago
- This project is an extension of the project of the same name developped by☆16Jun 20, 2022Updated 4 years ago
- ☆16May 24, 2021Updated 5 years ago
- Neuralizer.ai - Visual Neural Network Designer☆14Nov 8, 2022Updated 3 years ago
- Continuous speech recognition for Android demo☆14Feb 20, 2024Updated 2 years ago