Forward and backward Attention DNN operators implementationed by LibTorch, cuDNN, and Eigen.
☆31Jun 6, 2023Updated 2 years ago
Alternatives and similar repositories for eigenMHA
Users that are interested in eigenMHA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The only known (by 2022) open-source, easy-to-understand basic algorithm implementations in TD-CEM. (Please star and fork this project if…☆15Mar 1, 2022Updated 4 years ago
- ☆12Mar 4, 2022Updated 4 years ago
- CUDA 8-bit Tensor Core Matrix Multiplication based on m16n16k16 WMMA API☆35Sep 15, 2023Updated 2 years ago
- ☆159Sep 15, 2023Updated 2 years ago
- Auto-differentiation library for C++☆12Jan 16, 2022Updated 4 years ago
- Transparent Cudnn / Cublas / Eigen usage for the deep learning training using MNIST dataset.☆18Sep 3, 2020Updated 5 years ago
- Aligntune : A Modular Toolkit for Post Training Alignment of LLMs☆37Updated this week
- Fast GPU based tensor core reductions☆13Jan 13, 2023Updated 3 years ago
- Speech synthesis using LPC☆23Jun 5, 2021Updated 4 years ago
- This is a comprehensive guide on how you can automate your feature engineering process.☆11Jun 25, 2018Updated 7 years ago
- ☆120Apr 11, 2024Updated last year
- opencv调用jetson/rk3588 mpp硬解码,重写了open与read函数,支持h264/h265☆14Nov 27, 2025Updated 3 months ago
- ffmpeg+cuvid+tensorrt+multicamera☆12Dec 31, 2024Updated last year
- Replication of "Taming the Factor Zoo: A Test of New Factors (Feng, Giglio, and Xiu, 2020, JF)"☆10Mar 4, 2024Updated 2 years ago
- JCudnn - Java bindings for cuDNN☆30Nov 16, 2024Updated last year
- Pure Java Llama2 inference with optional multi-GPU CUDA implementation☆13Sep 2, 2023Updated 2 years ago
- fpv vehicle powered by esp32 cam☆10Aug 9, 2022Updated 3 years ago
- cuASR: CUDA Algebra for Semirings☆45Aug 22, 2022Updated 3 years ago
- Exact real arithmetic in Julia☆13Feb 8, 2020Updated 6 years ago
- web app for designing and milling simple circuit boards☆14May 7, 2018Updated 7 years ago
- ☆49Sep 5, 2020Updated 5 years ago
- An exact real arithmetic (aka constructive reals) for OCaml☆13Jun 14, 2024Updated last year
- Image Restoration via Multi-domain Learning☆26May 25, 2025Updated 9 months ago
- Learning-aided 3D mapping☆10May 12, 2025Updated 10 months ago
- TensorFlow implementation of the Dissimilarity Mixture Autoencoder: https://arxiv.org/abs/2006.08177☆13Dec 8, 2022Updated 3 years ago
- Assembler and Decompiler for NVIDIA (Maxwell Pascal Volta Turing Ampere) GPUs.☆94Feb 23, 2023Updated 3 years ago
- Digital Audio Effects in JavaScript☆11Updated this week
- ☆18Jan 4, 2024Updated 2 years ago
- ArXiv paper website☆14Feb 25, 2018Updated 8 years ago
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆11May 14, 2025Updated 10 months ago
- ☆14Jun 8, 2023Updated 2 years ago
- FFT-based windowed spectrum analyzer☆13Mar 10, 2017Updated 9 years ago
- A toy text-to-image model trained from scratch.☆19Jun 9, 2025Updated 9 months ago
- Easily install Kubernetes on Raspbian/HypriotOS☆10Jul 16, 2018Updated 7 years ago
- cross-platform modular neural network inference library, small and efficient☆13May 15, 2023Updated 2 years ago
- This project is an extension of the project of the same name developped by☆14Jun 20, 2022Updated 3 years ago
- This repository contains the hardware implementation for Static BFP convolution on FPGA☆10Oct 15, 2019Updated 6 years ago
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.☆12Dec 24, 2022Updated 3 years ago
- ☆15May 24, 2021Updated 4 years ago