int8_t and int16_t matrix multiply based on https://arxiv.org/abs/1705.01991
☆74 · Dec 30, 2023 · Updated 2 years ago
Alternatives and similar repositories for intgemm
Users that are interested in intgemm are comparing it to the libraries listed below.
- Efficient teacher-student models and scripts to make them ☆57 · Dec 16, 2023 · Updated 2 years ago
- A GPU language model, based on btree backed tries. ☆30 · Mar 6, 2018 · Updated 8 years ago
- The 14th Machine Translation Marathon 2019 in Edinburgh ☆13 · Dec 8, 2022 · Updated 3 years ago
- Materials of public talks given By SJTU X-LANCE members ☆14 · Dec 3, 2022 · Updated 3 years ago
- symmetric int8 gemm ☆67 · Jun 7, 2020 · Updated 6 years ago
- Fast Neural Machine Translation in C++ - development repository ☆287 · Jul 9, 2025 · Updated 11 months ago
- C99/C++ header-only library for division via fixed-point multiplication by inverse ☆61 · Apr 14, 2024 · Updated 2 years ago
- ☆21 · Jan 13, 2020 · Updated 6 years ago
- ☆16 · Jun 13, 2022 · Updated 4 years ago
- ☆29 · Jul 30, 2024 · Updated last year
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi. ☆26 · Jul 25, 2024 · Updated last year
- XenC: open-source data selection tool for NLP ☆65 · Mar 21, 2016 · Updated 10 years ago
- The l4t linux kernel ☆10 · Dec 5, 2018 · Updated 7 years ago
- Reproduction instructions for "Rapid Adaptation of Neural Machine Translation to New Languages" ☆39 · Aug 7, 2018 · Updated 7 years ago
- Fast stand-alone C++ decoder for RNN-based NMT models ☆31 · Dec 12, 2020 · Updated 5 years ago
- [ACL'20] Highway Transformer: A Gated Transformer. ☆33 · Dec 5, 2021 · Updated 4 years ago
- Corpus preprocessing ☆100 · Mar 16, 2024 · Updated 2 years ago
- Lightweight C++ translator for OpenNMT Torch models (deprecated) ☆80 · Apr 7, 2020 · Updated 6 years ago
- ☆42 · Jul 17, 2018 · Updated 7 years ago
- Modular and extensible speech recognition library leveraging pytorch-lightning and hydra. ☆50 · May 19, 2021 · Updated 5 years ago
- Tool for manual evaluation of parallel sentences. ☆15 · Jan 26, 2026 · Updated 5 months ago
- Nonblocking data structures ☆12 · Jan 25, 2015 · Updated 11 years ago
- Header file to translate SSE instructions to ARM NEON instructions ☆48 · Nov 22, 2013 · Updated 12 years ago
- FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/ ☆1,568 · Jun 21, 2026 · Updated last week
- ☆322 · Feb 17, 2026 · Updated 4 months ago
- Itoyori: A distributed multi-threading runtime system for global-view fork-join task parallelism ☆23 · Feb 9, 2024 · Updated 2 years ago
- Fast Neural Machine Translation in C++ ☆1,457 · Aug 25, 2023 · Updated 2 years ago
- Low-precision matrix multiplication ☆1,844 · Jan 29, 2024 · Updated 2 years ago
- a ducttape workflow for neural machine translation ☆14 · Mar 23, 2021 · Updated 5 years ago
- ROCm Command Line Profiler - Updated moved to https://github.com/GPUOpen-Tools/RCP ☆10 · Aug 24, 2017 · Updated 8 years ago
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests ☆15 · Jan 24, 2017 · Updated 9 years ago
- A fast, simple, multilingual tokenizer ☆29 · May 24, 2017 · Updated 9 years ago
- Open single and half precision gemm implementations ☆397 · Apr 2, 2023 · Updated 3 years ago
- A terminal-based renderer for OpenGL shaders. Like Shadertoy, but in the terminal. ☆12 · Sep 24, 2023 · Updated 2 years ago
- Programming Assignment Project for Information Visualization Course on University of Chinese Academy of Sciences ☆12 · Mar 10, 2017 · Updated 9 years ago
- Fast matrix multiplication ☆32 · Jul 6, 2021 · Updated 4 years ago
- Library for specialized dense and sparse matrix operations, and deep learning primitives. ☆962 · Updated this week
- A catalogue of efficient and accurate polynomial approximations ☆17 · Feb 5, 2022 · Updated 4 years ago
- Kaldi Speech Processing Tools ☆25 · Nov 16, 2018 · Updated 7 years ago