Customized matrix multiplication kernels
☆57Mar 5, 2022Updated 4 years ago
Alternatives and similar repositories for custom_matmul_kernels
Users that are interested in custom_matmul_kernels are comparing it to the libraries listed below.
- Accelerating CNN's convolution operation on GPUs by using memory-efficient data access patterns.☆14Dec 8, 2017Updated 8 years ago
- ☆16Mar 24, 2025Updated last year
- ☆15Dec 31, 2020Updated 5 years ago
- Supporting example for "A Rust SentencePiece implementation"☆20Jun 7, 2020Updated 6 years ago
- 4th place solution to datafactory challenge by Intermarché.☆12Jun 28, 2021Updated 4 years ago
- All the useful tools I have been using while working in data science for remote sensing☆11Nov 27, 2019Updated 6 years ago
- ☆15Jun 11, 2022Updated 3 years ago
- Overview of IR/NLP papers covered in my team's reading group.☆10May 5, 2020Updated 6 years ago
- ☆18Aug 18, 2023Updated 2 years ago
- A simple Python wrapper for the ClearNLP constituents-to-dependencies converter☆11Nov 2, 2015Updated 10 years ago
- Code for running the transformers in the ICML 2021 paper "Thinking Like Transformers"☆18Jun 28, 2021Updated 4 years ago
- PyTorch code for the paper "Exploiting Uncertainty of Loss Landscape for Stochastic Optimization" [Bhaskara et al. (2019)]☆16Apr 30, 2026Updated last month
- ☆12Jun 14, 2021Updated 4 years ago
- Code accompanying the paper "R-U-SURE? Uncertainty-Aware Code Suggestions By Maximizing Utility Across Random User Intents"☆22May 2, 2026Updated last month
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing☆49Jan 27, 2022Updated 4 years ago
- The simplest way to deploy a machine learning model☆23Nov 19, 2022Updated 3 years ago
- Generic floating-point types in Python☆17Apr 18, 2026Updated last month
- Hessian trace estimation using PyTorch and Hutch++☆20Oct 29, 2020Updated 5 years ago
- Implementation of Lie Transformer, Equivariant Self-Attention, in Pytorch☆98Feb 19, 2021Updated 5 years ago
- The Structure and Interpretation of Deep Networks Handbook☆14Dec 14, 2024Updated last year
- Official code for NeurIPS paper "Combinatorial Optimization for Panoptic Segmentation: A Fully Differentiable Approach".☆16Jun 30, 2022Updated 3 years ago
- Yaae: Yet another autodiff engine (written in Numpy).☆28Jul 6, 2023Updated 2 years ago
- Template repo for Python projects, especially those focusing on machine learning and/or deep learning.☆15Jan 14, 2026Updated 4 months ago
- ☆14Jul 15, 2018Updated 7 years ago
- Universal Python binding for the LMDB 'Lightning' Database☆13Nov 7, 2017Updated 8 years ago
- TwoFold (2✂︎f). Text files breathe fire.☆23Jan 28, 2026Updated 4 months ago
- ☆14Jun 27, 2019Updated 6 years ago
- https://www.kaggle.com/c/rsna-intracranial-hemorrhage-detection/☆19Oct 20, 2019Updated 6 years ago
- Quantization of Convolutional Neural networks.☆250Aug 5, 2024Updated last year
- Reparameterize your PyTorch modules☆70Dec 31, 2020Updated 5 years ago
- a lightweight transformer library for PyTorch☆71Nov 2, 2021Updated 4 years ago
- @ArchieMeng's prototype of a Python FFI of nihui/waifu2x-ncnn-vulkan achieved with SWIG☆12Jul 20, 2022Updated 3 years ago
- Overlay gifs of memes on your video calls (works with Zoom, Google Meet, Microsoft Teams)☆10Apr 15, 2021Updated 5 years ago
- ☆22May 3, 2022Updated 4 years ago
- Google Colab notebooks☆43Sep 9, 2024Updated last year
- ☆12Mar 24, 2023Updated 3 years ago
- A deep learning model for recognizing tables in photos with ncnn.☆20Mar 2, 2025Updated last year
- Implementation of the paper "Efficient Attention Network: Accelerate Attention by Searching Where to Plug".☆20Jun 16, 2023Updated 2 years ago
- Lightweight object detection on Nintendo 3DS, powered by NCNN☆13Apr 3, 2024Updated 2 years ago