Customized matrix multiplication kernels
☆57Mar 5, 2022Updated 4 years ago
Alternatives and similar repositories for custom_matmul_kernels
Users that are interested in custom_matmul_kernels are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Accelerating CNN's convolution operation on GPUs by using memory-efficient data access patterns.☆14Dec 8, 2017Updated 8 years ago
- Reinforcement learning modular with pytorch☆11Jan 18, 2021Updated 5 years ago
- Rate model implementations for (adaptive) integrate-and-fire neurons based on the Fokker-Planck equation: (i) numerical (finite volume) s…☆11Apr 23, 2019Updated 6 years ago
- ☆16Mar 24, 2025Updated last year
- ☀️ Measuring the accuracy of BBC weather forecasts in Honolulu, USA☆12Jul 10, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆15Dec 31, 2020Updated 5 years ago
- A Hackable Quantization Library for PyTorch☆22Mar 29, 2021Updated 5 years ago
- Fast Emulation of Approximate DNN Accelerators in PyTorch☆30Feb 23, 2024Updated 2 years ago
- 4th place solution to datafactory challenge by Intermarché.☆12Jun 28, 2021Updated 4 years ago
- Approximate layers - TensorFlow extension☆27Apr 14, 2025Updated 11 months ago
- ☆15Jun 11, 2022Updated 3 years ago
- A fork of http://pydispatcher.sourceforge.net/ with PyPy support☆16Jul 3, 2017Updated 8 years ago
- Project management ROPE™ estimate: realistic estimate, optimistic estimate, pessimistic estimate, equilibristic estimate☆27Apr 14, 2025Updated 11 months ago
- Overview of IR/NLP papers covered in my team's reading group.☆10May 5, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Minimal code to train ELMo models in recent versions of TensorFlow☆14Apr 30, 2023Updated 2 years ago
- ☆18Aug 18, 2023Updated 2 years ago
- A simple Python wrapper for the ClearNLP constituents-to-dependencies converter☆11Nov 2, 2015Updated 10 years ago
- Code for running the transformers in the ICML 2021 paper "Thinking Like Transformers"☆18Jun 28, 2021Updated 4 years ago
- PyTorch implementation of the estimator proposed in the paper "Estimating Differential Entropy under Gaussian Convolutions"☆13Oct 22, 2020Updated 5 years ago
- ☆12Jun 14, 2021Updated 4 years ago
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing☆49Jan 27, 2022Updated 4 years ago
- Hessian trace estimation using PyTorch and Hutch++☆20Oct 29, 2020Updated 5 years ago
- Benchmark your NCNN models on 3DS(or crash)☆10Apr 15, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Causal Fairness Analysis☆20Apr 16, 2025Updated 11 months ago
- The Structure and Interpretation of Deep Networks Handbook☆14Dec 14, 2024Updated last year
- RUSSE: Russian Semantic Evaluation.☆16Mar 1, 2022Updated 4 years ago
- Yaae: Yet another autodiff engine (written in Numpy).☆28Jul 6, 2023Updated 2 years ago
- Official code for NeurIPS paper "Combinatorial Optimization for Panoptic Segmentation: A Fully Differentiable Approach".☆16Jun 30, 2022Updated 3 years ago
- Implementation of Flash Attention in Jax☆227Mar 1, 2024Updated 2 years ago
- A python library for highly configurable transformers - easing model architecture search and experimentation.☆48Nov 30, 2021Updated 4 years ago
- Universal Python binding for the LMDB 'Lightning' Database☆13Nov 7, 2017Updated 8 years ago
- Neural Network Based Dependency Parsers☆11Jan 14, 2016Updated 10 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ncnn export & infer mobileclip☆21Aug 18, 2025Updated 7 months ago
- TwoFold (2✂︎f). Text files breathe fire.☆23Jan 28, 2026Updated 2 months ago
- A GPU FP32 computation method with Tensor Cores.☆26Dec 8, 2025Updated 3 months ago
- ☆14Jun 27, 2019Updated 6 years ago
- https://www.kaggle.com/c/rsna-intracranial-hemorrhage-detection/☆19Oct 20, 2019Updated 6 years ago
- Quantization of Convolutional Neural networks.☆250Aug 5, 2024Updated last year
- Reparameterize your PyTorch modules☆71Dec 31, 2020Updated 5 years ago