Customized matrix multiplication kernels
☆57Mar 5, 2022Updated 4 years ago
Alternatives and similar repositories for custom_matmul_kernels
Users that are interested in custom_matmul_kernels are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Reinforcement learning modular with pytorch☆11Jan 18, 2021Updated 5 years ago
- ☆12Sep 29, 2021Updated 4 years ago
- ☆16Mar 24, 2025Updated last year
- ☀️ Measuring the accuracy of BBC weather forecasts in Honolulu, USA☆12Jul 10, 2021Updated 4 years ago
- ☆15Dec 31, 2020Updated 5 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- A Hackable Quantization Library for PyTorch☆22Mar 29, 2021Updated 5 years ago
- Supporting example for "A Rust SentencePiece implementation"☆20Jun 7, 2020Updated 5 years ago
- 4th place solution to datafactory challenge by Intermarché.☆12Jun 28, 2021Updated 4 years ago
- Approximate layers - TensorFlow extension☆27Apr 14, 2025Updated last year
- ☆15Jun 11, 2022Updated 3 years ago
- Overview of IR/NLP papers covered in my team's reading group.☆10May 5, 2020Updated 5 years ago
- Minimal code to train ELMo models in recent versions of TensorFlow☆14Apr 30, 2023Updated 3 years ago
- ☆18Aug 18, 2023Updated 2 years ago
- A simple Python wrapper for the ClearNLP constituents-to-dependencies converter☆11Nov 2, 2015Updated 10 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- PyTorch Code for the Paper: "Exploiting Uncertainty of Loss Landscape for Stochastic Optimization [Bhaskara et al. (2019)]☆16Dec 8, 2025Updated 4 months ago
- ☆13Jun 20, 2019Updated 6 years ago
- ☆12Jun 14, 2021Updated 4 years ago
- PolyMage is a domain-specific language and optimizing code generator for auto-parallelisation☆14Jul 15, 2016Updated 9 years ago
- ☆16Sep 24, 2024Updated last year
- A Numpy implementation of a Generative Adversarial Network.☆17Sep 4, 2020Updated 5 years ago
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing☆49Jan 27, 2022Updated 4 years ago
- The simplest way to deploy a machine learning model☆24Nov 19, 2022Updated 3 years ago
- LLVM/MLIR based compiler instrumentation of AMD GPU kernels☆20Jul 13, 2025Updated 9 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆11Aug 8, 2018Updated 7 years ago
- Hessian trace estimation using PyTorch and Hutch++☆20Oct 29, 2020Updated 5 years ago
- Scale-out system monitoring☆21Updated this week
- The Structure and Interpretation of Deep Networks Handbook☆14Dec 14, 2024Updated last year
- Causal Fairness Analysis☆21Apr 16, 2025Updated last year
- Handy tools & graphics API abstraction for blazing fast prototyping☆10Jan 17, 2024Updated 2 years ago
- Official code for NeurIPS paper "Combinatorial Optimization for Panoptic Segmentation: A Fully Differentiable Approach".☆16Jun 30, 2022Updated 3 years ago
- Implementation of Flash Attention in Jax☆228Mar 1, 2024Updated 2 years ago
- ncnn export & infer mobileclip☆21Aug 18, 2025Updated 8 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Courses on Deep Reinforcement Learning (DRL) and DRL papers for recommender systems☆17May 23, 2022Updated 3 years ago
- LLM-DSE: Searching Accelerator Parameters with LLM Agents☆13May 22, 2025Updated 11 months ago
- A GPU FP32 computation method with Tensor Cores.☆26Dec 8, 2025Updated 4 months ago
- https://www.kaggle.com/c/rsna-intracranial-hemorrhage-detection/☆19Oct 20, 2019Updated 6 years ago
- Quantization of Convolutional Neural networks.☆249Aug 5, 2024Updated last year
- a lightweight transformer library for PyTorch☆71Nov 2, 2021Updated 4 years ago
- Gate-Level Simulation on a GPU☆10Nov 22, 2016Updated 9 years ago