facebookresearch / loop_nest
Loop Nest - Linear algebra compiler and code generator.
☆22 Updated 2 years ago
Alternatives and similar repositories for loop_nest:
Users who are interested in loop_nest are comparing it to the libraries listed below.
- Better bindings for Python ☆17 Updated 2 years ago
- No-GIL Python environment featuring NVIDIA Deep Learning libraries. ☆43 Updated last week
- FlexAttention w/ FlashAttention3 Support ☆26 Updated 4 months ago
- A tracing JIT compiler for PyTorch ☆12 Updated 3 years ago
- code for paper "Accessing higher dimensions for unsupervised word translation" ☆21 Updated last year
- Computing the greatest common divisor with transformers, source code for the paper https://arxiv.org/abs/2308.15594 ☆13 Updated 10 months ago
- Experimental scripts for researching data adaptive learning rate scheduling. ☆23 Updated last year
- ☆51 Updated 6 months ago
- Make triton easier ☆43 Updated 8 months ago
- Hacks for PyTorch ☆18 Updated last year
- ☆57 Updated this week
- APPy (Annotated Parallelism for Python) enables users to annotate loops and tensor expressions in Python with compiler directives akin to… ☆23 Updated last month
- ☆15 Updated 4 months ago
- benchmarking some transformer deployments ☆26 Updated last year
- ☆9 Updated 4 years ago
- ☆21 Updated 3 months ago
- Prototype routines for GPU quantization written using PyTorch. ☆19 Updated this week
- Awesome Triton Resources ☆19 Updated 2 months ago
- Learning Compiler Pass Orders using Coreset and Normalized Value Prediction. (ICML 2023) ☆18 Updated last year
- ☆18 Updated 9 months ago
- ☆18 Updated 2 years ago
- TORCH_LOGS parser for PT2 ☆32 Updated this week
- A LinearOperator implementation for PyTorch ☆18 Updated 4 years ago
- Code for our ICLR Trustworthy ML 2020 workshop paper "Improved Image Wasserstein Attacks and Defenses" ☆14 Updated 4 years ago
- Memory Optimizations for Deep Learning (ICML 2023) ☆62 Updated 11 months ago
- High dimensional black-box optimizer using Latent Action Monte Carlo Tree Search algorithm ☆26 Updated 2 years ago
- Standalone commandline CLI tool for compiling Triton kernels ☆17 Updated 5 months ago
- Exploration into the Firefly algorithm in Pytorch ☆35 Updated this week
- A simple hypernetwork implementation in jax using haiku. ☆23 Updated 2 years ago
- Benchmark tests supporting the TiledCUDA library. ☆15 Updated 2 months ago