facebookresearch / loop_nestLinks
Loop Nest - Linear algebra compiler and code generator.
☆22Updated 2 years ago
Alternatives and similar repositories for loop_nest
Users that are interested in loop_nest are comparing it to the libraries listed below
Sorting:
- FlexAttention w/ FlashAttention3 Support☆26Updated 8 months ago
- ☆52Updated 9 months ago
- Better bindings for Python☆17Updated 2 years ago
- JAX bindings for the flash-attention3 kernels☆11Updated 9 months ago
- A tracing JIT compiler for PyTorch☆13Updated 3 years ago
- Experimental scripts for researching data adaptive learning rate scheduling.☆23Updated last year
- No-GIL Python environment featuring NVIDIA Deep Learning libraries.☆60Updated last month
- NumPy+Jax with named axes and an uncompromising attitude☆20Updated 3 months ago
- ☆18Updated last year
- code for paper "Accessing higher dimensions for unsupervised word translation"☆21Updated last year
- PyTorch implementation of the Flash Spectral Transform Unit.☆16Updated 8 months ago
- Exploration into the Firefly algorithm in Pytorch☆39Updated 3 months ago
- APPy (Annotated Parallelism for Python) enables users to annotate loops and tensor expressions in Python with compiler directives akin to…☆23Updated 2 weeks ago
- Personal solutions to the Triton Puzzles☆18Updated 10 months ago
- Make triton easier☆47Updated 11 months ago
- A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend.☆110Updated 3 weeks ago
- ☆21Updated 3 months ago
- Hacks for PyTorch☆19Updated 2 years ago
- Implementation of a Tensorflow XLA rematerialization pass☆15Updated 5 years ago
- ☆18Updated 2 years ago
- ☆9Updated 4 years ago
- ☆60Updated this week
- A tracing JIT for PyTorch☆17Updated 2 years ago
- benchmarking some transformer deployments☆26Updated 2 years ago
- A simple hypernetwork implementation in jax using haiku.☆23Updated 2 years ago
- TritonParse is a tool designed to help developers analyze and debug Triton kernels by visualizing the compilation process and source code…☆14Updated this week
- ☆13Updated 4 years ago
- A thin, highly portable toolkit for efficiently compiling dense loop-based computation.☆148Updated 2 years ago
- A LinearOperator implementation for PyTorch☆18Updated 4 years ago
- A demonstration of source code transformation to implement automatic differentiation, compatible with an operation overload style AD libr…☆13Updated 2 years ago