fishmingyu / GeoT
GeoT: Tensor Centric Library for Graph Neural Network via Efficient Segment Reduction on GPU
☆22 Updated 4 months ago
Alternatives and similar repositories for GeoT
Users who are interested in GeoT are comparing it to the libraries listed below
- EquiTriton is a project that seeks to implement high-performance kernels for commonly used building blocks in equivariant neural networks… ☆62 Updated last week
- ☆21 Updated 5 months ago
- PyTorch centric eager mode debugger ☆47 Updated 7 months ago
- Personal solutions to the Triton Puzzles ☆19 Updated last year
- Experiment of using Tangent to autodiff Triton ☆80 Updated last year
- ☆53 Updated 10 months ago
- Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8. ☆46 Updated last year
- Fast and memory-efficient exact attention ☆69 Updated 5 months ago
- A place to store reusable transformer components of my own creation or found on the interwebs ☆59 Updated 2 weeks ago
- CUDA implementation of autoregressive linear attention, with all the latest research findings ☆44 Updated 2 years ago
- A parallel framework for training deep neural networks ☆63 Updated 4 months ago
- FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores ☆326 Updated 7 months ago
- A library for unit scaling in PyTorch ☆128 Updated last month
- Memory Optimizations for Deep Learning (ICML 2023) ☆102 Updated last year
- FlashRNN - Fast RNN Kernels with I/O Awareness ☆93 Updated 2 months ago
- ☆81 Updated last year
- The simplest implementation of recent Sparse Attention patterns for efficient LLM inference. ☆83 Updated 3 weeks ago
- Implementation of the Kalman Filtering Attention proposed in "Kalman Filtering Attention for User Behavior Modeling in CTR Prediction" ☆58 Updated last year
- Unofficial implementation of GotenNet, a new SOTA 3D equivariant transformer, in PyTorch ☆63 Updated 4 months ago
- Graph neural networks in JAX. ☆67 Updated last year
- Explorations into whether a transformer with RL can direct a genetic algorithm to converge faster ☆70 Updated 2 months ago
- TorchFix - a linter for PyTorch-using code with autofix support ☆144 Updated 6 months ago
- A FlashAttention implementation for JAX with support for efficient document mask computation and context parallelism. ☆135 Updated 4 months ago
- Code implementing "Efficient Parallelization of a Ubiquitous Sequential Computation" (Heinsen, 2023) ☆94 Updated 8 months ago
- Running JAX in PyTorch Lightning ☆111 Updated 7 months ago
- This repository contains the experimental PyTorch native float8 training UX ☆224 Updated last year
- A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind… ☆158 Updated last month
- A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate. ☆212 Updated this week
- ☆14 Updated last year
- Explorations into the recently proposed Taylor Series Linear Attention ☆100 Updated 11 months ago