fishmingyu / GeoT
GeoT: Tensor Centric Library for Graph Neural Network via Efficient Segment Reduction on GPU
☆23 · Updated 7 months ago
Alternatives and similar repositories for GeoT
Users who are interested in GeoT are comparing it to the libraries listed below.
- EquiTriton is a project that seeks to implement high-performance kernels for commonly used building blocks in equivariant neural networks… ☆64 · Updated 2 weeks ago
- ☆21 · Updated 7 months ago
- Experiment of using Tangent to autodiff triton ☆80 · Updated last year
- PyTorch centric eager mode debugger ☆48 · Updated 10 months ago
- A place to store reusable transformer components of my own creation or found on the interwebs ☆59 · Updated 2 weeks ago
- Parallel framework for training and fine-tuning deep neural networks ☆65 · Updated last week
- Personal solutions to the Triton Puzzles ☆20 · Updated last year
- Write a fast kernel and run it on Discord. See how you compare against the best! ☆58 · Updated 2 weeks ago
- ☆83 · Updated last year
- Pytorch implementation of a simple way to enable (Stochastic) Frame Averaging for any network ☆51 · Updated last year
- Train across all your devices, ezpz 🍋 ☆24 · Updated this week
- FlashRNN - Fast RNN Kernels with I/O Awareness ☆103 · Updated last week
- TORCH_LOGS parser for PT2 ☆62 · Updated last month
- The simplest implementation of recent Sparse Attention patterns for efficient LLM inference. ☆92 · Updated 3 months ago
- TorchFix - a linter for PyTorch-using code with autofix support ☆148 · Updated 2 months ago
- FLOPS counter for all your GPU benchmarking needs ☆12 · Updated last year
- Fast and memory-efficient exact attention ☆71 · Updated 7 months ago
- extensible collectives library in triton ☆90 · Updated 7 months ago
- Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8. ☆45 · Updated last year
- FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores ☆329 · Updated 10 months ago
- ☆58 · Updated last year
- Proof-of-concept of global switching between numpy/jax/pytorch in a library. ☆18 · Updated last year
- ☆103 · Updated this week
- This repository contains the experimental PyTorch native float8 training UX ☆223 · Updated last year
- ☆28 · Updated 9 months ago
- Self contained pytorch implementation of a sinkhorn based router, for mixture of experts or otherwise ☆39 · Updated last year
- Memory Optimizations for Deep Learning (ICML 2023) ☆110 · Updated last year
- ☆112 · Updated last year
- Torch Distributed Experimental ☆117 · Updated last year
- ☆126 · Updated last year