openmlsys / openmlsys-cudaLinks
Tutorials for writing high-performance GPU operators in AI frameworks.
☆132Updated 2 years ago
Alternatives and similar repositories for openmlsys-cuda
Users that are interested in openmlsys-cuda are comparing it to the libraries listed below
Sorting:
- ☆144Updated last year
- Performance of the C++ interface of flash attention and flash attention v2 in large language model (LLM) inference scenarios.☆44Updated 10 months ago