Chillee / CS344_2021Links
Udacity CS344 Introduction to Parallell Programming (https://classroom.udacity.com/courses/cs344), with assignments/materials updated to build in 2021
☆46Updated 3 years ago
Alternatives and similar repositories for CS344_2021
Users that are interested in CS344_2021 are comparing it to the libraries listed below
Sorting:
- Customized matrix multiplication kernels☆54Updated 3 years ago
- ☆18Updated 2 years ago
- Texture mapping with variational auto-encoders☆40Updated 3 years ago
- Authors implementation of LieTransformer: Equivariant Self-Attention for Lie Groups☆36Updated 4 years ago
- ☆20Updated last month
- PyTorch implementation of L2L execution algorithm☆107Updated 2 years ago
- ☆29Updated 2 years ago
- Learning about Neural Radiance Fields☆67Updated 3 years ago
- ☆67Updated last year
- Fairring (FAIR + Herring) is a plug-in for PyTorch that provides a process group for distributed training that outperforms NCCL at large …☆65Updated 3 years ago
- CUDA implementation of autoregressive linear attention, with all the latest research findings☆44Updated 2 years ago
- Implementation of some personal helper functions for Einops, my most favorite tensor manipulation library ❤️☆53Updated 2 years ago
- Hacks for PyTorch☆19Updated 2 years ago
- Code for the paper PermuteFormer☆42Updated 3 years ago
- a lightweight transformer library for PyTorch☆71Updated 3 years ago
- Context Manager to profile the forward and backward times of PyTorch's nn.Module☆83Updated last year
- JaxPP is a library for JAX that enables flexible MPMD pipeline parallelism for large-scale LLM training☆45Updated 2 weeks ago
- Implementation of Kronecker Attention in Pytorch☆19Updated 4 years ago
- ☆21Updated 4 years ago
- Experiment of using Tangent to autodiff triton☆79Updated last year
- Another attempt at a long-context / efficient transformer by me☆38Updated 3 years ago
- An open source implementation of CLIP.☆32Updated 2 years ago
- Example python package with pybind11 cpp extension☆57Updated 4 years ago
- Generate bird's-eye views of conference proceedings.☆24Updated 5 months ago
- 使用c++以及cuda加速神经网络样例(实现矩阵加法和矩阵乘法)☆56Updated 3 years ago
- Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k.☆22Updated 2 years ago
- (Batched) advanced indexing for PyTorch.☆53Updated 5 months ago
- Toy implementations of some popular ML optimizers using Python/JAX☆44Updated 3 years ago
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing☆50Updated 3 years ago
- Experimental scripts for researching data adaptive learning rate scheduling.☆23Updated last year