fuzihaofzh / cstlLinks
The C++ Standard Template Library (STL) for Python.
☆24Updated 2 years ago
Alternatives and similar repositories for cstl
Users that are interested in cstl are comparing it to the libraries listed below
Sorting:
- Demystify RAM Usage in Multi-Process Data Loaders☆204Updated 2 years ago
- Implementation of a Transformer, but completely in Triton☆276Updated 3 years ago
- Implementation of fused cosine similarity attention in the same style as Flash Attention☆217Updated 2 years ago
- ☆121Updated last year
- Prune a model while finetuning or training.☆405Updated 3 years ago
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.☆159Updated last year
- Memory-Efficient CUDA kernels for training ConvNets with PyTorch.☆42Updated this week
- Run Effective Large Batch Contrastive Learning Beyond GPU/TPU Memory Constraint☆414Updated last year
- See details in https://github.com/pytorch/xla/blob/r1.12/torch_xla/distributed/fsdp/README.md☆25Updated 2 years ago
- Code used for the creation of OBELICS, an open, massive and curated collection of interleaved image-text web documents, containing 141M d…☆209Updated last year
- Repository of the paper "Accelerating Transformer Inference for Translation via Parallel Decoding"☆120Updated last year
- Code release for "Dropout Reduces Underfitting"☆315Updated 2 years ago
- Official repository for "Revisiting Weakly Supervised Pre-Training of Visual Perception Models". https://arxiv.org/abs/2201.08371.☆182Updated 3 years ago
- Official PyTorch Implementation of Long-Short Transformer (NeurIPS 2021).☆228Updated 3 years ago
- ☆186Updated last year
- The accompanying code for "Memory-efficient Transformers via Top-k Attention" (Ankit Gupta, Guy Dar, Shaya Goodman, David Ciprut, Jonatha…☆69Updated 4 years ago
- Root Mean Square Layer Normalization☆256Updated 2 years ago
- Official PyTorch implementation of the paper: "Solving ImageNet: a Unified Scheme for Training any Backbone to Top Results" (2022)☆193Updated 2 years ago
- ☆105Updated last year
- Official implementation of "Active Image Indexing"☆59Updated 2 years ago
- [ICCV2023] TinyCLIP: CLIP Distillation via Affinity Mimicking and Weight Inheritance☆114Updated last year
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities☆80Updated 3 years ago
- Low-bit optimizers for PyTorch☆132Updated 2 years ago
- Torch Distributed Experimental☆117Updated last year
- This repository contains the experimental PyTorch native float8 training UX☆223Updated last year
- A minimal PyTorch Lightning OpenAI GPT w DeepSpeed Training!☆113Updated 2 years ago
- A general and accurate MACs / FLOPs profiler for PyTorch models☆629Updated 3 months ago
- A library that contains a rich collection of performant PyTorch model metrics, a simple interface to create new metrics, a toolkit to fac…☆243Updated last month
- [KDD'22] Learned Token Pruning for Transformers☆101Updated 2 years ago
- Accelerate PyTorch models with ONNX Runtime☆366Updated 8 months ago