fuzihaofzh / cstlLinks
The C++ Standard Template Library (STL) for Python.
☆23Updated last year
Alternatives and similar repositories for cstl
Users that are interested in cstl are comparing it to the libraries listed below
Sorting:
- Demystify RAM Usage in Multi-Process Data Loaders☆196Updated 2 years ago
- Run Effective Large Batch Contrastive Learning Beyond GPU/TPU Memory Constraint☆399Updated last year
- Implementation of a Transformer, but completely in Triton☆273Updated 3 years ago
- Code used for the creation of OBELICS, an open, massive and curated collection of interleaved image-text web documents, containing 141M d…☆206Updated 11 months ago
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.☆158Updated last year
- See details in https://github.com/pytorch/xla/blob/r1.12/torch_xla/distributed/fsdp/README.md☆24Updated 2 years ago
- Code for "SemDeDup", a simple method for identifying and removing semantic duplicates from a dataset (data pairs which are semantically s…☆139Updated last year
- Detection Transformers with Assignment☆257Updated last year
- Official implementation of "Active Image Indexing"☆59Updated 2 years ago
- Implementation of fused cosine similarity attention in the same style as Flash Attention☆214Updated 2 years ago
- Megatron's multi-modal data loader☆232Updated last week
- ☆114Updated last year
- A huge dataset for Document Visual Question Answering☆19Updated last year
- Official repository for "Revisiting Weakly Supervised Pre-Training of Visual Perception Models". https://arxiv.org/abs/2201.08371.☆179Updated 3 years ago
- Repository of the paper "Accelerating Transformer Inference for Translation via Parallel Decoding"☆119Updated last year
- Research code for pixel-based encoders of language (PIXEL)☆337Updated 3 weeks ago
- Code release for "Dropout Reduces Underfitting"☆313Updated 2 years ago
- Official PyTorch implementation of the paper "DisCo-CLIP: A Distributed Contrastive Loss for Memory Efficient CLIP Training".☆58Updated 2 years ago
- A library that contains a rich collection of performant PyTorch model metrics, a simple interface to create new metrics, a toolkit to fac…☆236Updated 6 months ago
- Official PyTorch implementation of the paper: "Solving ImageNet: a Unified Scheme for Training any Backbone to Top Results" (2022)☆193Updated 2 years ago
- [ICCV2023] TinyCLIP: CLIP Distillation via Affinity Mimicking and Weight Inheritance☆98Updated last year
- ☆18Updated 2 years ago
- Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training☆138Updated 2 years ago
- A minimal PyTorch Lightning OpenAI GPT w DeepSpeed Training!☆112Updated 2 years ago
- EfficientVLM: Fast and Accurate Vision-Language Models via Knowledge Distillation and Modal-adaptive Pruning (ACL 2023)☆30Updated 2 years ago
- Prune a model while finetuning or training.☆403Updated 3 years ago
- Implementation of Soft MoE, proposed by Brain's Vision team, in Pytorch☆309Updated 4 months ago
- (Unofficial) PyTorch implementation of grouped-query attention (GQA) from "GQA: Training Generalized Multi-Query Transformer Models from …☆173Updated last year
- Torch Distributed Experimental☆117Updated last year
- Slicing a PyTorch Tensor Into Parallel Shards☆299Updated 2 months ago