fuzihaofzh / cstlLinks
The C++ Standard Template Library (STL) for Python.
☆25Updated 2 years ago
Alternatives and similar repositories for cstl
Users that are interested in cstl are comparing it to the libraries listed below
Sorting:
- Demystify RAM Usage in Multi-Process Data Loaders☆205Updated 2 years ago
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.☆160Updated last year
- Run Effective Large Batch Contrastive Learning Beyond GPU/TPU Memory Constraint☆420Updated last year
- Implementation of a Transformer, but completely in Triton☆277Updated 3 years ago
- A library that contains a rich collection of performant PyTorch model metrics, a simple interface to create new metrics, a toolkit to fac…☆244Updated last week
- Accelerate PyTorch models with ONNX Runtime☆368Updated 2 weeks ago
- Implementation of fused cosine similarity attention in the same style as Flash Attention☆219Updated 2 years ago
- ☆191Updated last year
- See details in https://github.com/pytorch/xla/blob/r1.12/torch_xla/distributed/fsdp/README.md☆25Updated 3 years ago
- Prune a model while finetuning or training.☆405Updated 3 years ago
- Official repository for "Revisiting Weakly Supervised Pre-Training of Visual Perception Models". https://arxiv.org/abs/2201.08371.☆182Updated 3 years ago
- Code used for the creation of OBELICS, an open, massive and curated collection of interleaved image-text web documents, containing 141M d…☆211Updated last year
- A lightweight library designed to accelerate the process of training PyTorch models by providing a minimal, but extensible training loop …☆193Updated 6 months ago
- ☆124Updated last year
- Official PyTorch implementation of the paper: "Solving ImageNet: a Unified Scheme for Training any Backbone to Top Results" (2022)☆193Updated 2 years ago
- Code release for "Dropout Reduces Underfitting"☆317Updated 2 years ago
- This repository contains the experimental PyTorch native float8 training UX☆227Updated last year
- Official implementation of "Active Image Indexing"☆60Updated 2 years ago
- Detection Transformers with Assignment☆264Updated 2 years ago
- # Unified Normalization (ACM MM'22) By Qiming Yang, Kai Zhang, Chaoxiang Lan, Zhi Yang, Zheyang Li, Wenming Tan, Jun Xiao, and Shiliang P…☆34Updated 2 years ago
- Experimental CUDA kernel framework unifying typed dimensions, NVRTC JIT specialization, and ML‑guided tuning.☆46Updated this week
- Easily benchmark PyTorch model FLOPs, latency, throughput, allocated gpu memory and energy consumption☆109Updated 2 years ago
- OTOv1-v3, NeurIPS, ICLR, TMLR, DNN Training, Compression, Structured Pruning, Erasing Operators, CNN, Diffusion, LLM☆310Updated last year
- Slicing a PyTorch Tensor Into Parallel Shards☆300Updated 6 months ago
- Research code for pixel-based encoders of language (PIXEL)☆345Updated 5 months ago
- Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time☆503Updated last year
- Code for "SemDeDup", a simple method for identifying and removing semantic duplicates from a dataset (data pairs which are semantically s…☆150Updated 2 years ago
- M4 experiment logbook☆58Updated 2 years ago
- A library for unit scaling in PyTorch☆133Updated 5 months ago
- Torch Distributed Experimental☆117Updated last year