huggingface / chug
Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.
☆157Updated last year
Alternatives and similar repositories for chug:
Users that are interested in chug are comparing it to the libraries listed below
- M4 experiment logbook☆57Updated last year
- Code used for the creation of OBELICS, an open, massive and curated collection of interleaved image-text web documents, containing 141M d…☆200Updated 7 months ago
- Manage scalable open LLM inference endpoints in Slurm clusters☆254Updated 9 months ago
- ☆58Updated last year
- Set of scripts to finetune LLMs☆37Updated last year
- LL3M: Large Language and Multi-Modal Model in Jax☆72Updated last year
- Fast, Modern, Memory Efficient, and Low Precision PyTorch Optimizers☆92Updated 9 months ago
- Python Library to evaluate VLM models' robustness across diverse benchmarks☆201Updated this week
- ☆63Updated 7 months ago
- ☆64Updated last year
- ☆123Updated 5 months ago
- ☆302Updated 10 months ago
- The official repo for the paper "VeCLIP: Improving CLIP Training via Visual-enriched Captions"☆242Updated 3 months ago
- PyTorch code for hierarchical k-means -- a data curation method for self-supervised learning☆151Updated 10 months ago
- Supercharge huggingface transformers with model parallelism.☆76Updated 6 months ago
- ☆169Updated 2 months ago
- ☆101Updated 10 months ago
- ☆47Updated 7 months ago
- Let's build better datasets, together!☆259Updated 4 months ago
- Memory-Efficient CUDA kernels for training ConvNets with PyTorch.☆40Updated 2 months ago
- Multimodal language model benchmark, featuring challenging examples☆167Updated 4 months ago
- Just some miscellaneous utility functions / decorators / modules related to Pytorch and Accelerate to help speed up implementation of new…☆120Updated 8 months ago
- Language models scale reliably with over-training and on downstream tasks☆96Updated last year
- ☆75Updated 6 months ago
- ☆79Updated last year
- A minimal implementation of LLaVA-style VLM with interleaved image & text & video processing ability.☆91Updated 4 months ago
- This project is a collection of fine-tuning scripts to help researchers fine-tune Qwen 2 VL on HuggingFace datasets.☆65Updated 7 months ago
- Code release for "Dropout Reduces Underfitting"☆313Updated last year
- ☆92Updated last year
- Easily run PyTorch on multiple GPUs & machines☆45Updated last month