shan18 / Perceiver-Resampler-XAttn-CaptioningLinks
Generating Captions via Perceiver-Resampler Cross-Attention Networks
☆17Updated 2 years ago
Alternatives and similar repositories for Perceiver-Resampler-XAttn-Captioning
Users that are interested in Perceiver-Resampler-XAttn-Captioning are comparing it to the libraries listed below
Sorting:
- ☆23Updated 11 months ago
- JAX implementation ViT-VQGAN☆82Updated 3 years ago
- Code release for "Improved baselines for vision-language pre-training"☆61Updated last year
- Code and weights for the paper "Cluster and Predict Latents Patches for Improved Masked Image Modeling"☆124Updated 7 months ago
- Implementation of fused cosine similarity attention in the same style as Flash Attention☆219Updated 2 years ago
- ☆54Updated 2 years ago
- Official PyTorch Implementation for Paper "No More Adam: Learning Rate Scaling at Initialization is All You Need"☆54Updated 10 months ago
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.☆160Updated last year
- Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k.☆22Updated 2 years ago
- Official Repository of Pretraining Without Attention (BiGS), BiGS is the first model to achieve BERT-level transfer learning on the GLUE …☆115Updated last year
- Automatically take good care of your preemptible TPUs☆37Updated 2 years ago
- ☆65Updated 2 years ago
- ☆209Updated 3 years ago
- Implementation of some personal helper functions for Einops, my most favorite tensor manipulation library ❤️☆57Updated 2 years ago
- ☆34Updated last year
- ☆64Updated last year
- WIP☆93Updated last year
- ☆187Updated last year
- M4 experiment logbook☆58Updated 2 years ago
- Switch EMA: A Free Lunch for Better Flatness and Sharpness☆28Updated last year
- Utilities for Training Very Large Models☆58Updated last year
- ☆103Updated last year
- Experimental CUDA kernel framework unifying typed dimensions, NVRTC JIT specialization, and ML‑guided tuning.☆43Updated last week
- Un-*** 50 billions multimodality dataset☆23Updated 3 years ago
- Code and pretrained models for the paper: "MatMamba: A Matryoshka State Space Model"☆61Updated last year
- Implementation of 🌻 Mirasol, SOTA Multimodal Autoregressive model out of Google Deepmind, in Pytorch☆90Updated last year
- Official code and data for NeurIPS 2023 paper "ImageNet-Hard: The Hardest Images Remaining from a Study of the Power of Zoom and Spatial …☆40Updated last year
- ☆16Updated last year
- Easily run PyTorch on multiple GPUs & machines☆54Updated 2 weeks ago
- CLOOB training (JAX) and inference (JAX and PyTorch)☆74Updated 3 years ago