shan18 / Perceiver-Resampler-XAttn-Captioning
Generating Captions via Perceiver-Resampler Cross-Attention Networks
☆16Updated 2 years ago
Alternatives and similar repositories for Perceiver-Resampler-XAttn-Captioning
Users that are interested in Perceiver-Resampler-XAttn-Captioning are comparing it to the libraries listed below
Sorting:
- Load any clip model with a standardized interface☆21Updated last year
- code for paper "Accessing higher dimensions for unsupervised word translation"☆21Updated last year
- Un-*** 50 billions multimodality dataset☆24Updated 2 years ago
- ☆64Updated last year
- ☆58Updated last year
- Code and pretrained models for the paper: "MatMamba: A Matryoshka State Space Model"☆59Updated 5 months ago
- Utilities for Training Very Large Models☆58Updated 7 months ago
- ☆68Updated 10 months ago
- Exploring an idea where one forgets about efficiency and carries out attention across each edge of the nodes (tokens)☆51Updated last month
- This is a PyTorch implementation of the paperViP A Differentially Private Foundation Model for Computer Vision☆36Updated last year
- M4 experiment logbook☆57Updated last year
- Experimental scripts for researching data adaptive learning rate scheduling.☆23Updated last year
- Implementation of some personal helper functions for Einops, my most favorite tensor manipulation library ❤️☆54Updated 2 years ago
- ☆33Updated 8 months ago
- Language models scale reliably with over-training and on downstream tasks☆97Updated last year
- Lottery Ticket Adaptation☆39Updated 5 months ago
- Official code for the paper: "Metadata Archaeology"☆19Updated 2 years ago
- ☆25Updated last year
- ☆72Updated 3 weeks ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆21Updated 9 months ago
- ☆22Updated 4 months ago
- MEXMA: Token-level objectives improve sentence representations☆41Updated 4 months ago
- ☆50Updated last year
- Code for "Everybody Prune Now: Structured Pruning of LLMs with only Forward Passes"☆28Updated last year
- Code for the note "NF4 Isn't Information Theoretically Optimal (and that's Good)☆18Updated last year
- ☆51Updated 11 months ago
- Official implementation of ECCV24 paper: POA☆24Updated 9 months ago
- Implementation of a holodeck, written in Pytorch☆17Updated last year
- Mixture-of-Transformers A Sparse and Scalable Architecture for Multi-Modal Foundation Models. TMLR 2025. 🔗 https//arxiv.org/abs/2411.049…☆46Updated last week
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"☆73Updated 6 months ago