albanie / foundation-models
Video descriptions of research papers relating to foundation models and scaling
☆30Updated last year
Alternatives and similar repositories for foundation-models:
Users that are interested in foundation-models are comparing it to the libraries listed below
- ☆41Updated last month
- Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy"☆100Updated 5 months ago
- ☆22Updated 4 months ago
- ☆31Updated last year
- PyTorch code for "Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention" (WACV 2023)☆33Updated 2 years ago
- ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models (ICLR 2024, Official Implementation)☆14Updated last year
- Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"☆35Updated 6 months ago
- This is a PyTorch implementation of the paperViP A Differentially Private Foundation Model for Computer Vision☆36Updated last year
- ☆64Updated last year
- The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns …☆16Updated last year
- Holistic evaluation of multimodal foundation models☆42Updated 6 months ago
- ☆51Updated 8 months ago
- ☆29Updated 2 years ago
- Code for the paper titled "CiT Curation in Training for Effective Vision-Language Data".☆78Updated 2 years ago
- Implementation of Zorro, Masked Multimodal Transformer, in Pytorch☆96Updated last year
- Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24]☆51Updated 2 months ago
- Code for NOLA, an implementation of "nola: Compressing LoRA using Linear Combination of Random Basis"☆51Updated 5 months ago
- [CVPR 2023] HierVL Learning Hierarchical Video-Language Embeddings☆45Updated last year
- ☆23Updated last year
- Implementation of MaMMUT, a simple vision-encoder text-decoder architecture for multimodal tasks from Google, in Pytorch☆99Updated last year
- Code release for paper Extremely Simple Activation Shaping for Out-of-Distribution Detection☆50Updated 5 months ago
- [ICCV23] Official implementation of eP-ALM: Efficient Perceptual Augmentation of Language Models.☆27Updated last year
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google Deepmind☆56Updated 5 months ago
- Implementation of Discrete Key / Value Bottleneck, in Pytorch☆87Updated last year
- M4 experiment logbook☆56Updated last year
- ☆20Updated last month
- Code for "Don't trust your eyes: on the (un)reliability of feature visualizations"☆31Updated last year
- ☆30Updated last year
- GeckoNum Benchmark for T2I Model Eval.☆11Updated 2 months ago
- We introduce new approach, Token Reduction using CLIP Metric (TRIM), aimed at improving the efficiency of MLLMs without sacrificing their…☆12Updated 2 months ago