albanie / foundation-modelsLinks
Video descriptions of research papers relating to foundation models and scaling
☆31Updated 2 years ago
Alternatives and similar repositories for foundation-models
Users that are interested in foundation-models are comparing it to the libraries listed below
Sorting:
- ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models (ICLR 2024, Official Implementation)☆16Updated last year
- Distributed Optimization Infra for learning CLIP models☆26Updated 8 months ago
- ☆22Updated 4 months ago
- Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy"☆101Updated 8 months ago
- Implementation of MaMMUT, a simple vision-encoder text-decoder architecture for multimodal tasks from Google, in Pytorch☆101Updated last year
- ☆32Updated last year
- This is a PyTorch implementation of the paperViP A Differentially Private Foundation Model for Computer Vision☆36Updated last year
- Implementation of "PaLM2-VAdapter:" from the multi-modal model paper: "PaLM2-VAdapter: Progressively Aligned Language Model Makes a Stron…☆15Updated 6 months ago
- ☆64Updated last year
- ☆50Updated 4 months ago
- Repository for the paper: "TiC-CLIP: Continual Training of CLIP Models".☆102Updated 11 months ago
- Minimal Implementation of Visual Autoregressive Modelling (VAR)☆33Updated 2 months ago
- Patching open-vocabulary models by interpolating weights☆91Updated last year
- How Good is Google Bard's Visual Understanding? An Empirical Study on Open Challenges☆30Updated last year
- ☆18Updated 2 years ago
- Codebase for adaptive continual memory☆13Updated last year
- Official code for the ICML 2024 paper "The Entropy Enigma: Success and Failure of Entropy Minimization"☆51Updated last year
- Code for the paper titled "CiT Curation in Training for Effective Vision-Language Data".☆78Updated 2 years ago
- The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns …☆16Updated last year
- Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24]☆56Updated 5 months ago
- ☆51Updated last month
- An official PyTorch implementation for CLIPPR☆29Updated last year
- ☆72Updated 7 months ago
- Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models☆44Updated 11 months ago
- We introduce new approach, Token Reduction using CLIP Metric (TRIM), aimed at improving the efficiency of MLLMs without sacrificing their…☆13Updated 5 months ago
- Memory-Efficient CUDA kernels for training ConvNets with PyTorch.☆41Updated 3 months ago
- ☆26Updated 7 months ago
- Description and applications of OpenAI's paper about DALL-E (2021) and implementation of other (CLIP-guided) zero-shot text-to-image gene…☆33Updated 2 years ago
- [ICCV23] Official implementation of eP-ALM: Efficient Perceptual Augmentation of Language Models.☆27Updated last year
- ☆51Updated 11 months ago