albanie / foundation-models
Video descriptions of research papers relating to foundation models and scaling
☆31Updated 2 years ago
Alternatives and similar repositories for foundation-models
Users that are interested in foundation-models are comparing it to the libraries listed below
Sorting:
- Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy"☆101Updated 8 months ago
- ☆45Updated 3 months ago
- ☆22Updated 4 months ago
- ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models (ICLR 2024, Official Implementation)☆16Updated last year
- ☆25Updated 7 months ago
- ☆32Updated last year
- Implementation of MaMMUT, a simple vision-encoder text-decoder architecture for multimodal tasks from Google, in Pytorch☆100Updated last year
- ☆64Updated last year
- Holistic evaluation of multimodal foundation models