slds-lmu / seminar_multimodal_dlLinks
https://slds-lmu.github.io/seminar_multimodal_dl/
☆171Updated 2 years ago
Alternatives and similar repositories for seminar_multimodal_dl
Users that are interested in seminar_multimodal_dl are comparing it to the libraries listed below
Sorting:
- ☆134Updated 2 years ago
- Conference schedule, top papers, and analysis of the data for NeurIPS 2023!☆121Updated last year
- Code release for "Dropout Reduces Underfitting"☆317Updated 2 years ago
- In this page, I will provide a list of survey papers on topics related to deep learning and its applications in various fields.☆126Updated last year
- This repository contains an overview of important follow-up works based on the original Vision Transformer (ViT) by Google.☆187Updated 3 years ago
- Outlining techniques for improving the training performance of your PyTorch model without compromising its accuracy☆129Updated 2 years ago
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google Deepmind☆71Updated last year
- Video descriptions of research papers relating to foundation models and scaling☆30Updated 2 years ago
- Website☆57Updated 2 years ago
- Some personal experiments around routing tokens to different autoregressive attention, akin to mixture-of-experts☆120Updated last year
- ☆30Updated 2 years ago
- Implementation of the conditionally routed attention in the CoLT5 architecture, in Pytorch☆230Updated last year
- API Client for paperswithcode.com☆187Updated last year
- ☆100Updated 3 years ago
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day☆258Updated 2 years ago
- A modular PyTorch library for vision transformer models☆164Updated 2 years ago
- ☆97Updated last year
- Repository containing awesome resources regarding Hugging Face tooling.☆48Updated last year
- Implementation of 🌻 Mirasol, SOTA Multimodal Autoregressive model out of Google Deepmind, in Pytorch☆90Updated last year
- ICLR 2025 - official implementation for "I-Con: A Unifying Framework for Representation Learning"☆117Updated 5 months ago
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.☆160Updated last year
- Computer Vision and Pattern Recognition, NUS CS4243, 2022☆177Updated 3 years ago
- Probing the representations of Vision Transformers.☆336Updated 3 years ago
- Place where folks can contribute to 🤗 community events☆427Updated 2 years ago
- ☆166Updated 2 years ago
- ML/DL Math and Method notes☆64Updated 2 years ago
- Implementation of Zorro, Masked Multimodal Transformer, in Pytorch☆97Updated 2 years ago
- Implementation of CALM from the paper "LLM Augmented LLMs: Expanding Capabilities through Composition", out of Google Deepmind☆179Updated last year
- Visualizing query-key interactions in language + vision transformers (VIS 2023)☆156Updated last year
- LoRA and DoRA from Scratch Implementations☆215Updated last year