CMU-MultiComp-Lab / adv-mmml-courseLinks
☆40Updated last year
Alternatives and similar repositories for adv-mmml-course
Users that are interested in adv-mmml-course are comparing it to the libraries listed below
Sorting:
- ☆97Updated last year
- ☆30Updated 2 years ago
- ☆101Updated 3 years ago
- This repository provides a comprehensive collection of research papers focused on multimodal representation learning, all of which have b…☆82Updated 6 months ago
- code for the ddp tutorial☆32Updated 3 years ago
- This repository holds code and other relevant files for the NeurIPS 2022 tutorial: Foundational Robustness of Foundation Models.☆72Updated 2 years ago
- Toloka Visual Question Answering Challenge at WSDM Cup 2023☆31Updated last year
- A regression-alike loss to improve numerical reasoning in language models - ICML 2025☆27Updated 4 months ago
- ☆16Updated 2 years ago
- Video descriptions of research papers relating to foundation models and scaling☆30Updated 2 years ago
- ☆27Updated last year
- PyTorch implementation of Soft MoE by Google Brain in "From Sparse to Soft Mixtures of Experts" (https://arxiv.org/pdf/2308.00951.pdf)☆81Updated 2 years ago
- ScrollNet for Continual Learning☆11Updated 2 years ago
- The official GitHub page for paper "NegativePrompt: Leveraging Psychology for Large Language Models Enhancement via Negative Emotional St…☆24Updated last year
- https://slds-lmu.github.io/seminar_multimodal_dl/☆171Updated 2 years ago
- AutoMoE: Neural Architecture Search for Efficient Sparsely Activated Transformers☆48Updated 3 years ago
- Implementation of TableFormer, Robust Transformer Modeling for Table-Text Encoding, in Pytorch☆39Updated 3 years ago
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google Deepmind☆72Updated last year
- A curated list of vision-and-language pre-training (VLP). :-)☆60Updated 3 years ago
- (ACM MM24) This is the offical repository of GIST: Improving Parameter Efficient Fine Tuning via Knowledge Interaction.☆11Updated last year
- Official code for DAM: Dynamic Adapter Merging for Continual Video QA Learning☆14Updated last year
- A minimal implementation of LLaVA-style VLM with interleaved image & text & video processing ability.☆97Updated last year
- PyTorch implementation of LIMoE☆52Updated last year
- ☆134Updated 2 years ago
- Code for the paper "ClinicalBench: Can LLMs Beat Traditional ML Models in Clinical Prediction?"☆31Updated 6 months ago
- Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24]☆61Updated last year
- [TMLR 2022] High-Modality Multimodal Transformer☆117Updated last year
- ☆14Updated 2 years ago
- Implementation of 🌻 Mirasol, SOTA Multimodal Autoregressive model out of Google Deepmind, in Pytorch☆91Updated 2 years ago
- ☆19Updated 6 months ago