Experiments with multimodal deep learning models based on transformers
☆11Oct 9, 2022Updated 3 years ago
Alternatives and similar repositories for multimodal-transformers-movies
Users that are interested in multimodal-transformers-movies are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Source code for the AAAI 2021 paper "Movie Summarization via Sparse Graph Construction"☆30Feb 18, 2021Updated 5 years ago
- ☆16Oct 14, 2020Updated 5 years ago
- The source code of the CVPR22 paper titled "Multi-Modal Dynamic Graph Transformer for Visual Grounding".☆22Mar 26, 2022Updated 4 years ago
- ☆22Mar 3, 2022Updated 4 years ago
- ☆12Apr 7, 2014Updated 11 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- fine-Grained classify 细颗粒度图像分类☆12Nov 21, 2022Updated 3 years ago
- ☆15Sep 23, 2020Updated 5 years ago
- Evaluating Durability: Benchmark Insights into Multimodal Watermarking☆12Jun 7, 2024Updated last year
- Using YouTube to prepare a speech recognition dataset for any language☆10Mar 30, 2021Updated 4 years ago
- Computation of binomial confidence intervals that achieve exact coverage.☆14Apr 23, 2025Updated 11 months ago
- An open-ended, self-improving AI system that evolves its own source code using a local LLM. Built for autonomy, reflection, and code evol…☆22Jan 24, 2026Updated 2 months ago
- ☆12Nov 21, 2023Updated 2 years ago
- The re-implementation of <End-to-End Lane Marker Detection via Row-wise Classification>☆14Sep 21, 2020Updated 5 years ago
- Library for soft prompt tuning☆22Jun 12, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆20Jun 21, 2024Updated last year
- Q-HEART: ECG Question Answering via Knowledge-Informed Multimodal LLMs (ECAI 2025)☆15Jan 23, 2026Updated 2 months ago
- [CVPR2019] Synthesizing Environment-Aware Activities via Activity Sketches☆13Oct 3, 2023Updated 2 years ago
- Implementation of the paper "Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network"☆16Nov 1, 2019Updated 6 years ago
- This is the official repository for CardioLab. A machine and deep learning framework for the estimation and monitoring of laboratory abno…☆17Jan 12, 2026Updated 2 months ago
- ☆12Jan 27, 2025Updated last year
- ☆15Sep 24, 2022Updated 3 years ago
- ☆13May 15, 2025Updated 10 months ago
- ECG reconstruction☆14Nov 29, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆22Feb 25, 2021Updated 5 years ago
- Papers of Implicit Reasoning in LLMs.☆24Mar 13, 2025Updated last year
- Dataset for Unified Editing, EMNLP 2023. This is a model editing dataset where edits are natural language phrases.☆24Sep 4, 2024Updated last year
- ECG-R1: Protocol-Guided and Modality-Agnostic MLLM for Reliable ECG Interpretation☆33Feb 21, 2026Updated last month
- The personal website of Scott W Harden☆13Mar 5, 2026Updated 3 weeks ago
- This is the official implementation for our ACL 2024 paper: "Causal Estimation of Memorisation Profiles".☆24Mar 25, 2025Updated last year
- Converts CLIP models to ONNX☆11Jan 17, 2023Updated 3 years ago
- [ACL 2022] The source code of Multi-Modal Sarcasm Detection via Cross-Modal Graph Convolutional Network☆40Mar 20, 2023Updated 3 years ago
- ☆18Apr 19, 2024Updated last year
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- State-of-the-art count-based word embeddings for low-resource languages.☆12Nov 13, 2025Updated 4 months ago
- Symmetrical Visual Contrastive Optimization: Aligning Vision-Language Models with Minimal Contrastive Images☆18Jun 4, 2025Updated 9 months ago
- ☆16Aug 10, 2022Updated 3 years ago
- ☆19Sep 10, 2023Updated 2 years ago
- [CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990)☆61Aug 17, 2021Updated 4 years ago
- ☆11Feb 24, 2022Updated 4 years ago
- Fast Axiomatic Attribution for Neural Networks (NeurIPS*2021)☆15Feb 24, 2026Updated last month