Experiments with multimodal deep learning models based on transformers
☆11Oct 9, 2022Updated 3 years ago
Alternatives and similar repositories for multimodal-transformers-movies
Users that are interested in multimodal-transformers-movies are comparing it to the libraries listed below.
- Labeled Movie Trailer Dataset☆16Mar 23, 2018Updated 8 years ago
- Code & data for IJCAI'22 paper "Recipe2Vec: Multi-modal Recipe Representation Learning with Graph Neural Networks".☆14Jul 24, 2022Updated 3 years ago
- Condensed Movies Challenge 2021☆20Sep 21, 2022Updated 3 years ago
- ☆22Mar 3, 2022Updated 4 years ago
- PSENet, pruned model, text detection☆16Jun 24, 2020Updated 5 years ago
- Movie Screenplay Parser☆13Apr 29, 2024Updated 2 years ago
- [ACL 2024] "Understanding and Patching Compositional Reasoning in LLMs"☆14Aug 28, 2024Updated last year
- Using YouTube to prepare a speech recognition dataset for any language☆10Mar 30, 2021Updated 5 years ago
- ☆41Jun 29, 2022Updated 3 years ago
- Computation of binomial confidence intervals that achieve exact coverage.☆15Apr 23, 2025Updated last year
- 2021 Tencent Advertising Algorithm Competition, Track 2, 6th place in the finals☆42Oct 7, 2022Updated 3 years ago
- Comic speech-bubble recognition implemented with object detection (comic, textboxs)☆12Apr 14, 2019Updated 7 years ago
- ☆10Feb 16, 2025Updated last year
- An open-ended, self-improving AI system that evolves its own source code using a local LLM. Built for autonomy, reflection, and code evol…☆23Jan 24, 2026Updated 3 months ago
- ☆12Nov 21, 2023Updated 2 years ago
- Simple MoE - Day 17 of 365 Days of Repos☆19Apr 21, 2026Updated 2 weeks ago
- Library for soft prompt tuning☆22Jun 12, 2023Updated 2 years ago
- [CVPR2019] Synthesizing Environment-Aware Activities via Activity Sketches☆13Oct 3, 2023Updated 2 years ago
- ☆12Jan 27, 2025Updated last year
- Automatically generated television (draft)☆18Feb 27, 2023Updated 3 years ago
- ☆15Sep 24, 2022Updated 3 years ago
- ECG reconstruction☆14Nov 29, 2023Updated 2 years ago
- ☆22Feb 25, 2021Updated 5 years ago
- Type is a directed typing experiment. You choose the direction the letters should flow.☆11Nov 7, 2021Updated 4 years ago
- Q-HEART: ECG Question Answering via Knowledge-Informed Multimodal LLMs (ECAI 2025)☆16Apr 17, 2026Updated 3 weeks ago
- Tensorflow Implementation of "Theory and Experiments on Vector Quantized Autoencoders"☆15Feb 27, 2019Updated 7 years ago
- Dataset for Unified Editing, EMNLP 2023. This is a model editing dataset where edits are natural language phrases.☆24Sep 4, 2024Updated last year
- A configurable twitter bot that should be used responsibly☆10Jan 29, 2016Updated 10 years ago
- EACL 2023 paper "MLASK: Multimodal Summarization of Video-based News Articles"☆12Nov 7, 2023Updated 2 years ago
- A polygon detector based on obb-yolov3 (WIP)☆17Jul 21, 2021Updated 4 years ago
- Capstone project for Galvanize - Data Science Immersive. 'Project Plotline' looks at the emotional content of movie scripts (web scraping…☆16Sep 27, 2016Updated 9 years ago
- Keras 3 Implementation of Video Swin Transformers for 3D Video Modeling☆32Oct 29, 2025Updated 6 months ago
- Official repository of Myna: Masking-Based Contrastive Learning of Musical Representations☆17Mar 31, 2025Updated last year
- Code for "A Graph-Based Framework to Bridge Movies and Synopses", ICCV2019☆52Aug 9, 2020Updated 5 years ago
- Converts CLIP models to ONNX☆11Jan 17, 2023Updated 3 years ago
- [ACL 2022] The source code of Multi-Modal Sarcasm Detection via Cross-Modal Graph Convolutional Network☆40Mar 20, 2023Updated 3 years ago
- State-of-the-art count-based word embeddings for low-resource languages.☆12Nov 13, 2025Updated 5 months ago
- PyTorch implementation (LeNet, VGGNet, GAN, UNet, Mask R-CNN, ...)☆31May 19, 2021Updated 4 years ago
- Collaborative shopping basket built with Liveblocks in React/Next.js☆15Nov 27, 2023Updated 2 years ago