(TPAMI'2024) ZeroNLG: Aligning and Autoencoding Domains for Zero-Shot Multimodal and Multilingual Natural Language Generation
☆22Aug 8, 2024Updated last year
Alternatives and similar repositories for ZeroNLG
Users that are interested in ZeroNLG are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS 2022] Code for "Retrieve, Reason, and Refine: Generating Accurate and Faithful Discharge/Patient Instructions"☆37Jul 28, 2024Updated last year
- [npj Digital Medicine] A multimodal multidomain multilingual medical foundation model for zero shot clinical diagnosis☆17Feb 6, 2025Updated last year
- Code for PromptNet☆16Jan 29, 2025Updated last year
- [EMNLP2024] Benchmark for "Large Language Models Are Poor Clinical Decision-Makers: A Comprehensive Benchmark"☆36Sep 18, 2025Updated 6 months ago
- GPT-4V(ision) as A Social Media Analysis Engine☆38Dec 20, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A Benchmark Platform for Reinforcement Learning Based Dynamic Treatment Regime☆14Dec 7, 2024Updated last year
- ☆35Jul 25, 2024Updated last year
- (ACL'2023) MultiCapCLIP: Auto-Encoding Prompts for Zero-Shot Multilingual Visual Captioning☆36Aug 8, 2024Updated last year
- ☆13Jul 10, 2024Updated last year
- [ICLR 2025] Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception☆14Jul 4, 2025Updated 8 months ago
- ☆15Mar 30, 2025Updated 11 months ago
- The implementation codes of paper: Multimodal Sentiment Analysis with Mutual Information-based Disentangled Representation Learning☆19May 8, 2025Updated 10 months ago
- Exposing Text-Image Inconsistency Using Diffusion Models (ICLR 2024)☆10Jun 15, 2024Updated last year
- ☆19Dec 17, 2024Updated last year
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Repo for "Uncertain Multimodal Intention and Emotion Understanding in the Wild"☆16Oct 20, 2025Updated 5 months ago
- ☆13May 29, 2024Updated last year
- Official PyTorch Code for "Dynamic Temperature Knowledge Distillation"☆11Mar 28, 2025Updated 11 months ago
- Adding a Randeng translation model on top of the instructBLIP model to enable Chinese testing of instructBLIP functionality.☆16May 30, 2023Updated 2 years ago
- Developer project for getting basic API integrations working in under 5 minutes☆11Jan 30, 2026Updated last month
- CaMEL: Mean Teacher Learning for Image Captioning. ICPR 2022☆29Dec 1, 2022Updated 3 years ago
- Code for MInD: Multimodal Information Disentanglement☆18Dec 17, 2025Updated 3 months ago
- The specific details of the AAAI2025 paper "Enriching Multimodal Sentiment Analysis Through Textual Emotional Descriptions of Visual-Audi…☆30Feb 27, 2025Updated last year
- ☆22Sep 4, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Codebase for TF-Mamba: Text-enhanced Fusion Mamba with Missing Modalities for Robust Multimodal Sentiment Analysis. The code has been reo…☆30May 27, 2025Updated 10 months ago
- 本项目用于Multimodal领域新手的学习路线,包括该领域的经典论文,项目及课程。旨在希望学习者在一定的时间内达到对这个领域有较为深刻的认知,能够自己进行的独立研究。☆47Mar 26, 2024Updated 2 years ago
- Source codes of the our paper titled "Multi-level Textual-Visual Alignment and Fusion Network for Multimodal Aspect-based Sentiment Analy…☆17Apr 23, 2024Updated last year
- [ICML 2024] Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models☆19Sep 3, 2024Updated last year
- Official repository for LLaVA-Reward (ICCV 2025): Multimodal LLMs as Customized Reward Models for Text-to-Image Generation☆23Jul 30, 2025Updated 7 months ago
- ☆11Nov 27, 2022Updated 3 years ago
- ☆20Aug 22, 2024Updated last year
- Multimodal Sentiment Analysis with Image-Text Interaction Network☆17Aug 31, 2023Updated 2 years ago
- [ICCV 2023] A Unified Continual Learning Framework with General Parameter-Efficient Tuning☆92Oct 9, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆18May 30, 2024Updated last year
- ☆14Mar 1, 2024Updated 2 years ago
- Video Diffusion Transformers are In-Context Learners☆35Jan 6, 2025Updated last year
- Code for GLoMo: Global-Local Modality Fusion for Multimodal Sentiment Analysis, which is accepted by ACM MM 24.☆35Dec 30, 2024Updated last year
- Django sample app for DigitalOcean App Platform☆12Nov 24, 2025Updated 4 months ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- Code for Paper 'Redefining Temporal Modeling in Video Diffusion: The Vectorized Timestep Approach'☆35Jan 2, 2026Updated 2 months ago