(TPAMI'2024) ZeroNLG: Aligning and Autoencoding Domains for Zero-Shot Multimodal and Multilingual Natural Language Generation
☆22Aug 8, 2024Updated last year
Alternatives and similar repositories for ZeroNLG
Users that are interested in ZeroNLG are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS 2022] Code for "Retrieve, Reason, and Refine: Generating Accurate and Faithful Discharge/Patient Instructions"☆37Jul 28, 2024Updated last year
- [npj Digital Medicine] A multimodal multidomain multilingual medical foundation model for zero shot clinical diagnosis☆18Feb 6, 2025Updated last year
- Code for PromptNet☆16Jan 29, 2025Updated last year
- [EMNLP2024] Benchmark for "Large Language Models Are Poor Clinical Decision-Makers: A Comprehensive Benchmark"☆37Sep 18, 2025Updated 7 months ago
- GPT-4V(ision) as A Social Media Analysis Engine☆39Dec 20, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A Benchmark Platform for Reinforcement Learning Based Dynamic Treatment Regime☆15Dec 7, 2024Updated last year
- ☆35Jul 25, 2024Updated last year
- Hidden Markov Models in JavaScript.☆12May 17, 2015Updated 10 years ago
- (ACL'2023) MultiCapCLIP: Auto-Encoding Prompts for Zero-Shot Multilingual Visual Captioning☆36Aug 8, 2024Updated last year
- ☆13Jul 10, 2024Updated last year
- ☆10Mar 18, 2025Updated last year
- [ICLR 2025] Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception☆15Jul 4, 2025Updated 10 months ago
- The implementation codes of paper: Multimodal Sentiment Analysis with Mutual Information-based Disentangled Representation Learning☆21May 8, 2025Updated 11 months ago
- Exposing Text-Image Inconsistency Using Diffusion Models (ICLR 2024)☆10Jun 15, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Grouping and Recognize speaker from an animation video. 从动漫中提取每一个说话人。☆13May 8, 2024Updated last year
- ☆21Dec 17, 2024Updated last year
- ☆19Jan 5, 2023Updated 3 years ago
- ☆15May 29, 2024Updated last year
- Official PyTorch Code for "Dynamic Temperature Knowledge Distillation"☆11Mar 28, 2025Updated last year
- Mini-Luotuo: A Diverse Herd of Distilled Chinese Models from Large-Scale Instructions☆18Jun 4, 2023Updated 2 years ago
- Adding a Randeng translation model on top of the instructBLIP model to enable Chinese testing of instructBLIP functionality.☆16May 30, 2023Updated 2 years ago
- Developer project for getting basic API integrations working in under 5 minutes☆11Jan 30, 2026Updated 3 months ago
- Video inpainting with automatic object detection☆24Nov 8, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- PyTorch Implementation for InMaP☆12Oct 28, 2023Updated 2 years ago
- CaMEL: Mean Teacher Learning for Image Captioning. ICPR 2022☆30Dec 1, 2022Updated 3 years ago
- ☆22Sep 4, 2023Updated 2 years ago
- Towards a Unified View on Visual Parameter-Efficient Transfer Learning☆26Oct 13, 2022Updated 3 years ago
- Code for MInD: Multimodal Information Disentanglement☆20Dec 17, 2025Updated 4 months ago
- 本项目用于Multimodal领域新手的学习路线,包括该领域的经典论文,项目及课程。旨在希望学习者在一定的时间内达到对这个领域有较为深刻的认知,能够自己进行的独立研究。☆49Mar 26, 2024Updated 2 years ago
- [ICML 2024] Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models☆19Mar 23, 2026Updated last month
- ☆13Dec 11, 2025Updated 4 months ago
- Official repository for LLaVA-Reward (ICCV 2025): Multimodal LLMs as Customized Reward Models for Text-to-Image Generation☆23Jul 30, 2025Updated 9 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆27Mar 3, 2025Updated last year
- Codebase for TF-Mamba: Text-enhanced Fusion Mamba with Missing Modalities for Robust Multimodal Sentiment Analysis. The code has been reo…☆33May 27, 2025Updated 11 months ago
- [CVPR 2025] GPS as a Control Signal for Image Generation☆25Mar 18, 2025Updated last year
- NLPIR tutorial: pretrain for IR. pre-train on raw textual corpus, fine-tune on MS MARCO Document Ranking☆13Sep 10, 2021Updated 4 years ago
- ☆14Mar 1, 2024Updated 2 years ago
- ☆24Jun 26, 2024Updated last year
- Code for GLoMo: Global-Local Modality Fusion for Multimodal Sentiment Analysis, which is accepted by ACM MM 24.☆37Dec 30, 2024Updated last year