(TPAMI'2024) ZeroNLG: Aligning and Autoencoding Domains for Zero-Shot Multimodal and Multilingual Natural Language Generation
☆22Aug 8, 2024Updated last year
Alternatives and similar repositories for ZeroNLG
Users that are interested in ZeroNLG are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [npj Digital Medicine] A multimodal multidomain multilingual medical foundation model for zero shot clinical diagnosis☆19Feb 6, 2025Updated last year
- Code for PromptNet☆16Jan 29, 2025Updated last year
- GPT-4V(ision) as A Social Media Analysis Engine☆39Dec 20, 2024Updated last year
- A Benchmark Platform for Reinforcement Learning Based Dynamic Treatment Regime☆15Dec 7, 2024Updated last year
- ☆36Jul 25, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- (ACL'2023) MultiCapCLIP: Auto-Encoding Prompts for Zero-Shot Multilingual Visual Captioning☆36Aug 8, 2024Updated last year
- ☆13Jul 10, 2024Updated last year
- ☆10Mar 18, 2025Updated last year
- [ICLR 2025] Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception☆15Jul 4, 2025Updated 11 months ago
- ☆15Mar 30, 2025Updated last year
- ☆22Dec 17, 2024Updated last year
- ☆16May 29, 2024Updated 2 years ago
- Adding a Randeng translation model on top of the instructBLIP model to enable Chinese testing of instructBLIP functionality.☆16May 30, 2023Updated 3 years ago
- Developer project for getting basic API integrations working in under 5 minutes☆11May 22, 2026Updated 3 weeks ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆13Oct 23, 2024Updated last year
- CaMEL: Mean Teacher Learning for Image Captioning. ICPR 2022☆30Dec 1, 2022Updated 3 years ago
- Towards a Unified View on Visual Parameter-Efficient Transfer Learning☆26Oct 13, 2022Updated 3 years ago
- Code for MInD: Multimodal Information Disentanglement☆20Jun 3, 2026Updated last week
- 本项目用于Multimodal领域新手的学习路线,包括该领域的经典论文,项目及课程。旨在希望学习者在一定的时间内达到对这个领域有较为深刻的认知,能够自己进行的独立研究。☆50Mar 26, 2024Updated 2 years ago
- Source codes of the our paper titled "Multi-level Textual-Visual Alignment and Fusion Network for Multimodal Aspect-based Sentiment Analy…☆17Apr 23, 2024Updated 2 years ago
- [ICML 2024] Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models☆19Mar 23, 2026Updated 2 months ago
- Official repository for LLaVA-Reward (ICCV 2025): Multimodal LLMs as Customized Reward Models for Text-to-Image Generation☆25Jul 30, 2025Updated 10 months ago
- ☆14Dec 11, 2025Updated 6 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆12Nov 27, 2022Updated 3 years ago
- ☆28Mar 3, 2025Updated last year
- ☆20Aug 22, 2024Updated last year
- Multimodal Sentiment Analysis with Image-Text Interaction Network☆17Aug 31, 2023Updated 2 years ago
- [CVPR 2025] GPS as a Control Signal for Image Generation☆25Mar 18, 2025Updated last year
- ☆18May 30, 2024Updated 2 years ago
- NLPIR tutorial: pretrain for IR. pre-train on raw textual corpus, fine-tune on MS MARCO Document Ranking☆13Sep 10, 2021Updated 4 years ago
- ☆14Mar 1, 2024Updated 2 years ago
- ☆25Jun 26, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Code for GLoMo: Global-Local Modality Fusion for Multimodal Sentiment Analysis, which is accepted by ACM MM 24.☆38Dec 30, 2024Updated last year
- Video Diffusion Transformers are In-Context Learners☆37Jan 6, 2025Updated last year
- Building or integrating an LLM wrapper shouldn't take more than 10 minutes.☆13Feb 1, 2025Updated last year
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Mar 30, 2021Updated 5 years ago
- Reward Guided Latent Consistency Distillation☆26Oct 9, 2024Updated last year
- Finding similarities between documents, and document search engine query language implementation☆11Dec 24, 2019Updated 6 years ago