[EMNLP'23 Oral] ReSee: Responding through Seeing Fine-grained Visual Knowledge in Open-domain Dialogue PyTorch Implementation
☆13Dec 4, 2023Updated 2 years ago
Alternatives and similar repositories for ReSee
Users that are interested in ReSee are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NLPCC'23] ZeroGen: Zero-shot Multimodal Controllable Text Generation with Multiple Oracles PyTorch Implementation☆14Oct 7, 2023Updated 2 years ago
- [TNNLS, to appear] FET-LM: Flow Enhanced Variational Auto-Encoder for Topic-Guided Language Modeling PyTorch Implementation☆14Mar 4, 2023Updated 3 years ago
- [KBS] PCAE: A Framework of Plug-in Conditional Auto-Encoder for Controllable Text Generation PyTorch Implementation☆26Apr 10, 2023Updated 2 years ago
- [TMLR 2024] Official implementation of "Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics"☆20Sep 15, 2023Updated 2 years ago
- [Preprint] AdaVAE: Exploring Adaptive GPT-2s in VAEs for Language Modeling PyTorch Implementation☆37Oct 18, 2023Updated 2 years ago
- [Paperlist] Awesome paper list of multimodal dialog, including methods, datasets and metrics☆37Jan 22, 2025Updated last year
- [Paperlist] Awesome paper list of controllable text generation via latent auto-encoders. Contributions of any kind are welcome.☆52Dec 23, 2022Updated 3 years ago
- [CVPR 2025 Highlight] Interpreting Object-level Foundation Models via Visual Precision Search☆56Nov 24, 2025Updated 3 months ago
- [Tool] AutoRec (2015) PyTorch Implementation☆10Mar 1, 2020Updated 6 years ago
- ☆23May 20, 2025Updated 10 months ago
- Novel Visual Category Discovery with Dual Ranking Statistics and Mutual Knowledge Distillation. Bingchen Zhao and Kai Han. (NeurIPS 2021)☆12Aug 20, 2023Updated 2 years ago
- This repo contains the official PyTorch implementation of vLMIG: Improving Visual Commonsense in Language Models via Multiple Image Gener…☆17Jul 1, 2024Updated last year
- Implementation for the paper "Unified Multimodal Model with Unlikelihood Training for Visual Dialog"☆13May 12, 2023Updated 2 years ago
- Official code and dataset for our NAACL 2024 paper: DialogCC: An Automated Pipeline for Creating High-Quality Multi-modal Dialogue Datase…☆13Jun 24, 2024Updated last year
- Source code for paper "VD-PCR: Improving Visual Dialog with Pronoun Coreference Resolution"☆10Nov 1, 2022Updated 3 years ago
- About face technology☆20Feb 9, 2023Updated 3 years ago
- [ACM MM21] Official Code: Identity-Preserving Face Anonymization via Adaptively Facial Attributes Obfuscation☆18Jun 5, 2024Updated last year
- Official PyTorch implementation for "Where You Edit is What You Get: Text-Guided Image Editing with Region-Based Attention" (Pattern Reco…☆10Oct 1, 2024Updated last year
- The codes and datasets about our ACL 2024 Main Conference paper titled "Cognitive Visual-Language Mapper: Advancing Multimodal Comprehens…☆18Jan 24, 2025Updated last year
- [ECCV2022] Rethinking Data Augmentation for Robust Visual Question Answering☆13Nov 23, 2022Updated 3 years ago
- init project☆15Jul 20, 2025Updated 8 months ago
- ☆12Jul 16, 2025Updated 8 months ago
- Code of D2TNet: A ConvLSTM Network with Dual-direction Transfer for Pan-sharpening☆14Dec 7, 2023Updated 2 years ago
- [ACM MM 2024] See or Guess: Counterfactually Regularized Image Captioning☆16Feb 17, 2025Updated last year
- PyTorch unoffical implementation of "PoE-GAN : Multimodal Conditional Image Synthesis with Product-of-Experts GANs"☆14Mar 29, 2023Updated 2 years ago
- Official code and dataset for our EMNLP 2024 Findings paper: Stark: Social Long-Term Multi-Modal Conversation with Persona Commonsense Kn…☆19Dec 27, 2024Updated last year
- 🌈 PyTorch Implementation for EMNLP'21 Findings "Reasoning Visual Dialog with Sparse Graph Learning and Knowledge Transfer"☆13Feb 1, 2023Updated 3 years ago
- A curated publication list on visual dialog☆14May 8, 2023Updated 2 years ago
- Code for AAAI24 paper Text-Guided Molecule Generation with Diffusion Language Model☆31Jun 24, 2025Updated 8 months ago
- Soulstyler: Using Large Language Model to Guide Image Style Transfer for Target Object☆18Dec 1, 2024Updated last year
- This paper presents our winning submission to Subtask 2 of SemEval 2024 Task 3 on multimodal emotion cause analysis in conversations.☆23Aug 2, 2024Updated last year
- naïve blockchain in Rust☆10Nov 13, 2020Updated 5 years ago
- [CVPR 2022] HINT: Hierarchical Neuron Concept Explainer☆20Apr 19, 2023Updated 2 years ago
- ☆16Jul 20, 2023Updated 2 years ago
- [WACV 2025] Official Implementation of LIME: Localized Image Editing via Attention Regularization in Diffusion Models☆10Apr 7, 2025Updated 11 months ago
- Bi-level feature alignment for versatile image translation and manipulation [ECCV 2022]☆18Nov 26, 2022Updated 3 years ago
- Conformer-based Metric GAN for speech enhancement☆27May 3, 2024Updated last year
- Official implement of our work: Sim2Word: Explaining Similarity with Representative Attribute Words via Counterfactual Explanations, whic…☆16Aug 1, 2023Updated 2 years ago
- code for 'Representation Learning for Visual Object Tracking by Masked Appearance Transfer'☆19Jun 10, 2023Updated 2 years ago