ImKeTT/ReSee

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ImKeTT/ReSee)

ImKeTT / ReSee

[EMNLP'23 Oral] ReSee: Responding through Seeing Fine-grained Visual Knowledge in Open-domain Dialogue PyTorch Implementation

☆12

Alternatives and similar repositories for ReSee

Users that are interested in ReSee are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ImKeTT / ZeroGen
View on GitHub
[NLPCC'23] ZeroGen: Zero-shot Multimodal Controllable Text Generation with Multiple Oracles PyTorch Implementation
☆14Oct 7, 2023Updated 2 years ago
ImKeTT / FET-LM
View on GitHub
[TNNLS, to appear] FET-LM: Flow Enhanced Variational Auto-Encoder for Topic-Guided Language Modeling PyTorch Implementation
☆14Mar 4, 2023Updated 3 years ago
ImKeTT / PCAE
View on GitHub
[KBS] PCAE: A Framework of Plug-in Conditional Auto-Encoder for Controllable Text Generation PyTorch Implementation
☆26Apr 10, 2023Updated 3 years ago
UCSC-VLAA / Sight-Beyond-Text
View on GitHub
[TMLR 2024] Official implementation of "Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics"
☆20Sep 15, 2023Updated 2 years ago
ImKeTT / AdaVAE
View on GitHub
[Preprint] AdaVAE: Exploring Adaptive GPT-2s in VAEs for Language Modeling PyTorch Implementation
☆38Oct 18, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Yuco-Z / Awesome-Multi-Modal-Dialog
View on GitHub
[Paperlist] Awesome paper list of multimodal dialog, including methods, datasets and metrics
☆36Jan 22, 2025Updated last year
ImKeTT / CTG-latentAEs
View on GitHub
[Paperlist] Awesome paper list of controllable text generation via latent auto-encoders. Contributions of any kind are welcome.
☆51Dec 23, 2022Updated 3 years ago
UCSC-VLAA / VLAA-GUI
View on GitHub
Official implementation of VLAA-GUI series
☆34Jun 20, 2026Updated last month
ImKeTT / AutoRec-Pytorch
View on GitHub
[Tool] AutoRec (2015) PyTorch Implementation
☆10Mar 1, 2020Updated 6 years ago
RuoyuChen10 / VPS
View on GitHub
[CVPR 2025 Highlight] Interpreting Object-level Foundation Models via Visual Precision Search
☆58Nov 24, 2025Updated 7 months ago
DTennant / dual-rank-ncd
View on GitHub
Novel Visual Category Discovery with Dual Ranking Statistics and Mutual Knowledge Distillation. Bingchen Zhao and Kai Han. (NeurIPS 2021)
☆12Aug 20, 2023Updated 2 years ago
passing2961 / DialogCC
View on GitHub
Official code and dataset for our NAACL 2024 paper: DialogCC: An Automated Pipeline for Creating High-Quality Multi-modal Dialogue Datase…
☆13Jun 24, 2024Updated 2 years ago
RuoyuChen10 / Facial_Attributes_Obfuscation
View on GitHub
[ACM MM21] Official Code: Identity-Preserving Face Anonymization via Adaptively Facial Attributes Obfuscation
☆18Jun 5, 2024Updated 2 years ago
MaPM-git / MapleDpm
View on GitHub
init project
☆15Jul 20, 2025Updated last year
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
ZihaoW123 / UniMM
View on GitHub
Implementation for the paper "Unified Multimodal Model with Unlikelihood Training for Visual Dialog"
☆13May 12, 2023Updated 3 years ago
RuoyuChen10 / FaceTechnologyTool
View on GitHub
About face technology
☆20Feb 9, 2023Updated 3 years ago
HKUST-KnowComp / VD-PCR
View on GitHub
Source code for paper "VD-PCR: Improving Visual Dialog with Pronoun Coreference Resolution"
☆10Nov 1, 2022Updated 3 years ago
Big-Brother-Pikachu / Where2edit
View on GitHub
Official PyTorch implementation for "Where You Edit is What You Get: Text-Guided Image Editing with Region-Based Attention" (Pattern Reco…
☆10Oct 1, 2024Updated last year
guyyariv / LaMI
View on GitHub
[ACL 2026 Oral] Official implementation of LaMI: Augmenting Large Language Models via Late Multi-Image Fusion
☆19Jul 4, 2026Updated 2 weeks ago
HITsz-TMG / Cognitive-Visual-Language-Mapper
View on GitHub
The codes and datasets about our ACL 2024 Main Conference paper titled "Cognitive Visual-Language Mapper: Advancing Multimodal Comprehens…
☆17Jan 24, 2025Updated last year
Meiqi-Gong / D2TNet
View on GitHub
Code of D2TNet: A ConvLSTM Network with Dual-direction Transfer for Pan-sharpening
☆13Dec 7, 2023Updated 2 years ago
ItemZheng / KDDAug
View on GitHub
[ECCV2022] Rethinking Data Augmentation for Robust Visual Question Answering
☆13Nov 23, 2022Updated 3 years ago
hyoseok1223 / Product-of-Experts-GAN
View on GitHub
PyTorch unoffical implementation of "PoE-GAN : Multimodal Conditional Image Synthesis with Product-of-Experts GANs"
☆15Mar 29, 2023Updated 3 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
passing2961 / Stark
View on GitHub
Official code and dataset for our EMNLP 2024 Findings paper: Stark: Social Long-Term Multi-Modal Conversation with Persona Commonsense Kn…
☆19Dec 27, 2024Updated last year
Aman-4-Real / See-or-Guess
View on GitHub
[ACM MM 2024] See or Guess: Counterfactually Regularized Image Captioning
☆16Feb 17, 2025Updated last year
ChengshuaiZhao0 / The-Wolf-Within
View on GitHub
☆13Updated this week
LibrAIResearch / libra-eval
View on GitHub
☆23May 20, 2025Updated last year
yisuanwang / Finestyler
View on GitHub
Soulstyler: Using Large Language Model to Guide Image Style Transfer for Target Object
☆19Dec 1, 2024Updated last year
gicheonkang / sglkt-visdial
View on GitHub
🌈 PyTorch Implementation for EMNLP'21 Findings "Reasoning Visual Dialog with Sparse Graph Learning and Knowledge Transfer"
☆13Feb 1, 2023Updated 3 years ago
MIPS-COLT / MER-MCE
View on GitHub
This paper presents our winning submission to Subtask 2 of SemEval 2024 Task 3 on multimodal emotion cause analysis in conversations.
☆25Aug 2, 2024Updated last year
MengyuanChen21 / Awesome-Visual-Dialog
View on GitHub
A curated publication list on visual dialog
☆14May 8, 2023Updated 3 years ago
enisimsar / LIME
View on GitHub
[WACV 2025] Official Implementation of LIME: Localized Image Editing via Attention Regularization in Diffusion Models
☆10Apr 7, 2025Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
ixxchan / nb
View on GitHub
naïve blockchain in Rust
☆10Nov 13, 2020Updated 5 years ago
RuoyuChen10 / Sim2Word
View on GitHub
Official implement of our work: Sim2Word: Explaining Similarity with Representative Attribute Words via Counterfactual Explanations, whic…
☆16Aug 1, 2023Updated 2 years ago
fnzhan / RABIT
View on GitHub
Bi-level feature alignment for versatile image translation and manipulation [ECCV 2022]
☆18Nov 26, 2022Updated 3 years ago
AntonotnaWang / HINT
View on GitHub
[CVPR 2022] HINT: Hierarchical Neuron Concept Explainer
☆20Apr 19, 2023Updated 3 years ago
CRIPAC-DIG / tgm-dlm
View on GitHub
Code for AAAI24 paper Text-Guided Molecule Generation with Diffusion Language Model
☆33Jun 24, 2025Updated last year
SherifAbdulatif / CMGAN
View on GitHub
Conformer-based Metric GAN for speech enhancement
☆27May 3, 2024Updated 2 years ago
difhnp / MAT
View on GitHub
code for 'Representation Learning for Visual Object Tracking by Masked Appearance Transfer'
☆19Jun 10, 2023Updated 3 years ago