[EMNLP'23 Oral] ReSee: Responding through Seeing Fine-grained Visual Knowledge in Open-domain Dialogue PyTorch Implementation
☆13Dec 4, 2023Updated 2 years ago
Alternatives and similar repositories for ReSee
Users that are interested in ReSee are comparing it to the libraries listed below
Sorting:
- [NLPCC'23] ZeroGen: Zero-shot Multimodal Controllable Text Generation with Multiple Oracles PyTorch Implementation☆14Oct 7, 2023Updated 2 years ago
- [TNNLS, to appear] FET-LM: Flow Enhanced Variational Auto-Encoder for Topic-Guided Language Modeling PyTorch Implementation☆14Mar 4, 2023Updated 2 years ago
- [TMLR 2024] Official implementation of "Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics"☆20Sep 15, 2023Updated 2 years ago
- [KBS] PCAE: A Framework of Plug-in Conditional Auto-Encoder for Controllable Text Generation PyTorch Implementation☆26Apr 10, 2023Updated 2 years ago
- [Preprint] AdaVAE: Exploring Adaptive GPT-2s in VAEs for Language Modeling PyTorch Implementation☆37Oct 18, 2023Updated 2 years ago
- [Paperlist] Awesome paper list of multimodal dialog, including methods, datasets and metrics☆37Jan 22, 2025Updated last year
- Novel Visual Category Discovery with Dual Ranking Statistics and Mutual Knowledge Distillation. Bingchen Zhao and Kai Han. (NeurIPS 2021)☆12Aug 20, 2023Updated 2 years ago
- [Tool] AutoRec (2015) PyTorch Implementation☆10Mar 1, 2020Updated 6 years ago
- [Paperlist] Awesome paper list of controllable text generation via latent auto-encoders. Contributions of any kind are welcome.☆52Dec 23, 2022Updated 3 years ago
- [CVPR 2025 Highlight] Interpreting Object-level Foundation Models via Visual Precision Search☆54Nov 24, 2025Updated 3 months ago
- [ACM MM21] Official Code: Identity-Preserving Face Anonymization via Adaptively Facial Attributes Obfuscation☆18Jun 5, 2024Updated last year
- [CVPR 2022] HINT: Hierarchical Neuron Concept Explainer☆20Apr 19, 2023Updated 2 years ago
- About face technology☆20Feb 9, 2023Updated 3 years ago
- Github code for the paper Maximum Class Separation as Inductive Bias in One Matrix. Arxiv link: https://arxiv.org/abs/2206.08704☆30Apr 21, 2023Updated 2 years ago
- Official implement of our work: Sim2Word: Explaining Similarity with Representative Attribute Words via Counterfactual Explanations, whic…☆16Aug 1, 2023Updated 2 years ago
- A music composer and player with MATLAB☆11Mar 14, 2020Updated 5 years ago
- LLM-based autonomous world☆42Sep 1, 2023Updated 2 years ago
- [ICLR 2024 Oral] Less is More: Fewer Interpretable Region via Submodular Subset Selection☆88Oct 27, 2025Updated 4 months ago
- [TPAMI 2025] Generalized Semantic Contrastive Learning via Embedding Side Information for Few-Shot Object Detection☆53Jul 1, 2025Updated 8 months ago
- ☆11Feb 28, 2024Updated 2 years ago
- Teaching Categories to Human Learners with Visual Explanations - CVPR 2018☆11Jun 21, 2022Updated 3 years ago
- A pytorch image classifier for the recognising letters from the notMNIST dataset☆11Jan 4, 2019Updated 7 years ago
- Implementation for the paper "Unified Multimodal Model with Unlikelihood Training for Visual Dialog"☆13May 12, 2023Updated 2 years ago
- Source code for paper "VD-PCR: Improving Visual Dialog with Pronoun Coreference Resolution"☆10Nov 1, 2022Updated 3 years ago
- [ISSTA 2025] Unlocking Low Frequency Syscalls in Kernel Fuzzing with Dependency-Based RAG☆52Jan 29, 2026Updated last month
- 2021-2022国科大强化学习格斗游戏大作业☆37Jun 11, 2022Updated 3 years ago
- DatasetResearch: Benchmarking Agent Systems for Demand-Driven Dataset Discovery☆20Sep 24, 2025Updated 5 months ago
- naïve blockchain in Rust☆10Nov 13, 2020Updated 5 years ago
- ☆13Oct 7, 2024Updated last year
- The codes and datasets about our ACL 2024 Main Conference paper titled "Cognitive Visual-Language Mapper: Advancing Multimodal Comprehens…☆17Jan 24, 2025Updated last year
- Post-selection inference based on truncated Gaussians for the HSIC-Lasso feature selection procedure☆10Jun 17, 2021Updated 4 years ago
- A curated publication list on visual dialog☆14May 8, 2023Updated 2 years ago
- ☆18Dec 3, 2021Updated 4 years ago
- Official implementation of "Interpreting and Controlling Vision Foundation Models via Text Explanations"☆14May 29, 2024Updated last year
- ☆12Aug 25, 2022Updated 3 years ago
- ☆10Feb 19, 2019Updated 7 years ago
- Entity Summarization with User Feedback (ESWC 2020)☆10Dec 8, 2020Updated 5 years ago
- A Simple Framwork for CV Pre-training Model (SOCO, VirTex, BEiT)☆15Oct 18, 2021Updated 4 years ago
- Combines the SSL Method MixMatch with a pre-trained model (EfficientNet) on a chest x-ray dataset.☆11Jun 22, 2019Updated 6 years ago