自己阅读的多模态对话系统论文(及部分笔记)汇总
☆22Jan 5, 2023Updated 3 years ago
Alternatives and similar repositories for Multimodel-Dialog
Users that are interested in Multimodel-Dialog are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Multi-step reasoning MLLM☆16Mar 8, 2026Updated 2 weeks ago
- The contrastive token loss function for reducing generative repetition of autoregressive neural language models.☆13May 11, 2022Updated 3 years ago
- [EMNLP25 Main]The official code of "Gradient-Attention Guided Dual-Masking Synergetic Framework for Robust Text-based Person Retrieval"☆23Mar 11, 2026Updated 2 weeks ago
- Collection of evaluation code for natural language generation.☆12Jan 6, 2021Updated 5 years ago
- Agent-RRM: Exploring Reasoning Reward Model for Agents☆55Mar 17, 2026Updated last week
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- 🔥 A recruiting software built with React/Redux (Client), Node/Express (API), and MongoDB (Database).☆13May 2, 2024Updated last year
- nips25-all-papers☆37Feb 26, 2026Updated last month
- A python script for downloading huggingface datasets and models.☆20Apr 10, 2025Updated 11 months ago
- List of papers about Large Multimodal model☆31May 31, 2025Updated 9 months ago
- PyTorch implementation of CARE☆16Oct 6, 2023Updated 2 years ago
- Cross-modal Active Complementary Learning with Self-refining Correspondence (NeurIPS 2023, Pytorch Code)☆15Jun 6, 2024Updated last year
- The code of MGCC: Text-based Occluded Person Re-identification via Multi-Granularity Contrastive Consistency Learning☆20Feb 26, 2025Updated last year
- Code for the ACL 2023 paper Scene Graph as Pivoting: Inference-time Image-free Unsupervised Multimodal Machine Translation with Visual Sc…☆12May 19, 2023Updated 2 years ago
- A python tool help to interact with chatgpt.☆10Dec 11, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- The code of ACL2022 paper "Conditional Bilingual Mutual Information based Adaptive Training for Neural Machine Translation"..☆14Aug 6, 2022Updated 3 years ago
- [ACL 2024] On the Multi-turn Instruction Following for Conversational Web Agents☆17Oct 12, 2024Updated last year
- an implementation of paper"Retentive Network: A Successor to Transformer for Large Language Models" https://arxiv.org/pdf/2307.08621.pdf☆11Jul 25, 2023Updated 2 years ago
- One implementation of the paper "Coreference-Aware Dialogue Summarization".☆19Nov 9, 2023Updated 2 years ago
- PyTorch implementation of L2R2 in SIGIR 2020☆17Jun 12, 2023Updated 2 years ago
- ☆11May 24, 2024Updated last year
- (ACL'2023) MultiCapCLIP: Auto-Encoding Prompts for Zero-Shot Multilingual Visual Captioning☆36Aug 8, 2024Updated last year
- This repo contains the dataset for the EMNLP 2022 paper "Why Do You Feel This Way? Summarizing Triggers of Emotions in Social Media Posts…☆19Oct 9, 2023Updated 2 years ago
- ☆18May 24, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆17May 15, 2023Updated 2 years ago
- Posterior Control of Blackbox Generation☆23May 2, 2020Updated 5 years ago
- 📸 Code and Dataset for our ACL 2023 paper: "MPCHAT: Towards Multimodal Persona-Grounded Conversation"☆22Sep 5, 2023Updated 2 years ago
- [EMNLP'25 main] This is the official repo for the paper, Can LLMs be Good Graph Judge for Knowledge Graph Construction?☆27Sep 23, 2025Updated 6 months ago
- Code for "Visual Spatial Description: Controlled Spatial-Oriented Image-to-Text Generation"☆26Mar 9, 2024Updated 2 years ago
- ☆50Feb 23, 2021Updated 5 years ago
- 😜Constrative Learning of Sentence Embedding using LoRA (EECS487 final project)☆13Apr 19, 2023Updated 2 years ago
- A fluent, scalable, and easy-to-use LLM data processing framework.☆28Jan 31, 2026Updated last month
- Code for the paper "Cottention: Linear Transformers With Cosine Attention"☆20Nov 15, 2025Updated 4 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- 力扣题单hot100的ACM模式实现☆31Sep 2, 2025Updated 6 months ago
- M-SENA: All-in-One Platform for Multimodal Sentiment Analysis☆100Mar 30, 2022Updated 3 years ago
- AI Emoji Argue Agent 🚀 基于LangChain的开源表情包斗图Agent☆28May 30, 2024Updated last year
- The model implementations for T5 encoder decoder soft prompt tuning for text generation.☆25Dec 5, 2022Updated 3 years ago
- LingYi: Multi-modal Medical Conversational Question Answering System based on Knowledge Graph☆32Feb 9, 2023Updated 3 years ago
- ☆23Sep 12, 2023Updated 2 years ago
- [TPAMI2024] Codes and Models for VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset☆307Dec 25, 2024Updated last year