RUC-AIMind / TikTalkView external linksLinks
☆71Jun 1, 2025Updated 8 months ago
Alternatives and similar repositories for TikTalk
Users that are interested in TikTalk are comparing it to the libraries listed below
Sorting:
- ☆21Aug 26, 2025Updated 5 months ago
- [ACM MM 2022]: Multi-Modal Experience Inspired AI Creation☆21Nov 27, 2024Updated last year
- Danmuku dataset☆11Jul 7, 2023Updated 2 years ago
- TL;DR: We propose a large-scale cross-domain persuasion dataset covers 13,000 scenarios in 35 domains, with the developed PersuGPT model …☆17Feb 12, 2025Updated last year
- Paper, dataset and code list for multimodal dialogue.☆22Jan 2, 2025Updated last year
- Open domain Chinese dialogue corpus and datasets.☆16Jan 8, 2022Updated 4 years ago
- [MMM 2025 Best Paper] RoLD: Robot Latent Diffusion for Multi-Task Policy Modeling☆22Aug 4, 2024Updated last year
- Matlab/Octave toolbox for deep learning. Includes Deep Belief Nets, Stacked Autoencoders, Convolutional Neural Nets, Convolutional Autoen…☆10Jul 10, 2013Updated 12 years ago
- [ACM MM 2024] See or Guess: Counterfactually Regularized Image Captioning☆16Feb 17, 2025Updated 11 months ago
- ☆12Jan 10, 2025Updated last year
- Code for EMNLP2019 paper "Low-Resource Response Generation with Template Prior"☆13Jan 17, 2020Updated 6 years ago
- The official site of paper MMDialog: A Large-scale Multi-turn Dialogue Dataset Towards Multi-modal Open-domain Conversation☆203Sep 3, 2023Updated 2 years ago
- M3ED: Multi-modal Multi-scene Multi-label Emotional Dialogue Database. ACL 2022☆121Sep 24, 2022Updated 3 years ago
- Mxnet2Caffe_Tensor RT☆18Apr 20, 2019Updated 6 years ago
- This repo contains codes and instructions for baselines in the VLUE benchmark.☆41Jul 16, 2022Updated 3 years ago
- PyTorch code for Improving Commonsense in Vision-Language Models via Knowledge Graph Riddles (DANCE)☆23Nov 29, 2022Updated 3 years ago
- [TMLR 2024] Official implementation of "Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics"☆20Sep 15, 2023Updated 2 years ago
- This repository contains the code for our ECCV 2022 paper "Temporal and cross-modal attention for audio-visual zero-shot learning"☆25Sep 12, 2025Updated 5 months ago
- flyai 医疗QA NLG☆21Oct 31, 2019Updated 6 years ago
- Code, Models and Datasets for OpenViDial Dataset☆132Jan 22, 2022Updated 4 years ago
- ACM MM '22: Unified Multi-modal Pre-training for Few-shot Sentiment Analysis with Prompt-based Learning☆18Dec 20, 2022Updated 3 years ago
- This is the official implementation of 2025 CVPR paper "EmoEdit: Evoking Emotions through Image Manipulation".☆35Dec 1, 2025Updated 2 months ago
- MFIN7036 NLP Course Project☆10Jul 25, 2024Updated last year
- State Transition Dialogue Manager☆26Apr 30, 2023Updated 2 years ago
- ☆30Sep 13, 2021Updated 4 years ago
- An Interpretable Neuro-Symbolic Framework for Task-Oriented Dialogue Generation☆23Mar 6, 2022Updated 3 years ago
- Talking head animation☆28Dec 8, 2023Updated 2 years ago
- DMRM: A Dual-channel Multi-hop Reasoning Model for Visual Dialog☆25Mar 8, 2022Updated 3 years ago
- ☆14Mar 12, 2023Updated 2 years ago
- code for COLING paper "A Hybrid Model of Classification and Generation for Spatial Relation Extraction"☆10Oct 20, 2022Updated 3 years ago
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning☆285Aug 20, 2023Updated 2 years ago
- [ICLR 2025 Spotlight] OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text☆412May 5, 2025Updated 9 months ago
- Integrating Graph Knowledge into End-to-End Task-Oriented Dialogue Systems☆29Apr 15, 2021Updated 4 years ago
- [CVPR2025] VDocRAG: Retirval-Augmented Generation over Visually-Rich Documents☆58May 26, 2025Updated 8 months ago
- A Chinese lyric corpus which contains nearly 50,000 lyrics from 500 artists☆39Jan 22, 2018Updated 8 years ago
- ✨✨ [ICLR 2026] MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models☆43Apr 10, 2025Updated 10 months ago
- ☆37Jul 9, 2024Updated last year
- This repository contains inference script for Face Swapping as A Simple Arithmetic Operation☆35Feb 23, 2023Updated 2 years ago
- [ACL 2022] The source code of Multi-Modal Sarcasm Detection via Cross-Modal Graph Convolutional Network☆40Mar 20, 2023Updated 2 years ago