[2024-ACL]: TextBind: Multi-turn Interleaved Multimodal Instruction-following in the Wildrounded Conversation
☆47Sep 19, 2023Updated 2 years ago
Alternatives and similar repositories for TextBind
Users that are interested in TextBind are comparing it to the libraries listed below
Sorting:
- ☆14Nov 14, 2023Updated 2 years ago
- V1: Toward Multimodal Reasoning by Designing Auxiliary Task☆36Apr 14, 2025Updated 10 months ago
- [NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective☆40Oct 17, 2023Updated 2 years ago
- MMICL, a state-of-the-art VLM with the in context learning ability from ICL, PKU☆360Dec 18, 2023Updated 2 years ago
- ☆22Nov 6, 2022Updated 3 years ago
- This repo contains evaluation code for the paper "AV-Odyssey: Can Your Multimodal LLMs Really Understand Audio-Visual Information?"☆31Dec 23, 2024Updated last year
- Nested Named Entity Recognition for Chinese Biomedical Text☆11Jan 25, 2024Updated 2 years ago
- VideoNIAH: A Flexible Synthetic Method for Benchmarking Video MLLMs☆54Mar 9, 2025Updated 11 months ago
- ☆11Sep 7, 2020Updated 5 years ago
- ☆36Jan 13, 2026Updated last month
- PyTorch implementation of "Sample- and Parameter-Efficient Auto-Regressive Image Models" from CVPR 2025☆14Nov 21, 2025Updated 3 months ago
- ☆15Mar 12, 2024Updated last year
- ☆15Oct 20, 2023Updated 2 years ago
- Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023)☆32May 15, 2023Updated 2 years ago
- Neural Machine Translation in Pytorch☆31Jun 11, 2018Updated 7 years ago
- Codes for paper "Stylized Story Generation with Style-Guided Planning"☆12May 9, 2021Updated 4 years ago
- Official code for "What Makes for Good Visual Tokenizers for Large Language Models?".☆58Jun 27, 2023Updated 2 years ago
- ☆87Apr 15, 2022Updated 3 years ago
- Findings of ACL 2021☆24May 8, 2021Updated 4 years ago
- an end-to-end instance-segmentation framework inspired by YOLO and mask R-CNN☆13Nov 22, 2019Updated 6 years ago
- [NAACL'22] TaCL: Improving BERT Pre-training with Token-aware Contrastive Learning☆94Jun 8, 2022Updated 3 years ago
- Does Understanding Inform Generation in Unified Multimodal Models? From Analysis to Path Forward☆60Nov 27, 2025Updated 3 months ago
- WIP Tensorflow implementation of https://github.com/mozilla/TTS☆15Apr 11, 2020Updated 5 years ago
- Implementation of "Modeling Past and Future for Neural Machine Translation"☆15Mar 16, 2018Updated 7 years ago
- [ACL 2024] PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain☆106Mar 14, 2024Updated last year
- ☆15Dec 10, 2021Updated 4 years ago
- ☆17May 28, 2024Updated last year
- ☆76Oct 22, 2022Updated 3 years ago
- [arXiv: 2502.05178] QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation☆95Mar 1, 2025Updated last year
- Seq2BF:based on paper《Sequence to Backward and Forward Sequences: A Content-Introducing Approach to Generative Short-Text Conversation》,C…☆17Nov 18, 2018Updated 7 years ago
- A repository for dictionaries to be used with the Prosodylab-Aligner☆17May 13, 2014Updated 11 years ago
- The official GitHub page for ''What Makes for Good Visual Instructions? Synthesizing Complex Visual Reasoning Instructions for Visual Ins…☆19Nov 10, 2023Updated 2 years ago
- [CVPR 2023] Out-of-Distributed Semantic Pruning for Robust Semi-Supervised Learning☆22Jun 11, 2023Updated 2 years ago
- [TMLR 2024] Official implementation of "Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics"☆20Sep 15, 2023Updated 2 years ago
- Code for our ACL2021 paper Neural Machine Translation with Monolingual Translation Memory☆82Jun 12, 2023Updated 2 years ago
- ACL 2022(findings): A Sentence is Worth 128 Pseudo Tokens: A Semantic-Aware Contrastive Learning Framework for Sentence Embeddings☆18Mar 23, 2022Updated 3 years ago
- An Empirical Study On Contrastive Search And Contrastive Decoding For Open-ended Text Generation☆27Jun 7, 2024Updated last year
- Momentum Decoding: Open-ended Text Generation as Graph Exploration☆19Jan 27, 2023Updated 3 years ago
- [NeurIPS 2022] Non-Linguistic Supervision for Contrastive Learning of Sentence Embeddings☆22Jan 30, 2023Updated 3 years ago