wangyu-ustc / LVChatLinks
The official implementation of the paper **LVChat: Facilitating Long Video Comprehension**
☆14Updated last year
Alternatives and similar repositories for LVChat
Users that are interested in LVChat are comparing it to the libraries listed below
Sorting:
- The official implementation of the paper "Large Scale Knowledge Washing"☆11Updated last year
- Active Example Selection for In-Context Learning (EMNLP'22)☆49Updated last year
- The demo for "Convolutional Poisson Gamma Belief Network" published in ICML2019☆11Updated 3 years ago
- A symbolic benchmark for verifiable chain-of-thought financial reasoning. Includes executable templates, 58 topics across 12 domains, and…☆21Updated last month
- ☆67Updated 2 years ago
- Models, data, and codes for the paper: MetaAligner: Towards Generalizable Multi-Objective Alignment of Language Models☆24Updated last year
- a multimodal retrieval dataset☆24Updated 2 years ago
- ☆20Updated last month
- ☆15Updated last year
- ☆10Updated last year
- ☆38Updated 7 months ago
- ☆21Updated 7 months ago
- ☆64Updated 3 years ago
- mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigating☆98Updated last year
- ☆22Updated last year
- ☆67Updated 2 years ago
- FaithScore: Fine-grained Evaluations of Hallucinations in Large Vision-Language Models☆31Updated 2 weeks ago
- The released data for paper "Measuring and Improving Chain-of-Thought Reasoning in Vision-Language Models".☆34Updated 2 years ago
- [EACL'23] COVID-VTS: Fact Extraction and Verification on Short Video Platforms☆11Updated 2 years ago
- EMNLP2023 - InfoSeek: A New VQA Benchmark focus on Visual Info-Seeking Questions☆25Updated last year
- Research code for "KAT: A Knowledge Augmented Transformer for Vision-and-Language"☆69Updated 3 years ago
- my commonly-used tools☆63Updated 11 months ago
- MultiInstruct: Improving Multi-Modal Zero-Shot Learning via Instruction Tuning☆134Updated 2 years ago
- [EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models.☆83Updated last year
- [EMNLP 2023] Once Upon a *Time* in *Graph*: Relative-Time Pretraining for Complex Temporal Reasoning☆17Updated 2 years ago
- [ICLR 2024] Analyzing and Mitigating Object Hallucination in Large Vision-Language Models☆154Updated last year
- Improving Language Understanding from Screenshots. Paper: https://arxiv.org/abs/2402.14073☆31Updated last year
- [NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$☆49Updated last year
- ☆13Updated 5 months ago
- ☆53Updated last year