dhk1349 / MERLIN_text_to_video_searchLinks
[EMNLP 2024 Industry track] MERLIN : Multimodal Embedding Refinement via LLM-based Iterative Navigation for Text-Video Retrieval-Rerank Pipeline
☆14Updated 8 months ago
Alternatives and similar repositories for MERLIN_text_to_video_search
Users that are interested in MERLIN_text_to_video_search are comparing it to the libraries listed below
Sorting:
- ☆31Updated 2 years ago
- [ACL Main 2025] I0T: Embedding Standardization Method Towards Zero Modality Gap☆12Updated 5 months ago
- Bilinear Attention Networks for Korean Visual Question Answering☆24Updated last year
- Google's Conceptual Captions Dataset translated into Korean☆23Updated 3 years ago
- ☆19Updated 3 years ago
- ☆11Updated 2 months ago
- ☆24Updated 2 years ago
- [ICLR 2023] This is the code repo for our ICLR‘23 paper "Universal Vision-Language Dense Retrieval: Learning A Unified Representation Spa…☆53Updated last year
- On Efficient Language and Vision Assistants for Visually-Situated Natural Language Understanding: What Matters in Reading and Reasoning, …☆18Updated 11 months ago
- [NAACL 2024] Official repository for "KTRL+F: Knowledge-Augmented In-Document Search"☆23Updated last year
- Korean Abstract Meaning Representation (AMR) Corpus☆10Updated 3 years ago
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models☆25Updated last year
- 📸 Code and Dataset for our ACL 2023 paper: "MPCHAT: Towards Multimodal Persona-Grounded Conversation"☆22Updated 2 years ago
- An official codebase for "NormLens: Reading Books is Great, But Not if You Are Driving! Visually Grounded Reasoning about Defeasible Comm…☆10Updated last year
- Official code for paper "UniIR: Training and Benchmarking Universal Multimodal Information Retrievers" (ECCV 2024)☆168Updated last year
- [ECCV'24] Official Implementation of Autoregressive Visual Entity Recognizer.☆14Updated last year
- KLUE Benchmark 1st place (2021.12) solutions. (RE, MRC, NLI, STS, TC)☆25Updated 3 years ago
- Official implementation of Hierarchical Context Merging: Better Long Context Understanding for Pre-trained LLMs (ICLR 2024).☆43Updated last year
- A hackable, simple, and reseach-friendly GRPO Training Framework with high speed weight synchronization in a multinode environment.☆35Updated 3 months ago
- Code for "RADCoT: Retrieval-Augmented Distillation to Specialization Models for Generating Chain-of-Thoughts in Query Expansion", LREC-CO…☆10Updated last year
- [ACL 2024] FLEUR: An Explainable Reference-Free Evaluation Metric for Image Captioning Using a Large Multimodal Model☆17Updated 7 months ago
- CareCall for Seniors: Role Specified Open-Domain Dialogue dataset generated by leveraging LLMs (NAACL 2022).☆61Updated 3 years ago
- ☆24Updated last year
- Difference-based Contrastive Learning for Korean Sentence Embeddings☆24Updated 2 years ago
- Code and data for "KoDialogBench: Evaluating Conversational Understanding of Language Models with Korean Dialogue Benchmark" (LREC-COLING…☆17Updated 7 months ago
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Models☆15Updated 2 years ago
- Train 🤗transformers with DeepSpeed: ZeRO-2, ZeRO-3☆23Updated 4 years ago
- Evaluating Multimodal Generative AI with Korean Educational Standards, NAACL 2025.☆24Updated 6 months ago
- ☆40Updated 2 years ago
- Hate speech detection corpus in Korean, shared with EMNLP 2023 paper☆17Updated last year