lucaro / VBS-Archive
Archive of Tasks and Results of the Video Browser Showdown
☆11Updated last month
Alternatives and similar repositories for VBS-Archive:
Users that are interested in VBS-Archive are comparing it to the libraries listed below
- Distributed Retrieval Evaluation Server☆14Updated 5 months ago
- AICITY2024 Track 2 - Code from AIO_ISC Team☆32Updated 9 months ago
- ☆12Updated last year
- Collection of recent advanced RAG techniques.☆12Updated last week
- This is the official repository for Vista dataset - A Vietnamese multimodal dataset contains more than 700,000 samples of conversations a…☆25Updated 11 months ago
- VLSP2021 vieCap4H Challenge: Automatic image caption generation for healthcare domains in Vietnamese☆11Updated last year
- ☆13Updated 2 years ago
- Vistral-V: Visual Instruction Tuning for Vistral - Vietnamese Large Vision-Language Model.☆22Updated 9 months ago
- General template for most Pytorch projects☆34Updated 2 weeks ago
- The largest VQA dataset for Vietnamese. Related to the text content in the image.☆16Updated 2 weeks ago
- Baseline achieving 0.8 accuracy on the private test set in the ZaloAI Challenge 2023 Elementary Math Solving☆24Updated 11 months ago
- Text Query based Traffic Video Event Retrieval with Global-Local Fusion Embedding☆12Updated last year
- An AI-powered interactive video retrieval system☆31Updated 7 months ago
- ☆88Updated last year
- 👨🏻💻 Code release for Vietnamese chatbot from scratch [Published in IEEE IMCOM 2022]☆17Updated 8 months ago
- A strong baseline for liveness detection. The source code could be used for similar tasks, such as face anti-spoofing or detecting fake v…☆23Updated 2 years ago
- Open-source release of the SOMHunter video retrieval tool☆21Updated 2 years ago
- ☆23Updated last year
- Official repository of "Chatting Makes Perfect: Chat-based Image Retrieval"☆30Updated 2 months ago
- 2nd BKAI CHALLENGE☆8Updated 2 years ago
- Pioneering in Vietnamese Multimodal Large Language Model☆46Updated 3 months ago
- The task aims at extracting required fields in receipts captured by mobile devices☆32Updated 2 years ago
- (WACV 2025 - Oral) Vision-language conversation in 10 languages including English, Chinese, French, Spanish, Russian, Japanese, Arabic, H…☆84Updated 2 months ago
- ☆12Updated last year
- Easy-to-read implementation of self-supervised learning using vision transformer and knowledge distillation with no labels - DINO☆27Updated last year
- Runner-up team (2nd place) in AI4VN2022: Air Quality Forcasting Challenge☆31Updated last year
- (ACL'2023) MultiCapCLIP: Auto-Encoding Prompts for Zero-Shot Multilingual Visual Captioning☆35Updated 8 months ago
- RO-ViT CVPR 2023 "Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers"☆18Updated last year
- Using open-source LLM Llama2 by Meta on local CPU inference for document question-and-answer☆15Updated last year
- LibMoE: A LIBRARY FOR COMPREHENSIVE BENCHMARKING MIXTURE OF EXPERTS IN LARGE LANGUAGE MODELS☆37Updated 3 months ago