lucaro / VBS-Archive
Archive of Tasks and Results of the Video Browser Showdown
☆11 Updated 2 weeks ago
Alternatives and similar repositories for VBS-Archive:
Users who are interested in VBS-Archive are comparing it to the repositories listed below.
- Distributed Retrieval Evaluation Server ☆14 Updated 4 months ago
- VLSP2021 vieCap4H Challenge: Automatic image caption generation for healthcare domains in Vietnamese ☆11 Updated last year
- ☆12 Updated last year
- This is the official repository for Vista dataset - A Vietnamese multimodal dataset contains more than 700,000 samples of conversations a… ☆25 Updated 10 months ago
- AICITY2024 Track 2 - Code from AIO_ISC Team ☆31 Updated 8 months ago
- Text Query based Traffic Video Event Retrieval with Global-Local Fusion Embedding ☆12 Updated last year
- 👨🏻‍💻 Code release for Vietnamese chatbot from scratch [Published in IEEE IMCOM 2022] ☆17 Updated 7 months ago
- ☆13 Updated 2 years ago
- General template for most Pytorch projects ☆34 Updated 6 months ago
- ☆12 Updated last year
- The largest VQA dataset for Vietnamese. Related to the text content in the image. ☆15 Updated last month
- Vistral-V: Visual Instruction Tuning for Vistral - Vietnamese Large Vision-Language Model. ☆22 Updated 8 months ago
- Baseline achieving 0.8 accuracy on the private test set in the ZaloAI Challenge 2023 Elementary Math Solving ☆24 Updated 10 months ago
- Runner-up team (2nd place) in AI4VN2022: Air Quality Forecasting Challenge ☆32 Updated last year
- A strong baseline for liveness detection. The source code could be used for similar tasks, such as face anti-spoofing or detecting fake v… ☆22 Updated 2 years ago
- Implementation for the CVPR 2023 paper "Improving Selective Visual Question Answering by Learning from Your Peers" (https://arxiv.org/abs… ☆24 Updated last year
- [TMM 2023] VideoXum: Cross-modal Visual and Textural Summarization of Videos ☆43 Updated 11 months ago
- Code for CVPR 2023 paper "SViTT: Temporal Learning of Sparse Video-Text Transformers" ☆18 Updated last year
- Using open-source LLM Llama2 by Meta on local CPU inference for document question-and-answer ☆15 Updated last year
- Vietnamese handwritten text recognition system ☆17 Updated 3 years ago
- An AI-powered interactive video retrieval system ☆28 Updated 6 months ago
- Contrast-guided Feature Adjustment Module for Visual Information Extraction ☆28 Updated last year
- Official repository of "Chatting Makes Perfect: Chat-based Image Retrieval" ☆29 Updated last month
- ☆88 Updated last year
- Towards Video Text Visual Question Answering: Benchmark and Baseline ☆38 Updated last year
- VNHSGE: Vietnamese High School Graduation Examination Dataset for Large Language Models ☆25 Updated last year
- Official repository for the General Robust Image Task (GRIT) Benchmark ☆53 Updated last year
- Use LoRA technique to improve training Large Language Model ☆12 Updated last year
- [ACM TOMM 2023] - Composed Image Retrieval using Contrastive Learning and Task-oriented CLIP-based Features ☆174 Updated last year
- MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering. A comprehensive evaluation of multimodal large model multilingua… ☆53 Updated 3 months ago