lucaro / VBS-Archive
Archive of Tasks and Results of the Video Browser Showdown
☆12 Updated 4 months ago
Alternatives and similar repositories for VBS-Archive
Users who are interested in VBS-Archive are comparing it to the repositories listed below
- ☆12 Updated last year
- AICITY2024 Track 2 - Code from AIO_ISC Team ☆35 Updated last year
- Text Query based Traffic Video Event Retrieval with Global-Local Fusion Embedding ☆12 Updated last year
- 👨🏻‍💻 Code release for Vietnamese chatbot from scratch [Published in IEEE IMCOM 2022] ☆17 Updated 11 months ago
- Distributed Retrieval Evaluation Server ☆15 Updated 7 months ago
- ☆14 Updated 3 years ago
- This is the official repository for Vista dataset - A Vietnamese multimodal dataset contains more than 700,000 samples of conversations a… ☆26 Updated last year
- Baseline achieving 0.8 accuracy on the private test set in the ZaloAI Challenge 2023 Elementary Math Solving ☆24 Updated last year
- VNHSGE: Vietnamese High School Graduation Examination Dataset for Large Language Models ☆27 Updated last year
- Vistral-V: Visual Instruction Tuning for Vistral - Vietnamese Large Vision-Language Model. ☆22 Updated last year
- Collection of recent advanced RAG techniques. ☆15 Updated last week
- ☆41 Updated last month
- Official codes of the 1st place for The NVIDIA AI City Challenge 2023 - Track 2 ☆19 Updated last year
- The largest VQA dataset for Vietnamese. Related to the text content in the image. ☆16 Updated 3 months ago
- ☆66 Updated last year
- Runner-up team (2nd place) in AI4VN2022: Air Quality Forecasting Challenge ☆31 Updated 2 years ago
- Dictionary-guided Scene Text Recognition (CVPR-2021) ☆150 Updated 11 months ago
- Pioneering in Vietnamese Multimodal Large Language Model ☆48 Updated 5 months ago
- 2nd BKAI CHALLENGE ☆8 Updated 3 years ago
- ☆60 Updated 6 months ago
- [ACM TOMM 2023] - Composed Image Retrieval using Contrastive Learning and Task-oriented CLIP-based Features ☆180 Updated last year
- General template for most Pytorch projects ☆35 Updated 3 months ago
- An implementation of "CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model". ☆142 Updated 4 months ago
- ☆46 Updated 2 years ago
- Foundation Models for Video Understanding: A Survey ☆126 Updated last week
- Multimodal Semi-Supervised Learning for Text Recognition (SemiMTR) ☆83 Updated last year
- ☆11 Updated last year
- The task aims at extracting required fields in receipts captured by mobile devices ☆32 Updated 2 years ago
- Open-source release of the SOMHunter video retrieval tool ☆21 Updated 2 years ago
- Vietnamese handwritten text recognition system ☆17 Updated 4 years ago