lucaro / VBS-ArchiveLinks
Archive of Tasks and Results of the Video Browser Showdown
☆13Updated 9 months ago
Alternatives and similar repositories for VBS-Archive
Users that are interested in VBS-Archive are comparing it to the libraries listed below
Sorting:
- AICITY2024 Track 2 - Code from AIO_ISC Team☆37Updated last year
- ☆12Updated 2 years ago
- This is the official repository for Vista dataset - A Vietnamese multimodal dataset contains more than 700,000 samples of conversations a…☆26Updated last year
- ☆14Updated 3 years ago
- Vistral-V: Visual Instruction Tuning for Vistral - Vietnamese Large Vision-Language Model.☆23Updated last year
- 👨🏻💻 Code release for Vietnamese chatbot from scratch [Published in IEEE IMCOM 2022]☆17Updated last year
- Runner-up team (2nd place) in AI4VN2022: Air Quality Forcasting Challenge☆31Updated 2 years ago
- Baseline achieving 0.8 accuracy on the private test set in the ZaloAI Challenge 2023 Elementary Math Solving☆24Updated last year
- Dictionary-guided Scene Text Recognition (CVPR-2021)☆152Updated last year
- ☆67Updated last year
- Distributed Retrieval Evaluation Server☆16Updated last year
- Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023☆28Updated 2 years ago
- Text Query based Traffic Video Event Retrieval with Global-Local Fusion Embedding☆13Updated 2 years ago
- The largest VQA dataset for Vietnamese. Related to the text content in the image.☆21Updated 8 months ago
- An implementation of "CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model".☆146Updated last month
- The task aims at extracting required fields in receipts captured by mobile devices☆33Updated 3 years ago
- Open-source release of the SOMHunter video retrieval tool☆24Updated 2 years ago
- [ICPR-2024] S-MultiMAE - A Multi-Ground Truth approach for RGB-D Saliency Detection☆12Updated last year
- ☆32Updated 2 years ago
- [ACM TOMM 2023] - Composed Image Retrieval using Contrastive Learning and Task-oriented CLIP-based Features☆188Updated 2 years ago
- A strong baseline for liveness detection. The source code could be used for similar tasks, such as face anti-spoofing or detecting fake v…☆23Updated 3 years ago
- Multimodal Semi-Supervised Learning for Text Recognition (SemiMTR)☆83Updated 2 years ago
- OCR Annotations from Amazon Textract for Industry Documents Library☆103Updated 3 years ago
- VNHSGE: Vietnamese High School Graduation Examination Dataset for Large Language Models☆28Updated 2 years ago
- ☆11Updated 2 years ago
- ☆75Updated last year
- ☆38Updated 2 years ago
- Official implementation for Dessurt: Document end-to-end self-supervised understanding and recognition transformer☆62Updated 2 years ago
- 2024 Pedestrian Attribute Recognition and Attributed-based Person Retrieval Challenge at WACV☆23Updated last year
- This repository is created to share current progress of transformer based optical character recognition(OCR). Welcome to share~☆55Updated 2 years ago