lucaro / VBS-ArchiveLinks
Archive of Tasks and Results of the Video Browser Showdown
☆13Updated 6 months ago
Alternatives and similar repositories for VBS-Archive
Users that are interested in VBS-Archive are comparing it to the libraries listed below
Sorting:
- ☆12Updated last year
- This is the official repository for Vista dataset - A Vietnamese multimodal dataset contains more than 700,000 samples of conversations a…☆25Updated last year
- ☆14Updated 3 years ago
- AICITY2024 Track 2 - Code from AIO_ISC Team☆37Updated last year
- Text Query based Traffic Video Event Retrieval with Global-Local Fusion Embedding☆12Updated 2 years ago
- Baseline achieving 0.8 accuracy on the private test set in the ZaloAI Challenge 2023 Elementary Math Solving☆24Updated last year
- Vistral-V: Visual Instruction Tuning for Vistral - Vietnamese Large Vision-Language Model.☆22Updated last year
- Runner-up team (2nd place) in AI4VN2022: Air Quality Forcasting Challenge☆31Updated 2 years ago
- The task aims at extracting required fields in receipts captured by mobile devices☆33Updated 2 years ago
- ☆46Updated 2 years ago
- The largest VQA dataset for Vietnamese. Related to the text content in the image.☆20Updated 5 months ago
- Pioneering in Vietnamese Multimodal Large Language Model☆51Updated 7 months ago
- ☆72Updated last year
- 👨🏻💻 Code release for Vietnamese chatbot from scratch [Published in IEEE IMCOM 2022]☆17Updated last year
- Dictionary-guided Scene Text Recognition (CVPR-2021)☆152Updated last year
- MLOps Platform for MLOps Crash Course☆42Updated 2 years ago
- ☆67Updated last year
- ☆30Updated last year
- Official codes of the 1st place for The NVIDIA AI City Challenge 2023 - Track 2☆19Updated 2 years ago
- Baseline for ZaloAI Challenge 2023 Elementary Math Solving☆70Updated last year
- A collection of OCR-related datasets☆188Updated 3 years ago
- A curated list of papers about key information extraction.☆100Updated 9 months ago
- Vietnamese handwritten text recognition system☆17Updated 4 years ago
- Distributed Retrieval Evaluation Server☆15Updated 10 months ago
- Built and deployed scalable LLM retrieval APIs on a hybrid GCP architecture with full CI/CD, IaC, and monitoring☆70Updated last month
- An implementation of "CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model".☆144Updated 6 months ago
- [Thesis'24] Efficient Class Incremental Learning for Object Detection☆25Updated last year
- Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023☆27Updated 2 years ago
- A strong baseline for liveness detection. The source code could be used for similar tasks, such as face anti-spoofing or detecting fake v…☆23Updated 2 years ago
- ☆68Updated last year