lucaro / VBS-Archive
Archive of Tasks and Results of the Video Browser Showdown
☆11Updated 3 weeks ago
Alternatives and similar repositories for VBS-Archive:
Users who are interested in VBS-Archive are comparing it to the repositories listed below
- Distributed Retrieval Evaluation Server☆14Updated 3 months ago
- This is the official repository for the Vista dataset - A Vietnamese multimodal dataset containing more than 700,000 samples of conversations a…☆25Updated 9 months ago
- VLSP2021 vieCap4H Challenge: Automatic image caption generation for healthcare domains in Vietnamese☆11Updated last year
- General template for most Pytorch projects☆34Updated 5 months ago
- Text Query based Traffic Video Event Retrieval with Global-Local Fusion Embedding☆12Updated last year
- Baseline achieving 0.8 accuracy on the private test set in the ZaloAI Challenge 2023 Elementary Math Solving☆24Updated 9 months ago
- Runner-up team (2nd place) in AI4VN2022: Air Quality Forecasting Challenge☆32Updated last year
- A strong baseline for liveness detection. The source code could be used for similar tasks, such as face anti-spoofing or detecting fake v …☆22Updated 2 years ago
- The largest VQA dataset for Vietnamese, focused on the text content in images.☆15Updated this week
- 👨🏻💻 Code release for Vietnamese chatbot from scratch [Published in IEEE IMCOM 2022]☆17Updated 6 months ago
- Vistral-V: Visual Instruction Tuning for Vistral - Vietnamese Large Vision-Language Model.☆21Updated 7 months ago
- AICITY2024 Track 2 - Code from AIO_ISC Team☆29Updated 7 months ago
- 2nd BKAI CHALLENGE☆8Updated 2 years ago
- arXiv 23 "Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs"☆14Updated 2 months ago
- [Thesis'24] Efficient Class Incremental Learning for Object Detection☆15Updated 7 months ago
- Vietnamese handwritten text recognition system☆17Updated 3 years ago
- MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering. A comprehensive evaluation of multimodal large model multilingua…☆52Updated 2 months ago
- Create TensorRT-runtime for vietocr☆11Updated 3 years ago
- The task aims at extracting required fields in receipts captured by mobile devices☆32Updated 2 years ago
- Machine translation between English and Vietnamese☆51Updated 4 years ago
- Pioneering in Vietnamese Multimodal Large Language Model☆46Updated 3 weeks ago
- A project for the Zalo AI Challenge 2019, Vietnamese Wikipedia Question Answering task.☆16Updated 5 years ago
- Using open-source LLM Llama2 by Meta on local CPU inference for document question-and-answer☆15Updated last year
- TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers☆21Updated 2 years ago