lucaro / VBS-Archive
Archive of Tasks and Results of the Video Browser Showdown
☆11Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for VBS-Archive
- Distributed Retrieval Evaluation Server☆14Updated this week
- ☆12Updated last year
- The largest VQA dataset for Vietnamese. Related to the text content in the image.☆16Updated 6 months ago
- VLSP2021 vieCap4H Challenge: Automatic image caption generation for healthcare domains in Vietnamese☆11Updated last year
- General template for most Pytorch projects☆34Updated 2 months ago
- This is the official repository for Vista dataset - A Vietnamese multimodal dataset contains more than 700,000 samples of conversations a…☆24Updated 6 months ago
- 👨🏻💻 Code release for Vietnamese chatbot from scratch [Published in IEEE IMCOM 2022]☆17Updated 3 months ago
- A strong baseline for liveness detection. The source code could be used for similar tasks, such as face anti-spoofing or detecting fake v…☆20Updated last year
- AICITY2024 Track 2 - Code from AIO_ISC Team☆28Updated 4 months ago
- Text Query based Traffic Video Event Retrieval with Global-Local Fusion Embedding☆12Updated last year
- ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting☆20Updated 3 months ago
- Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023☆23Updated last year
- ☆35Updated last year
- ☆12Updated 2 years ago
- Create TensorRT-runtime for vietocr☆10Updated 3 years ago
- Baseline achieving 0.8 accuracy on the private test set in the ZaloAI Challenge 2023 Elementary Math Solving☆25Updated 6 months ago
- Using open-source LLM Llama2 by Meta on local CPU inference for document question-and-answer☆15Updated last year
- The task aims at extracting required fields in receipts captured by mobile devices☆32Updated 2 years ago
- Vistral-V: Visual Instruction Tuning for Vistral - Vietnamese Large Vision-Language Model.☆19Updated 4 months ago
- Small application for Vietnamese scenetext detection and recognition☆14Updated last year
- [IJCV 2024] TransDETR: End-to-end Video Text Spotting with Transformer☆102Updated 7 months ago
- ☆54Updated 10 months ago
- ☆42Updated 2 months ago
- An implementation of "CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model".☆124Updated 3 months ago
- (CVPR 2024) Bridging the Gap Between End-to-End and Two-Step Text Spotting.☆50Updated 5 months ago
- ☆46Updated 3 months ago
- Official PyTorch Implementation of "WordStylist: Styled Verbatim Handwritten Text Generation with Latent Diffusion Models" - ICDAR 2023☆68Updated 4 months ago
- arXiv 23 "Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs"☆13Updated 9 months ago
- A project for the Zalo AI Challenge 2019, Vietnamese Wikipedia Question Answering task.☆16Updated 4 years ago
- Vietnamese handwritten text recognition system☆17Updated 3 years ago