ngthanhtin / VLSP_ImageCaptioning
VLSP2021 vieCap4H Challenge: Automatic image caption generation for healthcare domains in Vietnamese
☆11Updated last year
Alternatives and similar repositories for VLSP_ImageCaptioning:
Users who are interested in VLSP_ImageCaptioning are comparing it to the repositories listed below
- Runner-up team (2nd place) in AI4VN2022: Air Quality Forecasting Challenge☆32Updated last year
- 2nd BKAI CHALLENGE☆8Updated 2 years ago
- ☆12Updated last year
- Vistral-V: Visual Instruction Tuning for Vistral - Vietnamese Large Vision-Language Model.☆22Updated 8 months ago
- This is the official repository for the Vista dataset, a Vietnamese multimodal dataset containing more than 700,000 samples of conversations a…☆25Updated 10 months ago
- 👨🏻💻 Code release for Vietnamese chatbot from scratch [Published in IEEE IMCOM 2022]☆17Updated 7 months ago
- General template for most PyTorch projects☆34Updated 6 months ago
- Vietnamese handwritten text recognition system☆17Updated 3 years ago
- ☆13Updated 2 years ago
- [Thesis'24] Efficient Class Incremental Learning for Object Detection☆15Updated 9 months ago
- AICITY2024 Track 2 - Code from AIO_ISC Team☆31Updated 8 months ago
- ☆24Updated last year
- Archive of Tasks and Results of the Video Browser Showdown☆11Updated 2 weeks ago
- The largest VQA dataset for Vietnamese. Related to the text content in the image.☆15Updated last month
- A project for the Zalo AI Challenge 2019, Vietnamese Wikipedia Question Answering task.☆16Updated 5 years ago
- Vietnamese Large Language Model (LLM) fine-tuned for the task of Question Answering within the medical and healthcare domain☆26Updated last year
- ☆23Updated last year
- Dictionary-guided Scene Text Recognition (CVPR-2021)☆144Updated 8 months ago
- Top 2 Solution for Zalo AI Challenge 2022 - Liveness Detection track☆44Updated 2 years ago
- The task aims at extracting required fields in receipts captured by mobile devices☆32Updated 2 years ago
- Get familiar with PyTorch☆8Updated 4 years ago
- Use the LoRA technique to improve the training of large language models☆12Updated last year
- This is an open-source repository for constructing and researching fusion-style deep learning methods combined with pretrained vision mod…☆13Updated 3 months ago
- Pioneering in Vietnamese Multimodal Large Language Model☆46Updated 2 months ago
- Create TensorRT-runtime for vietocr☆12Updated 3 years ago
- ☆35Updated 3 years ago
- Text Query based Traffic Video Event Retrieval with Global-Local Fusion Embedding☆12Updated last year
- 1st place code of Player Contact Detection Kaggle competition☆50Updated 2 years ago
- ☆12Updated last year
- Solution for MC_OCR competition☆94Updated 2 years ago