ngthanhtin / VLSP_ImageCaptioning
VLSP2021 vieCap4H Challenge: Automatic image caption generation for healthcare domains in Vietnamese
☆11Updated last year
Alternatives and similar repositories for VLSP_ImageCaptioning:
Users that are interested in VLSP_ImageCaptioning are comparing it to the libraries listed below
- Vistral-V: Visual Instruction Tuning for Vistral - Vietnamese Large Vision-Language Model.☆22Updated 9 months ago
- 2nd BKAI CHALLENGE☆8Updated 2 years ago
- Runner-up team (2nd place) in AI4VN2022: Air Quality Forcasting Challenge☆31Updated last year
- This is the official repository for Vista dataset - A Vietnamese multimodal dataset contains more than 700,000 samples of conversations a…☆25Updated 11 months ago
- General template for most Pytorch projects☆34Updated 2 weeks ago
- 👨🏻💻 Code release for Vietnamese chatbot from scratch [Published in IEEE IMCOM 2022]☆17Updated 8 months ago
- Vietnamese handwritten text recognition system☆17Updated 3 years ago
- ☆13Updated 2 years ago
- Top 2 Solution for Zalo AI Challenge 2022 - Liveness Detection track☆43Updated 2 years ago
- The largest VQA dataset for Vietnamese. Related to the text content in the image.☆16Updated 2 weeks ago
- ☆12Updated last year
- Baseline for ZaloAI Challenge 2023 Elementary Math Solving☆69Updated last year
- Archive of Tasks and Results of the Video Browser Showdown☆11Updated last month
- Vietnamese Large Language Model (LLM) fine-tuned for the task of Question Answering within the medical and healthcare domain☆26Updated last year
- Top 1 Quy Nhon AI Hackathon 2022 Challenge Smart Menu☆30Updated 2 years ago
- ☆23Updated last year
- AICITY2024 Track 2 - Code from AIO_ISC Team☆32Updated 9 months ago
- Fullstack machine learning inference template☆30Updated last year
- ☆11Updated 10 months ago
- ☆27Updated 3 years ago
- MLOPs human pose estimation end-to-end.☆34Updated last year
- The task aims at extracting required fields in receipts captured by mobile devices☆32Updated 2 years ago
- ☆46Updated last year
- Text Query based Traffic Video Event Retrieval with Global-Local Fusion Embedding☆12Updated last year
- Our implementation for paper: An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale☆30Updated 3 years ago
- Solution for MC_OCR competition☆94Updated 2 years ago
- VNHSGE: Vietnamese High School Graduation Examination Dataset for Large Language Models☆25Updated last year
- A strong baseline for liveness detection. The source code could be used for similar tasks, such as face anti-spoofing or detecting fake v…☆23Updated 2 years ago
- ☆63Updated 3 years ago
- 1st place code of Player Contact Detection Kaggle competition☆50Updated 2 years ago