lucaro / VBS-Archive
Archive of Tasks and Results of the Video Browser Showdown
☆12 Updated 2 months ago
Alternatives and similar repositories for VBS-Archive
Users who are interested in VBS-Archive are comparing it to the repositories listed below.
- Collection of recent advanced RAG techniques. ☆12 Updated 2 weeks ago
- The largest VQA dataset for Vietnamese. Related to the text content in the image. ☆16 Updated last month
- AICITY2024 Track 2 - Code from AIO_ISC Team ☆33 Updated 10 months ago
- Text Query based Traffic Video Event Retrieval with Global-Local Fusion Embedding ☆12 Updated last year
- This is the official repository for Vista dataset - A Vietnamese multimodal dataset contains more than 700,000 samples of conversations a… ☆26 Updated last year
- ☆36 Updated this week
- Baseline achieving 0.8 accuracy on the private test set in the ZaloAI Challenge 2023 Elementary Math Solving ☆24 Updated last year
- Distributed Retrieval Evaluation Server ☆14 Updated 6 months ago
- ☆12 Updated last year
- ☆14 Updated 2 years ago
- Vistral-V: Visual Instruction Tuning for Vistral - Vietnamese Large Vision-Language Model. ☆22 Updated 11 months ago
- ☆64 Updated last year
- 👨🏻‍💻 Code release for Vietnamese chatbot from scratch [Published in IEEE IMCOM 2022] ☆17 Updated 10 months ago
- General template for most PyTorch projects ☆35 Updated last month
- Vietnamese handwritten text recognition system ☆17 Updated 4 years ago
- Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023 ☆25 Updated last year
- ☆38 Updated last year
- A strong baseline for liveness detection. The source code could be used for similar tasks, such as face anti-spoofing or detecting fake v… ☆23 Updated 2 years ago
- ☆41 Updated last year
- "Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs" 2023 ☆14 Updated 6 months ago
- Multimodal Semi-Supervised Learning for Text Recognition (SemiMTR) ☆83 Updated last year
- VNHSGE: Vietnamese High School Graduation Examination Dataset for Large Language Models ☆26 Updated last year
- ☆87 Updated last year
- ☆70 Updated last year
- Contrast-guided Feature Adjustment Module for Visual Information Extraction ☆29 Updated 2 years ago
- MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering. A comprehensive evaluation of multimodal large model multilingua… ☆59 Updated 3 weeks ago
- Runner-up team (2nd place) in AI4VN2022: Air Quality Forecasting Challenge ☆31 Updated last year
- Distillation Contrastive Decoding: Improving LLMs Reasoning with Contrastive Decoding and Distillation ☆35 Updated last year
- Official repository for the MMFM challenge ☆25 Updated 11 months ago
- An AI-powered interactive video retrieval system ☆32 Updated 8 months ago