siret / somhunter
Open-source release of the SOMHunter video retrieval tool
☆21Updated last year
Alternatives and similar repositories for somhunter:
Users that are interested in somhunter are comparing it to the libraries listed below
- Distributed Retrieval Evaluation Server☆14Updated 3 months ago
- Archive of Tasks and Results of the Video Browser Showdown☆11Updated 3 weeks ago
- The task aims at extracting required fields in receipts captured by mobile devices☆32Updated 2 years ago
- General template for most Pytorch projects☆34Updated 5 months ago
- ☆27Updated 3 years ago
- Key information extraction from invoice document with Graph Convolution Network☆53Updated last year
- Using open-source LLM Llama2 by Meta on local CPU inference for document question-and-answer☆15Updated last year
- Text Query based Traffic Video Event Retrieval with Global-Local Fusion Embedding☆12Updated last year
- ☆12Updated last year
- 👨🏻💻 Code release for Vietnamese chatbot from scratch [Published in IEEE IMCOM 2022]☆17Updated 6 months ago
- Scene text vietnamese☆14Updated 2 years ago
- Official PyTorch Implementation of DocSynth: A Layout Guided Approach for Controllable Document Image Synthesis - ICDAR 2021☆76Updated 3 years ago
- [ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation☆70Updated 5 months ago
- Create TensorRT-runtime for vietocr☆11Updated 3 years ago
- VLSP2021 vieCap4H Challenge: Automatic image caption generation for healthcare domains in Vietnamese☆11Updated last year
- Solution for MC_OCR competition☆93Updated last year
- ☆12Updated last year
- Vietnamese handwritten text recognition system☆17Updated 3 years ago
- A project for the Zalo AI Challenge 2019, Vietnamese Wikipedia Question Answering task.☆16Updated 5 years ago
- Official codes of the 1st place for The NVIDIA AI City Challenge 2023 - Track 2☆18Updated last year
- Effective frame sampling for ML applications.☆18Updated 2 months ago
- 2nd BKAI CHALLENGE☆8Updated 2 years ago
- ☆19Updated 3 years ago
- Top 1 Quy Nhon AI Hackathon 2022 Challenge Smart Menu☆31Updated 2 years ago
- Vistral-V: Visual Instruction Tuning for Vistral - Vietnamese Large Vision-Language Model.☆21Updated 7 months ago
- ☆12Updated 2 years ago
- Extracting Tabular Data from Image to Excel files☆37Updated 6 months ago
- The largest VQA dataset for Vietnamese. Related to the text content in the image.☆15Updated this week
- Fullstack machine learning inference template☆30Updated last year
- This is the official repository for Vista dataset - A Vietnamese multimodal dataset contains more than 700,000 samples of conversations a…☆25Updated 9 months ago