โ11Feb 9, 2026Updated 3 weeks ago
Alternatives and similar repositories for C5-Multi-Instance-Retrieval
Users that are interested in C5-Multi-Instance-Retrieval are comparing it to the libraries listed below
Sorting:
- Code and benchmarks for the Semantic Video Retrieval Taskโ53Oct 18, 2022Updated 3 years ago
- ๐ฆ A lightweight machine learning toolkit for researchers, providing common model design & learning functionalities.โ28Jul 2, 2025Updated 8 months ago
- The 1st place solution of 2022 Ego4d Natural Language Queries.โ32Sep 5, 2022Updated 3 years ago
- โ10Jul 2, 2020Updated 5 years ago
- ็ๆ่ฎญ็ปๆๆฌๆฃๆตๆฐๆฎ้โ12Jul 1, 2020Updated 5 years ago
- Multi Task Learning for Semantic Segmentation, Instance Segmentation and Depth Estimationโ12Jun 12, 2022Updated 3 years ago
- implement gat with batchโ10Nov 28, 2020Updated 5 years ago
- We introduce Chart2Code, the first user-driven, hierarchical benchmark that systematically evaluates Large Multimodal Models on chart-to-โฆโ24Jan 27, 2026Updated last month
- [CVPR 2026] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selectionโ25Feb 10, 2026Updated 3 weeks ago
- A Python/Cython package for graph edit distances and graph matchingโ13Jan 30, 2023Updated 3 years ago
- Code for the Joint Part-of-Speech Embedding modelโ13Feb 16, 2023Updated 3 years ago
- The official implementation of VidFaceโ12Aug 27, 2024Updated last year
- Edit and Generate Anything in 3D world!โ14Apr 15, 2023Updated 2 years ago
- Annotations for the Mistake Detection benchmark of Assembly101โ10Aug 3, 2023Updated 2 years ago
- โ11Dec 8, 2022Updated 3 years ago
- Repository of GUI Action Narratorโ12Apr 8, 2025Updated 10 months ago
- Objective metrics for measuring visual texture similarity using STSIM features. Supervised by Thrasos Pappas.โ15Oct 4, 2023Updated 2 years ago
- The code and datasets of our ACM MM 2024 paper "Hallu-PI: Evaluating Hallucination in Multi-modal Large Language Models within Perturbed โฆโ11Sep 27, 2024Updated last year
- โ12Nov 26, 2019Updated 6 years ago
- โ16Sep 25, 2024Updated last year
- Self-supervised algorithm for learning representations from ego-centric video data. Code is tested on EPIC-Kitchens-100 and Ego4D in PyToโฆโ13Oct 23, 2022Updated 3 years ago
- Code for reproducing the results in "Forecasting Human Dynamics from Static Images"โ13Jun 16, 2024Updated last year
- Video action classification benchmark for common CNN architectures, implemented in PyTorchโ11Jan 31, 2022Updated 4 years ago
- This is the project page for the HOSNeRFโ16Dec 11, 2023Updated 2 years ago
- โ11Apr 21, 2021Updated 4 years ago
- ใ็ฎๆณใ้่ฟๅพๅ้ข่ฒ่ฎก็ฎๅพๅ็็ธไผผๅบฆโ11Sep 16, 2020Updated 5 years ago
- Camera calibration algorithm using DLT (direct linear transformation).โ12Sep 23, 2019Updated 6 years ago
- Official implementation of "An Action Is Worth Multiple Words: Handling Ambiguity in Action Recognition", BMVC 2022โ12Dec 16, 2022Updated 3 years ago
- ่ง้ขๅๅฒใๅ่งฃใๅๆไปฃ็ โ11Mar 24, 2019Updated 6 years ago
- An implementation of DecorrelatedBN by tensorflowโ13Jun 30, 2022Updated 3 years ago
- Code for the AVLnet (Interspeech 2021) and Cascaded Multilingual (Interspeech 2021) papers.โ54Mar 30, 2022Updated 3 years ago
- Code for "Low Shot Box Correction for Weakly Supervised Object Detection"โ12Nov 22, 2022Updated 3 years ago
- Xray Image Classifier in collaboration with Chulalongkorn University Computational Molecular Biology Research Groupโ12Aug 18, 2020Updated 5 years ago
- When CNNs Meet Random RNNs: Towards Multi-Level Analysis for RGB-D Object and Scene Recognition (CVIU 2022)โ13Feb 27, 2023Updated 3 years ago
- Global-to-Local Modeling for Video-based 3D Human Pose and Shape Estimationโ59Jun 21, 2023Updated 2 years ago
- Source code of the TextLap model, a LLM for text-2-layout generation.โ17Oct 21, 2024Updated last year
- Source code of our MM'22 paper Partially Relevant Video Retrievalโ55Nov 4, 2024Updated last year
- Unofficial PyTorch implementation of "Composing Good Shots by Exploiting Mutual Relations"โ14May 13, 2022Updated 3 years ago
- [ECCV 2022] GEB+: A Benchmark for Generic Event Boundary Captioning, Grounding and Retrievalโ17Aug 24, 2022Updated 3 years ago