KhaLee2307 / image-retrieval
Content-Based Image Retrieval (CBIR) using Faiss (Facebook) and many different feature extraction methods ( VGG16, ResNet50, Local Binary Pattern, RGBHistogram)
☆42Updated 11 months ago
Alternatives and similar repositories for image-retrieval:
Users that are interested in image-retrieval are comparing it to the libraries listed below
- Few shot recognition using CLIP's OpenAI architecture.☆36Updated 3 years ago
- Official repository of the first-ranking solution for the UPAR2024 Challenge - Track 1.☆22Updated last year
- 2nd place solution to Google Universal Image Embedding Challenge!☆43Updated 2 years ago
- [CVPR 2023 (Highlight)] FAME-ViL: Multi-Tasking V+L Model for Heterogeneous Fashion Tasks☆52Updated last year
- ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting☆30Updated 2 months ago
- [NeurIPS 2024] TopoFR: A Closer Look at Topology Alignment on Face Recognition☆18Updated last month
- Few-shot Object Counting and Detection (ECCV 2022)☆67Updated 3 months ago
- Controllable and Guided Face Synthesis for Unconstrained Face Recognition (ECCV 2022)☆43Updated 2 years ago
- Estimate dataset difficulty and detect label mistakes using reconstruction error ratios!☆21Updated last month
- This repositary contains an implemetation of the two stage networks CVNet and SuperGlobal, for Image Retrieval.☆21Updated last year
- Image/Instance Retrieval using CLIP, A self supervised Learning Model☆26Updated last year
- [IJCV 2024] TransDETR: End-to-end Video Text Spotting with Transformer☆103Updated 11 months ago
- Official implementation of Data-Free Sketch-Based Image Retrieval, CVPR 2023.☆26Updated last year
- Official PyTorch Implementation of "DiffusionPen: Towards Controlling the Style of Handwritten Text Generation" - ECCV 2024☆38Updated 4 months ago
- ☆36Updated last year
- (CVPR 2024) Bridging the Gap Between End-to-End and Two-Step Text Spotting.☆59Updated 8 months ago
- 1st Place Solution in Google Universal Image Embedding☆62Updated last year
- Code for Recall@k Surrogate Loss with Large Batches and Similarity Mixup, CVPR 2022.☆60Updated 3 months ago
- Magface Triton Inferece Server Using Tensorrt☆16Updated 3 years ago
- The largest VQA dataset for Vietnamese. Related to the text content in the image.☆15Updated 2 weeks ago
- This is the official repository of "eDifFIQA: Towards Efficient Face Image Quality Assessment based on Denoising Diffusion Probabilistic …☆19Updated this week
- Using open-source LLM Llama2 by Meta on local CPU inference for document question-and-answer☆15Updated last year
- An implementation of "CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model".☆132Updated 2 months ago
- MobileSAM already integrated into Personalize Segment Anything Model (SAM) with 1 shot in 10 seconds☆36Updated last year
- (Unofficial) PyTorch implementation of Training Vision Transformers for Image Retrieval(El-Nouby, Alaaeldin, et al. 2021).☆47Updated last year
- Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023☆23Updated last year
- Official Pytorch implementation of LinCIR: Language-only Training of Zero-shot Composed Image Retrieval (CVPR 2024)☆127Updated 7 months ago
- [NeurIPS 2023] HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception☆40Updated 11 months ago
- [ICCV 2023] - Zero-shot Composed Image Retrieval with Textual Inversion☆166Updated 9 months ago
- Boosting vision transformers for image retrieval, proposed design of Deep Token Pooling(DToP)☆37Updated 2 years ago