Embedding model prioritized towards Multimodal RAG, overall + VisDoc double top1 on MMEB benchmark
☆35Nov 6, 2025Updated 5 months ago
Alternatives and similar repositories for RzenEmbed
Users that are interested in RzenEmbed are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆15Apr 30, 2022Updated 3 years ago
- AvatarShield: Visual Reinforcement Learning for Human-Centric Video Forgery Detection☆22Jun 3, 2025Updated 10 months ago
- Official repository of the paper: Can ChatGPT Detect DeepFakes? A Study of Using Multimodal Large Language Models for Media Forensics☆15Mar 22, 2024Updated 2 years ago
- tensorflow mtcnn☆24Feb 20, 2017Updated 9 years ago
- PyTorch model of OpenFace☆12May 8, 2017Updated 8 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- visual-language reasoning segmentation of function-level building footprint☆20May 17, 2025Updated 11 months ago
- convert pytorch trained yolo model to ncnn for Flexible deployment☆10Aug 30, 2018Updated 7 years ago
- The official repository of our paper "Reinforcing Video Reasoning with Focused Thinking"☆35Jun 12, 2025Updated 10 months ago
- ☆37Apr 6, 2026Updated 2 weeks ago
- [WACV 2025] I Dream My Painting: Connecting MLLMs and Diffusion Models via Prompt Generation for Text-Guided Multi-Mask Inpainting☆17Dec 29, 2025Updated 3 months ago
- ☆23Nov 4, 2024Updated last year
- Tracking part of siamese-fc.☆10Feb 25, 2017Updated 9 years ago
- ☆38Jan 9, 2026Updated 3 months ago
- Solving physionet2017 with RCRNN☆10Jun 11, 2019Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- MXNet implementation of infoGAN, WGAN, CycleGAN☆10Jan 28, 2018Updated 8 years ago
- Code for "CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning"☆33Mar 26, 2025Updated last year
- Image Classification based on Analytics-Zoo and Redis☆10Apr 17, 2020Updated 6 years ago
- ☆16Mar 26, 2025Updated last year
- A simple TensorFlow example for training CNN models using input queues and labelled JPEGs☆10Mar 4, 2017Updated 9 years ago
- ☆16Apr 21, 2016Updated 9 years ago
- This is the pytorch implementation of Pose-native Neural Architecture Search for Multi-person Pose Estimation.☆11Oct 12, 2021Updated 4 years ago
- This repo contains the code for "VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks" [ICLR 2025]☆624Apr 12, 2026Updated last week
- Preprocess the activityNet dataset for detection task☆13Mar 3, 2017Updated 9 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆17Jul 14, 2023Updated 2 years ago
- [Pattern Recognit. Lett.] This is the official code of the paper "Cloud removal using SAR and optical images via attention mechanism-base…☆26Jan 17, 2025Updated last year
- Extract video features. Currently, the models includes I3D, will be continuously updated.☆12Jun 4, 2020Updated 5 years ago
- A simple wrapper to localize human joints from images/video frames for multiple subjects.☆13Nov 22, 2022Updated 3 years ago
- [NeurIPS '24] Code repo for the paper entitled "Learning Structured Representations with Hyperbolic Embeddings" at NeurIPS 2024☆24Jan 22, 2025Updated last year
- CPP wrapper for MXNet interface☆10Feb 29, 2016Updated 10 years ago
- yolov5 pth convert tensorrt and inference☆14Nov 18, 2021Updated 4 years ago
- Motion Emotion Dataset(MED)☆15Oct 26, 2016Updated 9 years ago
- A Tri-map free direct Alpha Matting.☆12Sep 21, 2019Updated 6 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- MLLM-DataEngine: An Iterative Refinement Approach for MLLM☆48May 24, 2024Updated last year
- The official code of our TGRS'24 paper MP2Net: Mask Propagation and Motion Prediction Network for Multi-Object Tracking in Satellite Vide…☆29Apr 25, 2024Updated last year
- This is an implementation of FATAUVA-Net model in paper: An Integrated Deep Learning Framework for Facial Attribute Recognition, Action U…☆19Mar 2, 2018Updated 8 years ago
- Official implementation of "Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation" (CVPR 202…☆40May 26, 2025Updated 10 months ago
- PyTorch implementation of YOLOv3, including training and inference based on darknet and mobilnetv2☆12Feb 20, 2019Updated 7 years ago
- [CVPR 2025] Understanding Fine-tuning CLIP for Open-vocabulary Semantic Segmentation in Hyperbolic Space☆38Jul 18, 2025Updated 9 months ago
- [IEEE TGRS 2024]: The official PyTorch implementation of the paper "MaskCD: A Remote Sensing Change Detection Network Based on Mask Class…☆45Jul 14, 2024Updated last year