☆39May 5, 2026Updated 2 weeks ago
Alternatives and similar repositories for Embed-RL
Users that are interested in Embed-RL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The repository of VG-Refiner paper☆19Dec 9, 2025Updated 5 months ago
- [CVPR 2024] The repository contains the official implementation of "Open-Vocabulary Segmentation with Semantic-Assisted Calibration"☆77Sep 23, 2024Updated last year
- 知予人工智能:从学习者到研究者☆13Jan 20, 2025Updated last year
- [AAAI 2026 Oral] The official code of "UniME-V2: MLLM-as-a-Judge for Universal Multimodal Embedding Learning"☆72Dec 8, 2025Updated 5 months ago
- The official code of "Thinking With Videos: Multimodal Tool-Augmented Reinforcement Learning for Long Video Reasoning"☆92Oct 15, 2025Updated 7 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- From Easy to Hard: A Dual Curriculum Learning Framework for Context-Aware Document Ranking☆14Oct 25, 2022Updated 3 years ago
- [CVPR 2024] Narrative Action Evaluation with Prompt-Guided Multimodal Interaction☆41May 16, 2024Updated 2 years ago
- ☆11May 10, 2024Updated 2 years ago
- A video question answering dataset that focuses on the dynamics properties of objects (velocity, acceleration) and their collisions withi…☆20Apr 23, 2025Updated last year
- [TIP 2025] Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation☆71Mar 27, 2026Updated last month
- CIKM 2021: Contrastive Learning of User Behavior Sequence for Context-Aware Document Ranking☆20Sep 28, 2022Updated 3 years ago
- RAG methods, benchmarks, and toolkits☆19Nov 28, 2024Updated last year
- [NeurIPS'24]Efficient and accurate memory saving method towards W4A4 large multi-modal models.☆101Jan 3, 2025Updated last year
- 中文原生多层次文生视频测评基准☆18Jul 8, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Official PyTorch implementation of GaussianToken: An Effective Image Tokenizer with 2D Gaussian Splatting☆102Apr 3, 2025Updated last year
- PyTorch model of OpenFace☆12May 8, 2017Updated 9 years ago
- The official implementation of our paper ''IteRPrimE: Zero-shot Referring Image Segmentation with Iterative Grad-CAM Refinement and Prima…☆20Apr 6, 2025Updated last year
- The code implementation for UME-R1: Exploring Reasoning-Driven Generative Multimodal Embeddings (ICLR 2026).☆60Feb 25, 2026Updated 2 months ago
- [IJCV 2025] The project is an official implementation of our paper "Learning Structure-Supporting Dependencies via Keypoint Interactive T…☆19Jul 16, 2025Updated 10 months ago
- HyperCUT: Video Sequence from a Single Blurry Image using Unsupervised Ordering (CVPR'23)☆14Nov 4, 2025Updated 6 months ago
- ☆23Sep 6, 2023Updated 2 years ago
- ☆29May 23, 2024Updated last year
- [ICML 2026] LaST$_0$: Latent Spatio-Temporal Chain-of-Thought for Robotic Vision-Language-Action Model☆70Apr 30, 2026Updated 2 weeks ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- We propose MMAD, a novel automated pipeline for precise AD generation. MMAD introduces ambient music alongside visual and linguistic, enh…☆17Dec 31, 2024Updated last year
- [CVPR'2022, TPAMI'2024] LAVT: Language-Aware Vision Transformer for Referring Segmentation☆25Jan 21, 2025Updated last year
- NeurIPS 2025 Poster☆26Feb 4, 2025Updated last year
- Chat about anything on any video!☆39Sep 5, 2023Updated 2 years ago
- [WACV 2025] I Dream My Painting: Connecting MLLMs and Diffusion Models via Prompt Generation for Text-Guided Multi-Mask Inpainting☆17Dec 29, 2025Updated 4 months ago
- The repository contains the official implementation of "DPMesh: Exploiting Diffusion Prior for Occluded Human Mesh Recovery"☆25Jul 25, 2024Updated last year
- Code for "CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning"☆33Mar 26, 2025Updated last year
- [ACL-26 Findings] Implementation for HiPrune, a training-free visual token pruning method for VLM acceleration.☆54Apr 29, 2026Updated 2 weeks ago
- CVPR 24 paper: Dysen-VDM: Empowering Dynamics-aware Text-to-Video Diffusion with LLMs☆14Mar 19, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ALGM applied to Segmenter☆31May 27, 2024Updated last year
- 多变量时序预测transformer☆17Sep 13, 2022Updated 3 years ago
- TEMPURA enables video-language models to reason about causal event relationships and generate fine-grained, timestamped descriptions of u…☆26Jun 4, 2025Updated 11 months ago
- [CVPR2024] Open-Vocabulary Semantic Segmentation with Image Embedding Balancing☆41Jan 12, 2026Updated 4 months ago
- [ICLR 2026] Efficient Reasoning with Balanced Thinking☆125May 4, 2026Updated 2 weeks ago
- An Arduino library to wrap multiple kind of serials.☆17Jan 3, 2022Updated 4 years ago
- ☆39Dec 19, 2025Updated 5 months ago