☆36Mar 24, 2026Updated 2 months ago
Alternatives and similar repositories for U-MARVEL
Users who are interested in U-MARVEL are comparing it to the libraries listed below.
- ☆11Mar 31, 2025Updated last year
- Code for "CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning"☆33Mar 26, 2025Updated last year
- ☆11Jul 31, 2022Updated 3 years ago
- [AAAI'25]: Building a Multi-modal Spatiotemporal Expert for Zero-shot Action Recognition with CLIP☆21Aug 5, 2025Updated 9 months ago
- [CVPR 2025] LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant☆181Jul 7, 2025Updated 10 months ago
- ☆11Dec 29, 2021Updated 4 years ago
- [ACM MM 2024] Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives☆39Sep 9, 2025Updated 8 months ago
- LLaVE: Large Language and Vision Embedding Models with Hardness-Weighted Contrastive Learning☆77May 23, 2025Updated last year
- [ECAI-2024] OmniCLIP: Adapting CLIP for Video Recognition with Spatial-Temporal Omni-Scale Feature Learning☆16Jan 7, 2025Updated last year
- Official This-Is-My Dataset published in CVPR 2023☆16Jul 18, 2024Updated last year
- Explanation of the llama2 repo.☆12Jul 18, 2024Updated last year
- Visual-Text dataset based on NFT metadata☆19Nov 7, 2024Updated last year
- Video Benchmark Suite: Rapid Evaluation of Video Foundation Models☆17Jan 10, 2025Updated last year
- Official Implementation of GENIUS: A Generative Framework for Universal Multimodal Search, CVPR 2025☆55Aug 8, 2025Updated 9 months ago
- Wasserstein Divergence for GANs☆19Jan 21, 2021Updated 5 years ago
- Mock tool for complex network visualization☆13Aug 30, 2018Updated 7 years ago
- Awesome LLM for Cybersecurity☆12Nov 16, 2024Updated last year
- The official implementation for Candidate Set Re-ranking for Composed Image Retrieval (TMLR) 01/2024☆20Feb 7, 2024Updated 2 years ago
- Context-I2W: Mapping Images to Context-dependent words for Accurate Zero-Shot Composed Image Retrieval [AAAI 2024 Oral]☆54May 27, 2025Updated last year
- A multimodal inference pipeline that integrates InstructBLIP with textgen-webui for Vicuna and related models.☆33Jul 14, 2023Updated 2 years ago
- 【KDD'20】OptMatch: Optimized Matchmaking via Modeling the High-Order Interactions on the Arena☆16Aug 26, 2020Updated 5 years ago
- Python Puppet Provider Abstraction for Wechaty☆13Nov 20, 2022Updated 3 years ago
- Composed Video Retrieval☆62May 2, 2024Updated 2 years ago
- ICCV 2019 Workshop & Challenge on Computer Vision for Wildlife Conservation (CVWC).☆16Aug 27, 2019Updated 6 years ago
- [ICLR 2026] Official Implementation of ProxyThinker: Test-Time Guidance through Small Visual Reasoners.☆22Sep 24, 2025Updated 8 months ago
- Code for <Domain Adaptive Video Segmentation via Temporal Consistency Regularization> in ICCV 2021☆42Jul 5, 2022Updated 3 years ago
- A http/websocket server framework on linux.☆20Mar 17, 2023Updated 3 years ago
- [SIGIR'2024 Best Paper Honorable Mention] Official repository for "LDRE: LLM-based Divergent Reasoning and Ensemble for Zero-Shot Compose…☆73Mar 14, 2025Updated last year
- [NeurIPS 2025 Spotlight] Think or Not Think: A Study of Explicit Thinking in Rule-Based Visual Reinforcement Fine-Tuning☆86Sep 19, 2025Updated 8 months ago
- ☆13Dec 25, 2018Updated 7 years ago
- Bidirectional Likelihood Estimation with Multi-Modal Large Language Models for Text-Video Retrieval (ICCV 2025 Highlight)☆23Aug 1, 2025Updated 9 months ago
- [ECCV'24] Official Implementation of Autoregressive Visual Entity Recognizer.☆14Mar 2, 2024Updated 2 years ago
- ☆17Jun 12, 2021Updated 4 years ago
- V-SWIFT: Training a Small VideoMAE Model on a Single Machine in a Day☆30Feb 5, 2025Updated last year
- ☆40Jan 12, 2026Updated 4 months ago
- [ACL2025 Oral🔥]Turning Trash into Treasure: Accelerating Inference of Large Language Models with Token Recycling☆28Nov 11, 2025Updated 6 months ago
- ☆37Jan 26, 2024Updated 2 years ago
- Official code for paper "UniIR: Training and Benchmarking Universal Multimodal Information Retrievers" (ECCV 2024)☆179Oct 1, 2024Updated last year
- [ICCV 2023] - Composed Image Retrieval on Common Objects in context (CIRCO) dataset☆87Aug 6, 2025Updated 9 months ago