SmartCLIP: A training method to improve CLIP with both short and long texts
☆43Jun 18, 2025Updated 10 months ago
Alternatives and similar repositories for SmartCLIP
Users that are interested in SmartCLIP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 【CVPR 2025】Chat-based Person Retrieval via Dialogue-Refined Cross-Modal Alignment☆37Sep 17, 2025Updated 7 months ago
- ☆22Oct 19, 2024Updated last year
- ☆13Jun 21, 2023Updated 2 years ago
- This is the open-source code for TokenCarve.☆26Jan 23, 2026Updated 3 months ago
- [ECCV24] Navigation Instruction Generation with BEV Perception and Large Language Models☆31Jul 16, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [2026 AAAI] Think Before You Segment: An Object-aware Reasoning Agent for Referring Audio-Visual Segmentation☆20Nov 8, 2025Updated 5 months ago
- [ICME 2024] DIIF (Dynamic Implicit Image Function for Efficient Arbitrary-Scale Super-Resolution).☆13Mar 13, 2024Updated 2 years ago
- Code for “ACE-HGNN: Adaptive Curvature ExplorationHyperbolic Graph Neural Network”☆17Mar 3, 2022Updated 4 years ago
- ☆29Apr 8, 2025Updated last year
- [IJCV 2026] HiPrompt: Tuning-free Higher-Resolution Generation with Hierarchical MLLM Prompts☆26Feb 28, 2025Updated last year
- [ICLR 2026] SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs☆58Updated this week
- 计算机图形学课程设计带报告,OpenGL、Qt,图形绘制系统,画图板,release版,exe直接运行☆11Feb 9, 2022Updated 4 years ago
- URDFs for the Stretch mobile manipulators from Hello Robot Inc.☆15Aug 19, 2025Updated 8 months ago
- [CVPR25] Official Implementation of CAV-MAE Sync☆30Apr 5, 2026Updated 3 weeks ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code of the paper "Unseen from Seen: Rewriting Observation-Instruction Using Foundation Models for Augmenting Vision-Language Navigation"…☆20Nov 11, 2025Updated 5 months ago
- ACMMM 2025☆17Dec 11, 2025Updated 4 months ago
- Official Implementation of OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation☆41Jul 5, 2025Updated 9 months ago
- HiMTok: Learning Hierarchical Mask Tokens for Image Segmentation with Large Multimodal Model☆94Jul 17, 2025Updated 9 months ago
- Improving large language models with concept-aware fine-tuning (CAFT)☆29Jan 31, 2026Updated 3 months ago
- Counterfactual Reasoning VQA Dataset☆28Nov 23, 2023Updated 2 years ago
- Aligning First, Then Fusing: A Novel Weakly-Supervised Multimodal Violence Detection Method☆22Oct 2, 2025Updated 7 months ago
- [EMNLP25 Main]The official code of "Gradient-Attention Guided Dual-Masking Synergetic Framework for Robust Text-based Person Retrieval"☆26Mar 30, 2026Updated last month
- Minimal and Customizable CC-Style Coding Agent☆132Apr 2, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Reversi AI based on Monte Carlo search algorithm☆10Apr 2, 2025Updated last year
- ☆30Aug 11, 2025Updated 8 months ago
- pytorch implementation of Semantics-AssistedVideoCaptioning☆11Feb 16, 2023Updated 3 years ago
- ☆17Jan 30, 2024Updated 2 years ago
- Portal for resources for the Stretch community☆13May 14, 2025Updated 11 months ago
- [ACL2023] Official code repository for VLN-Trans☆14Sep 10, 2023Updated 2 years ago
- ☆13Oct 15, 2025Updated 6 months ago
- A simple, elegant web tool that allows you to create custom RSS feeds for arXiv search queries. Stay up-to-date with the latest research …☆35Mar 21, 2026Updated last month
- ☆18Dec 5, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Official implementation of Why Only Text: Empowering Vision-and-Language Navigation with Multi-modal Prompts(IJCAI 2024)☆15Oct 16, 2024Updated last year
- This is the official repository for MAGIC: Meta-Ability Guided Interactive Chain-of-Distillation Learning towards Efficient Vision-and-La…☆14Jun 6, 2024Updated last year
- ☆28Jul 1, 2023Updated 2 years ago
- Example code for visual servoing using Stretch 3's gripper camera☆20Nov 5, 2024Updated last year
- Adaptive Local Implicit Image Function for Arbitrary-scale Super-resolution, accepted by the International Conference on Image Processing…☆22Nov 2, 2022Updated 3 years ago
- Target-Grounded Graph-Aware Transformer for Aerial Vision-and-Dialog Navigation, AVDN Challenge, ICCV CLVL 2023.☆21Jan 2, 2024Updated 2 years ago
- ☆26Apr 16, 2024Updated 2 years ago