SmartCLIP: A training method to improve CLIP with both short and long texts
☆42Jun 18, 2025Updated 9 months ago
Alternatives and similar repositories for SmartCLIP
Users that are interested in SmartCLIP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆22Oct 19, 2024Updated last year
- ☆13Jun 21, 2023Updated 2 years ago
- [arxiv: 2512.19673] Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies☆60Feb 6, 2026Updated 2 months ago
- ArcGis二次开发大作业(C#+AE)实现一些简单的空间分析以及一些基本操作等功能☆11Dec 20, 2018Updated 7 years ago
- ☆28Apr 8, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [ICLR 2026] SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs☆56Oct 14, 2025Updated 5 months ago
- URDFs for the Stretch mobile manipulators from Hello Robot Inc.☆14Aug 19, 2025Updated 7 months ago
- ☆19Dec 2, 2025Updated 4 months ago
- Scale-aware Super-resolution Network☆19Aug 28, 2024Updated last year
- [CVPR25] Official Implementation of CAV-MAE Sync☆30Apr 5, 2026Updated last week
- ☆15Dec 12, 2024Updated last year
- Official Implementation of OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation☆40Jul 5, 2025Updated 9 months ago
- Code of the paper "Unseen from Seen: Rewriting Observation-Instruction Using Foundation Models for Augmenting Vision-Language Navigation"…☆19Nov 11, 2025Updated 5 months ago
- HiMTok: Learning Hierarchical Mask Tokens for Image Segmentation with Large Multimodal Model☆93Jul 17, 2025Updated 8 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Improving large language models with concept-aware fine-tuning (CAFT)☆29Jan 31, 2026Updated 2 months ago
- Counterfactual Reasoning VQA Dataset☆28Nov 23, 2023Updated 2 years ago
- Aligning First, Then Fusing: A Novel Weakly-Supervised Multimodal Violence Detection Method☆22Oct 2, 2025Updated 6 months ago
- [EMNLP25 Main]The official code of "Gradient-Attention Guided Dual-Masking Synergetic Framework for Robust Text-based Person Retrieval"☆24Mar 30, 2026Updated last week
- ☆30Aug 11, 2025Updated 8 months ago
- Minimal and Customizable CC-Style Coding Agent☆124Apr 2, 2026Updated last week
- Efficient Visual Question Answering for Autonomous Vehicles with Reasoning-Enhanced Small Vision-Language Models☆23Apr 16, 2025Updated 11 months ago
- ☆16Jan 30, 2024Updated 2 years ago
- Portal for resources for the Stretch community☆13May 14, 2025Updated 10 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆16Jul 11, 2025Updated 9 months ago
- A drawing tool using C++/QT. 用QT编写的画图工具。☆14Apr 28, 2016Updated 9 years ago
- ☆13Oct 15, 2025Updated 5 months ago
- [NeurIPS 2024] PyTorch code for the paper "Making Offline RL Online: Collaborative World Models for Offline Visual Reinforcement Learning…☆26Oct 24, 2025Updated 5 months ago
- ☆29Dec 15, 2023Updated 2 years ago
- This is the official repository for MAGIC: Meta-Ability Guided Interactive Chain-of-Distillation Learning towards Efficient Vision-and-La…☆14Jun 6, 2024Updated last year
- RAVEN: Resilient Aerial Navigation via Open-Set Semantic Memory and Behavior Adaptation☆33Mar 31, 2026Updated last week
- ☆28Jul 1, 2023Updated 2 years ago
- Example code for visual servoing using Stretch 3's gripper camera☆20Nov 5, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Adaptive Local Implicit Image Function for Arbitrary-scale Super-resolution, accepted by the International Conference on Image Processing…☆21Nov 2, 2022Updated 3 years ago
- Target-Grounded Graph-Aware Transformer for Aerial Vision-and-Dialog Navigation, AVDN Challenge, ICCV CLVL 2023.☆21Jan 2, 2024Updated 2 years ago
- ☆26Apr 16, 2024Updated last year
- ☆17Jul 23, 2024Updated last year
- [ICCV 23] Official repository for Language-enhanced RNR-Map: Querying Renderable Neural Radiance Field maps with natural language☆17Dec 3, 2024Updated last year
- This is the source code to paper “DAgger Diffusion Navigation: DAgger Boosted Diffusion Policy for Vision-Language Navigation”.☆30Aug 13, 2025Updated 7 months ago
- [CVPR 2025] Enhanced OoD Detection through Cross-Modal Alignment of Multi-modal Representations☆32Jun 27, 2025Updated 9 months ago