[TIP25] Code for "Text-Video Retrieval with Global-Local Semantic Consistent Learning"
☆16 · May 12, 2025 · Updated last year
Alternatives and similar repositories for GLSCL
Users who are interested in GLSCL are comparing it to the libraries listed below.
- ☆11 · Jul 11, 2023 · Updated 2 years ago
- (ACL 2025) 🔥🔥🔥 Code for "Empowering Multimodal Large Language Models with Evol-Instruct" ☆21 · May 15, 2025 · Updated last year
- Weakly Supervised Video Moment Localisation with Contrastive Negative Sample Mining ☆31 · Apr 4, 2022 · Updated 4 years ago
- Awesome multi-modal large language paper/project, collections of popular training strategies, e.g., PEFT, LoRA. ☆27 · Aug 2, 2024 · Updated last year
- [TCSVT23] Official code for "SPT: Spatial Pyramid Transformer for Image Captioning". ☆10 · Aug 14, 2024 · Updated last year
- Source code of our AAAI 2024 paper "Cross-Modal and Uni-Modal Soft-Label Alignment for Image-Text Retrieval" ☆55 · Mar 28, 2024 · Updated 2 years ago
- A survey on MM-LLMs for long video understanding: From Seconds to Hours: Reviewing MultiModal Large Language Models on Comprehensive Long… ☆22 · Sep 12, 2025 · Updated 9 months ago
- [TIP 2025] This is an official PyTorch implementation of "Zero-Shot Skeleton-Based Action Recognition With Prototype-Guided Feature Align… ☆37 · Jul 24, 2025 · Updated 11 months ago
- ☆35 · Dec 6, 2025 · Updated 6 months ago
- ☆19 · May 19, 2024 · Updated 2 years ago
- ☆59 · Sep 2, 2024 · Updated last year
- ☆35 · Dec 14, 2025 · Updated 6 months ago
- ☆11 · Jul 26, 2024 · Updated last year
- Source code of the paper Dual Learning with Dynamic Knowledge Distillation and Soft Alignment for Partially Relevant Video Retrieval ☆19 · May 13, 2026 · Updated last month
- ☆13 · Feb 2, 2025 · Updated last year
- [TCSVT 2024] Official PyTorch implementation of the paper "MLP: Motion Label Prior for Temporal Sentence Localization in Untrimmed 3D Hum… ☆28 · Jul 22, 2024 · Updated last year
- ☆19 · Jul 9, 2023 · Updated 2 years ago
- [TPAMI2025] BackMix: Regularizing Open Set Recognition by Removing Underlying Fore-Background Priors ☆16 · Apr 23, 2025 · Updated last year
- Repository for "CoMix: Comprehensive Benchmark for Multi-Task Comic Understanding" ☆17 · Nov 20, 2024 · Updated last year
- ☆14 · Sep 28, 2023 · Updated 2 years ago
- [AAAI'25 Oral] NightReID: A Large-Scale Nighttime Person Re-Identification Benchmark ☆11 · Jun 10, 2025 · Updated last year
- Synthesizing Efficient Data with Diffusion Models for Person Re-Identification Pre-Training ☆11 · Jan 23, 2024 · Updated 2 years ago
- code for downloading videos from HowTo100M dataset ☆18 · May 13, 2021 · Updated 5 years ago
- [ICLR 2026] The implementation of paper "AlphaSteer: Learning Refusal Steering with Principled Null-Space Constraint" ☆60 · Nov 20, 2025 · Updated 7 months ago
- ☆11 · Sep 30, 2024 · Updated last year
- Unofficial implementation of the Ask-LLM paper 'How to Train Data-Efficient LLMs', arXiv:2402.09668. ☆12 · Jun 19, 2024 · Updated 2 years ago
- The official repo of AIGC Image Quality Assessment via Image-Prompt Correspondence [CVPRW2024, NTIRE2024]. ☆26 · Sep 16, 2025 · Updated 9 months ago
- ☆20 · Mar 12, 2025 · Updated last year
- ☆23 · Sep 5, 2023 · Updated 2 years ago
- Official implementation for "Exploring Vision Transformers for 3D Human Motion-Language Models with Motion Patches" (CVPR 2024) ☆32 · Jul 4, 2024 · Updated 2 years ago
- A Coarse-to-Fine Pseudo-Labeling (C2FPL) Framework for Unsupervised Video Anomaly Detection ☆21 · May 18, 2024 · Updated 2 years ago
- PyTorch implementation of StableMask (ICML'24) ☆15 · Jun 27, 2024 · Updated 2 years ago
- paper list on Video Moment Retrieval (VMR), or Temporal Video Grounding (TVG), Video Grounding (VG), or Temporal Sentence Grounding in Vi… ☆41 · Dec 27, 2025 · Updated 6 months ago
- [ICLR 2024] Towards Robust Multi-Modal Reasoning via Model Selection ☆14 · Mar 7, 2024 · Updated 2 years ago
- ☆22 · Dec 9, 2022 · Updated 3 years ago
- Official implementation for CVPR 2025 paper: DIFFER: Disentangling Identity Features via Semantic Cues for Clothes-Changing Person Re-ID. ☆29 · Nov 19, 2025 · Updated 7 months ago
- Code for the paper "Controllable Video Captioning with an Exemplar Sentence" ☆12 · Apr 14, 2021 · Updated 5 years ago
- Effective Attention Sheds Light On Interpretability - Findings of ACL2021 ☆11 · May 16, 2021 · Updated 5 years ago
- A reading list of papers about Visual Grounding. ☆31 · Aug 24, 2022 · Updated 3 years ago