Video-Text Representation Learning via Differentiable Weak Temporal Alignment (CVPR 2022)
☆18Apr 19, 2024Updated 2 years ago
Alternatives and similar repositories for VT-TWINS
Users that are interested in VT-TWINS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆10Apr 19, 2024Updated 2 years ago
- Open-Vocabulary Video Question Answering: A New Benchmark for Evaluating the Generalizability of Video Question Answering Models (ICCV 20…☆18Apr 23, 2024Updated 2 years ago
- MELTR: Meta Loss Transformer for Learning to Fine-tune Video Foundation Models (CVPR 2023)☆35Apr 23, 2024Updated 2 years ago
- ☆16Jun 5, 2023Updated 3 years ago
- Archive for AI grand challenge☆20Jun 6, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Official Implementation (Pytorch) of the "Generative Subgraph Retrieval for Knowledge Graph-Grounded Dialog Generation", EMNLP 2024 (main…☆12Mar 10, 2025Updated last year
- ☆16Jun 5, 2023Updated 3 years ago
- ☆11Jan 4, 2022Updated 4 years ago
- 2021 Drone AI challenge☆16Jan 4, 2022Updated 4 years ago
- Video-Text Representation Learning via Differentiable Weak Temporal Alignment (PyTorch implementation for the CVPR 2022 paper)☆11Oct 12, 2022Updated 3 years ago
- Official Implementation (Pytorch) of the "VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Capti…☆25Jan 26, 2025Updated last year
- Official Implementation (Pytorch) of "DAVI: Diffusion Prior-Based Amortized Variational Inference for Noisy Inverse Problems", ECCV 2024 …☆75Aug 16, 2024Updated last year
- Large Language Models are Temporal and Causal Reasoners for Video Question Answering (EMNLP 2023)☆77Mar 26, 2025Updated last year
- Official PyTorch Implementation for Advancing Bayesian Optimization via Learning Correlated Latent Space (CoBO)☆18Apr 22, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Basic Artificial Intelligence Theory☆10Mar 11, 2025Updated last year
- Official Implementation (Pytorch) of the "LLaMo: Large Language Model-based Molecular Graph Assistant", NeurIPS 2024☆37Feb 12, 2025Updated last year
- Official PyTorch implementation of "Groupwise Query Specialization and Quality-Aware Multi-Assignment for Transformer-based Visual Relati…☆41Apr 19, 2024Updated 2 years ago
- [ICCVW2025] V-RoAst: Visual Road Assessment. Can VLM be a Road Safety Assessor Using the iRAP Standard?☆13Dec 17, 2025Updated 6 months ago
- Official code of "Robust Multimodal 3D Object Detection via Modality-Agnostic Decoding and Proximity-based Modality Ensemble"☆43Feb 14, 2025Updated last year
- Deformable Graph Convolutional Networks (Author's PyTorch implementation for the AAAI 2022 paper)☆27Sep 22, 2022Updated 3 years ago
- Official pytorch implementation of NeurIPS 2022 paper, TokenMixup☆48Nov 22, 2022Updated 3 years ago
- Official implementation of CVPR 2024 paper "Prompt Learning via Meta-Regularization".☆31Mar 10, 2025Updated last year
- Official Implementation of "Read-only Prompt Optimization for Vision-Language Few-shot Learning", ICCV 2023☆54Aug 19, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [NeurIPS 24] Official Implementation (Pytorch) of "Inversion-based Latent Bayesian Optimization"☆10Nov 15, 2024Updated last year
- Official pytorch implementation of "Self-positioning Point-based Transformer for Point Cloud Understanding" (CVPR 2023).☆113Mar 27, 2025Updated last year
- Bidirectional Likelihood Estimation with Multi-Modal Large Language Models for Text-Video Retrieval (ICCV 2025 Highlight)☆26Aug 1, 2025Updated 11 months ago
- [ICML 2025 Spotlight] RAPID: Long-Context Inference with Retrieval-Augmented Speculative Decoding☆23Mar 2, 2025Updated last year
- Official implementation of CVPR 2024 paper "Multi-criteria Token Fusion with One-step-ahead Attention for Efficient Vision Transformers".☆40Jul 30, 2025Updated 11 months ago
- Official implementation of CVPR 2024 paper "vid-TLDR: Training Free Token merging for Light-weight Video Transformer".☆55Oct 21, 2025Updated 8 months ago
- Official Implementation (Pytorch) of "Super-class guided Transformer for Zero-Shot Attribute Classification", AAAI 2025☆15Jan 15, 2025Updated last year
- A PyTorch implementation of "TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?"☆18Dec 22, 2021Updated 4 years ago
- ☆22Jun 6, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [ Arxiv 2023 ] This repository contains the code for "MUPPET: Multi-Modal Few-Shot Temporal Action Detection"☆16Aug 30, 2023Updated 2 years ago
- ☆12May 15, 2025Updated last year
- [ICCV 2021] Multimodal Knowledge Expansion☆10Aug 28, 2021Updated 4 years ago
- A face detection base on faster-rcnn.pytorch☆10Feb 9, 2018Updated 8 years ago
- Repository of PIXAR, a Pixel-based Auto-Regressive Language Model☆20Sep 15, 2025Updated 9 months ago
- Official PyTorch implementation of NeurIPS 2022 paper "Invertible Monotone Operators for Normalizing Flows"☆15Nov 28, 2022Updated 3 years ago
- [CEUS] Contrastive Language-Location Pre-training☆25Aug 26, 2025Updated 10 months ago