lucas-ventura / chapter-llama
Official PyTorch implementation of the paper "Chapter-Llama: Efficient Chaptering in Hour-Long Videos with LLMs"
☆88 Jun 6, 2025 Updated 8 months ago
Alternatives and similar repositories for chapter-llama
Users that are interested in chapter-llama are comparing it to the libraries listed below
- Code for "Don’t drop your samples! Coherence-aware training benefits Conditional diffusion" CVPR 2024 Highlight ☆57 Jul 24, 2025 Updated 6 months ago
- Reliability in Semantic Segmentation: Can We Use Synthetic Data? (ECCV 2024) ☆41 Jul 17, 2024 Updated last year
- (EarthVision 2025 - CVPR Workshop) Official repository of DAFA-LS, a dataset of satellite image time series for the task of archaeologica… ☆38 Nov 21, 2024 Updated last year
- Multi-Camera Hand-Eye Calibration Framework for calibrating a camera network with respect to a robot arm ☆32 Jan 21, 2026 Updated 3 weeks ago
- official implementation of the Polynomial Mixer ☆22 Sep 15, 2025 Updated 5 months ago
- Implementation of the multi-temporal UTAE for the task of satellite image time series semantic change detection (SITS-SCD) ☆60 Jul 11, 2024 Updated last year
- Toolbox for the Earth Parser Dataset, a dataset presented in the "Learnable Earth Parser: Discovering 3D Prototypes in Aerial Scans" pape… ☆26 Aug 23, 2023 Updated 2 years ago
- ☆74 Oct 25, 2024 Updated last year
- Official Pytorch implementation of the "A Model You Can Hear: Audio Identification with Playable Prototypes" paper ☆37 Aug 8, 2022 Updated 3 years ago
- Code for "How far can we go with ImageNet for Text-to-Image generation?" paper ☆95 Nov 13, 2025 Updated 3 months ago
- ☆89 Oct 24, 2024 Updated last year
- Official PyTorch implementation of the paper "CoVR: Learning Composed Video Retrieval from Web Video Captions". ☆118 Oct 9, 2025 Updated 4 months ago
- utility functions for CIL ☆20 Jun 18, 2024 Updated last year
- ☆19 Nov 23, 2022 Updated 3 years ago
- (ICCV 2021) Code for "Unsupervised Layered Image Decomposition into Object Prototypes" paper ☆46 Feb 1, 2023 Updated 3 years ago
- (IGARSS 2025) Prototype-based method for agricultural image time series classification. ☆44 Sep 5, 2024 Updated last year
- [CVPR 2025 - Spotlight] Official PyTorch implementation of MAtCha Gaussians: Atlas of Charts for High-Quality Geometry and Photorealism F… ☆257 Apr 8, 2025 Updated 10 months ago
- Toolbox for HelixNet, a dataset presented in the "Online Segmentation of LiDAR Sequences: Dataset and Algorithm" paper ☆44 Dec 13, 2022 Updated 3 years ago
- (CVPR 2023) Official code of MACARONS: Mapping And Coverage Anticipation with RGB ONline Self-supervision. Also contains an updated and i… ☆83 Dec 23, 2023 Updated 2 years ago
- High order Moment Models ☆40 Nov 13, 2025 Updated 3 months ago
- Code for SCAM! Transferring humans between images with Semantic Cross Attention Modulation. Also contains implementation for SPADE, CLADE… ☆56 Nov 8, 2022 Updated 3 years ago
- Official Pytorch implementation of the "Online Segmentation of LiDAR Sequences: Dataset and Algorithm" paper ☆65 Jul 20, 2022 Updated 3 years ago
- Code release for the paper "Progress-Aware Video Frame Captioning" (CVPR 2025) ☆21 Jul 16, 2025 Updated 7 months ago
- (CVPRW 2022) Learning Co-segmentation by Segment Swapping for Retrieval and Discovery ☆53 Oct 27, 2022 Updated 3 years ago
- [CVPR 2022] Pytorch implementation of "Templates for 3D Object Pose Estimation Revisited: Generalization to New objects and Robustness to… ☆188 Dec 31, 2024 Updated last year
- [NeurIPS 2023 D&B] VidChapters-7M: Video Chapters at Scale ☆203 Nov 13, 2023 Updated 2 years ago
- (3DV 2021 oral) PyTorch implementation of paper "PoseContrast: Class-Agnostic Object Viewpoint Estimation in the Wild with Pose-Aware Con… ☆45 Dec 18, 2023 Updated 2 years ago
- ∞-Video: A Training-Free Approach to Long Video Understanding via Continuous-Time Memory Consolidation ☆19 Feb 14, 2025 Updated last year
- PyTorch implementation of "Representing Shape Collections with Alignment-Aware Linear Models" paper. ☆30 May 27, 2022 Updated 3 years ago
- Code and data release for the paper "Learning Fine-grained View-Invariant Representations from Unpaired Ego-Exo Videos via Temporal Align… ☆19 Apr 5, 2024 Updated last year
- [ICCV 2025] Official Implementation of "Shot-by-Shot: Film-Grammar-Aware Training-Free Audio Description Generation". Junyu Xie, Tengda H… ☆20 Jul 26, 2025 Updated 6 months ago
- TEMPURA enables video-language models to reason about causal event relationships and generate fine-grained, timestamped descriptions of u… ☆25 Jun 4, 2025 Updated 8 months ago
- (ECCV 2022) Code for Share With Thy Neighbors: Single-View Reconstruction by Cross-Instance Consistency ☆166 Dec 15, 2022 Updated 3 years ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models" ☆10 Jul 19, 2024 Updated last year
- Code for "Around the World in 80 Timesteps: A Generative Approach to Global Visual Geolocation" ☆271 Sep 29, 2025 Updated 4 months ago
- Implementation of Conditional ViT on LAION — Referred Visual Search — Fashion ☆42 Aug 27, 2024 Updated last year
- Implementation of "Multi-Track Timeline Control for Text-Driven 3D Human Motion Generation" from CVPR Workshop on Human Motion Generation… ☆139 Jun 18, 2024 Updated last year
- ☆91 Feb 21, 2023 Updated 2 years ago
- [CVPR 2025] DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval ☆22 Jun 23, 2025 Updated 7 months ago