Official PyTorch implementation of the paper "Chapter-Llama: Efficient Chaptering in Hour-Long Videos with LLMs"
☆90Jun 6, 2025Updated 9 months ago
Alternatives and similar repositories for chapter-llama
Users that are interested in chapter-llama are comparing it to the libraries listed below
Sorting:
- Code for "Don’t drop your samples! Coherence-aware training benefits Conditional diffusion" CVPR 2024 Highlight☆57Jul 24, 2025Updated 7 months ago
- (EarthVision 2025 - CVPR Workshop) Official repository of DAFA-LS, a dataset of satellite image time series for the task of archaeologica…☆38Nov 21, 2024Updated last year
- Multi-Camera Hand-Eye Calibration Framework for calibrating a camera network with respect to a robot arm☆32Jan 21, 2026Updated last month
- Implementation of the multi-temporal UTAE for the task of satellite image time series semantic change detection (SITS-SCD)☆60Jul 11, 2024Updated last year
- Toolbox for the Earth Parser Dataset, a dataset presented in the "Learnable Earth Parser: Discovering 3D Prototypes in Aerial Scans" pape…☆26Aug 23, 2023Updated 2 years ago
- ☆77Oct 25, 2024Updated last year
- Official Pytorch implementation of the "A Model You Can Hear: Audio Identification with Playable Prototypes" paper☆37Aug 8, 2022Updated 3 years ago
- ☆91Oct 24, 2024Updated last year
- Code for "How far can we go with ImageNet for Text-to-Image generation?" paper☆95Nov 13, 2025Updated 3 months ago
- Official PyTorch implementation of the paper "CoVR: Learning Composed Video Retrieval from Web Video Captions".☆118Oct 9, 2025Updated 5 months ago
- ☆19Nov 23, 2022Updated 3 years ago
- (ICCV 2021) Code for "Unsupervised Layered Image Decomposition into Object Prototypes" paper☆46Feb 1, 2023Updated 3 years ago
- Official Pytorch implementation of the "Learnable Earth Parser: Discovering 3D Prototypes in Aerial Scans" paper☆61May 3, 2024Updated last year
- [CVPR 2025 - Spotlight] Official PyTorch implementation of MAtCha Gaussians: Atlas of Charts for High-Quality Geometry and Photorealism F…☆257Apr 8, 2025Updated 11 months ago
- Toolbox for HelixNet, a dataset presented in the "Online Segmentation of LiDAR Sequences: Dataset and Algorithm" paper☆44Dec 13, 2022Updated 3 years ago
- (CVPR 2023) Official code of MACARONS: Mapping And Coverage Anticipation with RGB ONline Self-supervision. Also contains an updated and i…☆83Dec 23, 2023Updated 2 years ago
- High order Moment Models☆41Nov 13, 2025Updated 3 months ago
- Code for SCAM! Transferring humans between images with Semantic Cross Attention Modulation. Also contains implementation for SPADE, CLADE…☆56Nov 8, 2022Updated 3 years ago
- Official Pytorch implementation of the "Online Segmentation of LiDAR Sequences: Dataset and Algorithm" paper☆65Jul 20, 2022Updated 3 years ago
- (CVPRW 2022) Learning Co-segmentation by Segment Swapping for Retrieval and Discovery☆53Oct 27, 2022Updated 3 years ago
- Code release for the paper "Progress-Aware Video Frame Captioning" (CVPR 2025)☆21Jul 16, 2025Updated 7 months ago
- [CVPR 2022] Pytorch implementation of "Templates for 3D Object Pose Estimation Revisited: Generalization to New objects and Robustness to…☆189Dec 31, 2024Updated last year
- [NeurIPS 2023 D&B] VidChapters-7M: Video Chapters at Scale☆204Nov 13, 2023Updated 2 years ago
- Feature Translation for Exemplar-Free Class-Incremental Learning☆47Jun 5, 2023Updated 2 years ago
- Handwritten Text Recognition and Character Detection☆165Sep 28, 2025Updated 5 months ago
- \infty-Video: A Training-Free Approach to Long Video Understanding via Continuous-Time Memory Consolidation☆19Feb 14, 2025Updated last year
- PyTorch implementation of "Representing Shape Collections with Alignment-Aware Linear Models" paper.☆30May 27, 2022Updated 3 years ago
- [ICCV 2025] Official Implementation of "Shot-by-Shot: Film-Grammar-Aware Training-Free Audio Description Generation". Junyu Xie, Tengda H…☆21Jul 26, 2025Updated 7 months ago
- Code and data release for the paper "Learning Fine-grained View-Invariant Representations from Unpaired Ego-Exo Videos via Temporal Align…☆19Apr 5, 2024Updated last year
- (ECCV 2022) Code for Share With Thy Neighbors: Single-View Reconstruction by Cross-Instance Consistency☆166Dec 15, 2022Updated 3 years ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- Implementation of "Multi-Track Timeline Control for Text-Driven 3D Human Motion Generation" from CVPR Workshop on Human Motion Generation…☆140Jun 18, 2024Updated last year
- Official Implementation of MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Models☆12Nov 1, 2025Updated 4 months ago
- [CVPR 2024] PyTorch implementation of GigaPose: Fast and Robust Novel Object Pose Estimation via One Correspondence☆261Jan 6, 2025Updated last year
- Official implementation of "ConViS-Bench: Estimating Video Similarity Through Semantic Concepts", NeurIPS 2025☆25Nov 28, 2025Updated 3 months ago
- [ICML 2025 Oral] This is the official repository of the paper "What Limits Virtual Agent Application? OmniBench: A Scalable Multi-Dimensi…☆21Jun 12, 2025Updated 8 months ago
- Scripting Multi-Scene Videos with Time-Aware and Structural Audio-Visual Captions☆23Feb 11, 2026Updated 3 weeks ago
- [TPAMI 2026] Enhancing MMDiT-Based Text-to-Image Models for Similar Subject Generation☆11Nov 30, 2025Updated 3 months ago
- Generating Summaries with Controllable Readability Levels (EMNLP 2023)☆15Aug 6, 2025Updated 7 months ago