ekazakos / grove
Code implementation for the paper "Large-scale Pre-training for Grounded Video Caption Generation" (ICCV 2025)
☆28Jan 18, 2026Updated 3 weeks ago
Alternatives and similar repositories for grove
Users that are interested in grove are comparing it to the libraries listed below
- B-cell Hybrid Immune Variant Engine☆11Aug 18, 2025Updated 5 months ago
- [NeurIPS 2024] "NovoBench: Benchmarking Deep Learning-based De Novo Sequencing Methods in Proteomics"☆12Nov 23, 2024Updated last year
- Implementation of "With a Little Help from my Temporal Context: Multimodal Egocentric Action Recognition, BMVC, 2021" in PyTorch☆19Dec 16, 2021Updated 4 years ago
- Code for the paper "Learning to engineer protein flexibility".☆22Jan 21, 2026Updated 3 weeks ago
- Official Implementation (Pytorch) of the "VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Capti…☆23Jan 26, 2025Updated last year
- The official PyTorch implementation of Manifold Preserving Guided Diffusion (MPGD)☆38Apr 16, 2024Updated last year
- RetroBridge: Markov Bridge Model for Retrosynthesis Planning☆33Mar 26, 2024Updated last year
- CryoBoltz code for protein structure prediction with cryo-EM guidance. NeurIPS 2025.☆21Nov 30, 2025Updated 2 months ago
- DisTime: Distribution-based Time Representation for Video Large Language Models.☆18Jul 10, 2025Updated 7 months ago
- Enzyme datasets used to benchmark enzyme-substrate promiscuity models☆40Jun 28, 2021Updated 4 years ago
- [ECCV 2024] Learning Video Context as Interleaved Multimodal Sequences☆42Mar 11, 2025Updated 11 months ago
- (ICLR 2026) Official repository of "ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing"☆58Jan 26, 2026Updated 2 weeks ago
- Reaction-Conditioned Virtual Screening of Enzymes☆42Jun 11, 2025Updated 8 months ago
- ☆11Dec 6, 2024Updated last year
- [CVPR 2025] DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval☆22Jun 23, 2025Updated 7 months ago
- Combined InstantID🔥 and FouriScale to generate high-resolution images!☆11Apr 3, 2024Updated last year
- [AAAI 26 Demo] Official repo for CAT-V - Caption Anything in Video: Object-centric Dense Video Captioning with Spatiotemporal Multimodal P…☆64Jan 27, 2026Updated 2 weeks ago
- [ICCV 2025] "Fine-grained Spatiotemporal Grounding on Egocentric Videos"☆22Nov 23, 2025Updated 2 months ago
- ☆14Dec 25, 2024Updated last year
- Webpage of "Portrait4D-v2: Pseudo Multi-View Data Creates Better 4D Head Synthesizer"☆11Jul 2, 2024Updated last year
- Terminal viewer for pdb files☆18Jan 5, 2026Updated last month
- Cross-Self KV Cache Pruning for Efficient Vision-Language Inference☆10Dec 15, 2024Updated last year
- Empowering Small VLMs to Think with Dynamic Memorization and Exploration☆15Nov 18, 2025Updated 2 months ago
- Weakly Supervised Referring Video Object Segmentation with Object-Centric Pseudo-Guidance☆10Aug 17, 2024Updated last year
- F-16 is a powerful video large language model (LLM) that perceives high-frame-rate videos, which is developed by the Department of Electr…☆34Jul 3, 2025Updated 7 months ago
- [ECCV 2024] The first zero-shot setting for spatio-temporal video grounding.☆11Jul 16, 2024Updated last year
- ☆14Dec 2, 2025Updated 2 months ago
- Official implementation of HawkEye: Training Video-Text LLMs for Grounding Text in Videos☆46Apr 29, 2024Updated last year
- ☆28Dec 4, 2025Updated 2 months ago
- multiview and self-supervised learning☆11May 8, 2022Updated 3 years ago
- [ICLR2023] Video Scene Graph Generation from Single-Frame Weak Supervision☆12Sep 17, 2023Updated 2 years ago
- Code for "Skill-based Chain-of-Thoughts for Domain-Adaptive Video Reasoning [EMNLP 2025 Finding]"☆15Aug 27, 2025Updated 5 months ago
- HD-EPIC Python script to download the entire dataset or parts of it☆17Oct 7, 2025Updated 4 months ago
- In this codebase we establish a benchmark for egocentric user adaptation based on Ego4d. First, we start from a population model which ha…☆15Jan 16, 2025Updated last year
- Fast Diffusion-Based Counterfactuals for Shortcut Removal and Generation (ECCV 2024 ORAL)☆15Sep 3, 2024Updated last year
- Solutions to "A First Course in Bayesian Statistical Methods" Peter D. Hoff☆15Jan 5, 2018Updated 8 years ago
- Official Pytorch implementation of "Omni-AVSR: Towards Unified Multimodal Speech Recognition with Large Language Models" [IEEE ICASSP 202…☆29Jan 18, 2026Updated 3 weeks ago
- This is the official code for the paper "Reconstruct before Query: Continual Missing Modality Learning with Decomposed Prompt Collaborati…☆12Aug 13, 2024Updated last year
- This repository contains the implementation of our NeurIPS'24 paper "Temporal Sentence Grounding with Relevance Feedback in Videos"☆13Aug 22, 2025Updated 5 months ago