Code implementation for the paper "Large-scale Pre-training for Grounded Video Caption Generation" (ICCV 2025)
☆28Jan 18, 2026Updated last month
Alternatives and similar repositories for grove
Users that are interested in grove are comparing it to the libraries listed below
Sorting:
- B-cell Hybrid Immune Variant Engine☆11Aug 18, 2025Updated 6 months ago
- [NeurIPS 2024] "NovoBench: Benchmarking Deep Learning-based \emph{De Novo} Sequencing Methods in Proteomics"☆12Nov 23, 2024Updated last year
- Code for the paper "NovoMolGen: Rethinking Molecular Language Model Pretraining"☆25Jan 18, 2026Updated last month
- Support library for the MaskRCNN masks extracted on EPIC-KITCHENS-100☆14Dec 1, 2020Updated 5 years ago
- Implementation of "With a Little Help from my Temporal Context: Multimodal Egocentric Action Recognition, BMVC, 2021" in PyTorch☆19Dec 16, 2021Updated 4 years ago
- Library of models for Protein Function prediction (part of the 18th top solution out of 1625 teams in CAFA5)☆20May 23, 2025Updated 9 months ago
- REALM: A Real-to-Sim Validated Benchmark for Generalization in Robotic Manipulation☆47Updated this week
- Code for the paper "Learning to engineer protein flexibility".☆22Jan 21, 2026Updated last month
- Official implementation of "HowToCaption: Prompting LLMs to Transform Video Annotations at Scale." ECCV 2024☆58Aug 19, 2025Updated 6 months ago
- Official Implementation (Pytorch) of the "VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Capti…☆24Jan 26, 2025Updated last year
- The official PyTorch implementation of Manifold Preserving Guided Diffusion (MPGD)☆39Apr 16, 2024Updated last year
- RetroBridge: Markov Bridge Model for Retrosynthesis Planning☆34Mar 26, 2024Updated last year
- ☆11Oct 7, 2025Updated 4 months ago
- Enzyme datasets used to benchmark enzyme-substrate promiscuity models☆40Jun 28, 2021Updated 4 years ago
- CryoBoltz code for protein structure prediction with cryo-EM guidance. NeurIPS 2025.☆24Nov 30, 2025Updated 3 months ago
- CARE: a Benchmark Suite for the Classification and Retrieval of Enzymes☆41Jun 17, 2025Updated 8 months ago
- [ECCV 2024] Learning Video Context as Interleaved Multimodal Sequences☆43Mar 11, 2025Updated 11 months ago
- (ICLR 2026)Official repository of 'ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing’☆59Jan 26, 2026Updated last month
- ☆11Dec 6, 2024Updated last year
- Reaction-Conditioned Virtual Screening of Enzymes☆42Jun 11, 2025Updated 8 months ago
- DisTime: Distribution-based Time Representation for Video Large Language Models.☆19Jul 10, 2025Updated 7 months ago
- [CVPR2025] Official code for Lost in Translation Found in Context☆23Jan 14, 2026Updated last month
- Whisper finetuning☆16Apr 9, 2025Updated 10 months ago
- [AAAI 26 Demo] Offical repo for CAT-V - Caption Anything in Video: Object-centric Dense Video Captioning with Spatiotemporal Multimodal P…☆64Jan 27, 2026Updated last month
- ☆15Dec 2, 2025Updated 3 months ago
- Cross-Self KV Cache Pruning for Efficient Vision-Language Inference☆10Dec 15, 2024Updated last year
- [ECCV 2024] The first zero-shot setting for spatio-temporal video grounding.☆11Jul 16, 2024Updated last year
- Webpage of "Portrait4D-v2: Pseudo Multi-View Data Creates Better 4D Head Synthesizer"☆11Jul 2, 2024Updated last year
- Empowering Small VLMs to Think with Dynamic Memorization and Exploration☆15Nov 18, 2025Updated 3 months ago
- ☆10Oct 24, 2024Updated last year
- Weakly Supervised Referring Video Object Segmentation with Object-Centric Pseudo-Guidance☆10Aug 17, 2024Updated last year
- [CVPR 2025] DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval☆21Jun 23, 2025Updated 8 months ago
- Finding patch of conserved amino acid sites in 3D structure