ekazakos/grove

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ekazakos/grove)

ekazakos / grove

Code implementation for the paper "Large-scale Pre-training for Grounded Video Caption Generation" (ICCV 2025)

☆31

Alternatives and similar repositories for grove

Users that are interested in grove are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ninatu / howtocaption
View on GitHub
Official implementation of "HowToCaption: Prompting LLMs to Transform Video Annotations at Scale." ECCV 2024
☆59Aug 19, 2025Updated 11 months ago
martin-sedlacek / REALM
View on GitHub
[IEEE RA-L 2026] REALM: A Real-to-Sim Validated Benchmark for Generalization in Robotic Manipulation
☆64Updated this week
baopj / Vid-Morp
View on GitHub
☆12Dec 6, 2024Updated last year
epic-kitchens / epic-kitchens-100-object-masks
View on GitHub
Support library for the MaskRCNN masks extracted on EPIC-KITCHENS-100
☆14Dec 1, 2020Updated 5 years ago
BorchLab / bHIVE
View on GitHub
B-cell Hybrid Immune Variant Engine
☆12Jun 26, 2026Updated last month
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
jingbo02 / NovoBench
View on GitHub
[NeurIPS 2024] "NovoBench: Benchmarking Deep Learning-based \emph{De Novo} Sequencing Methods in Proteomics"
☆14Nov 23, 2024Updated last year
ekazakos / MTCN
View on GitHub
Implementation of "With a Little Help from my Temporal Context: Multimodal Egocentric Action Recognition, BMVC, 2021" in PyTorch
☆20Dec 16, 2021Updated 4 years ago
ratthachat / awesome-biochem-ai
View on GitHub
Curated list on Deep Transformers Applications on Biology and Chemistry
☆19Apr 8, 2023Updated 3 years ago
SamusRam / ProFun
View on GitHub
Library of models for Protein Function prediction (part of the 18th top solution out of 1625 teams in CAFA5)
☆20May 23, 2025Updated last year
WRiegs / Squidly
View on GitHub
Repository for the usage of Squidly.
☆15Updated this week
pluskal-lab / EnzymeExplorer
View on GitHub
Highly accurate discovery of terpene synthases powered by machine learning
☆17Jun 9, 2026Updated last month
KoubaPetr / Flexpert
View on GitHub
Code for the paper "Learning to engineer protein flexibility".
☆22Mar 24, 2026Updated 4 months ago
Lzq5 / Video-Text-Alignment
View on GitHub
☆28Jul 18, 2025Updated last year
Visual-AI / Pancap
View on GitHub
[NeurIPS 2025] Panoptic Captioning: An Equivalence Bridge for Image and Text
☆38Jan 31, 2026Updated 5 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
sooyoung-cha / Structty
View on GitHub
Terminal viewer for pdb files
☆22Jan 5, 2026Updated 6 months ago
qirui-chen / MultiHop-EgoQA
View on GitHub
[AAAI 2025] Grounded Multi-Hop VideoQA in Long-Form Egocentric Videos
☆38May 27, 2025Updated last year
qirui-chen / RGA3-release
View on GitHub
[ICCV 2025] Object-centric Video Question Answering with Visual Grounding and Referring
☆24Aug 8, 2025Updated 11 months ago
McGill-NLP / AURORA
View on GitHub
Code and data for the paper: Learning Action and Reasoning-Centric Image Editing from Videos and Simulation
☆35Jun 30, 2025Updated last year
EvolvingLMMs-Lab / MGPO
View on GitHub
High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning
☆55Jul 23, 2025Updated last year
ADL-X / LLAVIDAL
View on GitHub
This is the offical repository of LLAVIDAL
☆25Oct 4, 2025Updated 9 months ago
V-STaR-Bench / V-STaR
View on GitHub
Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning
☆45Mar 2, 2026Updated 4 months ago
hjbahng / cyclereward
View on GitHub
CycleReward is a reward model trained on cycle consistency preferences to measure image-text alignment.
☆55Nov 3, 2025Updated 8 months ago
ArianeMora / enzyme-tk
View on GitHub
A toolkit for enzyme discovery
☆28May 18, 2026Updated 2 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
NIneeeeeem / LangDC
View on GitHub
[EMNLP 2025 Oral] Official codebase for Seeing More, Saying More: Lightweight Language Experts are Dynamic Video Token Compressors.
☆18Sep 7, 2025Updated 10 months ago
igashov / RetroBridge
View on GitHub
RetroBridge: Markov Bridge Model for Retrosynthesis Planning
☆36Mar 26, 2024Updated 2 years ago
zhengrongz / AoTD
View on GitHub
[CVPR 2025] Official PyTorch code of "Enhancing Video-LLM Reasoning via Agent-of-Thoughts Distillation".
☆58Updated this week
Lzq5 / UniTime
View on GitHub
Universal Video Temporal Grounding with Generative Multi-modal Large Language Models
☆56May 20, 2026Updated 2 months ago
MejriY / MS2DECIDE
View on GitHub
☆11Apr 25, 2026Updated 3 months ago
Sid2697 / EgoProceL-egocentric-procedure-learning
View on GitHub
Code implementation for our ECCV, 2022 paper titled "My View is the Best View: Procedure Learning from Egocentric Videos"
☆35Feb 5, 2024Updated 2 years ago
facebookresearch / htstep
View on GitHub
HT-Step is a large-scale article grounding dataset of temporal step annotations on how-to videos
☆26Mar 20, 2024Updated 2 years ago
zhang9302002 / ThinkingWithVideos
View on GitHub
The official code of "Thinking With Videos: Multimodal Tool-Augmented Reinforcement Learning for Long Video Reasoning"
☆102Oct 15, 2025Updated 9 months ago
Becomebright / GroundVQA
View on GitHub
Official PyTorch code of GroundVQA (CVPR'24)
☆63Sep 13, 2024Updated last year
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
OpenGVLab / EgoVideo
View on GitHub
[CVPR 2024 Champions][ICLR 2025] Solutions for EgoVis Chanllenges in CVPR 2024
☆136May 11, 2025Updated last year
Jayce1kk / SpaceVLLM
View on GitHub
SpaceVLLM: Endowing Multimodal Large Language Model with Spatio-Temporal Video Grounding Capability
☆17May 8, 2025Updated last year
JacobChalk / TIM
View on GitHub
Codebase for the paper: "TIM: A Time Interval Machine for Audio-Visual Action Recognition"
☆54Nov 7, 2024Updated last year
soCzech / GenHowTo
View on GitHub
Code for the paper "GenHowTo: Learning to Generate Actions and State Transformations from Instructional Videos" published at CVPR 2024
☆54Mar 3, 2024Updated 2 years ago
OpenGVLab / VideoChat-R1
View on GitHub
[NIPS2025] VideoChat-R1 & R1.5: Enhancing Spatio-Temporal Perception and Reasoning via Reinforcement Fine-Tuning
☆268Oct 18, 2025Updated 9 months ago
OurBluePrint / easy_video
View on GitHub
☆20Mar 3, 2025Updated last year
G-JWLee / COINCIDE_code
View on GitHub
☆23Nov 4, 2024Updated last year