☆16Sep 25, 2025Updated 5 months ago
Alternatives and similar repositories for Ego-ST
Users that are interested in Ego-ST are comparing it to the libraries listed below
Sorting:
- Code implementation of the paper 'FIction: 4D Future Interaction Prediction from Video'☆18Mar 19, 2025Updated 11 months ago
- ☆36Mar 18, 2025Updated 11 months ago
- For Ego4D VQ3D Task☆22Jan 9, 2024Updated 2 years ago
- ☆26Apr 26, 2025Updated 10 months ago
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆12Jun 28, 2025Updated 8 months ago
- ☆46Feb 18, 2026Updated 2 weeks ago
- Auditing agents for fine-tuning safety☆20Oct 21, 2025Updated 4 months ago
- Code for CVPR'18 "Grounding Referring Expressions in Images by Variational Context"☆30Jul 4, 2018Updated 7 years ago
- Code for "AVG-LLaVA: A Multimodal Large Model with Adaptive Visual Granularity"☆33Oct 12, 2024Updated last year
- ☆41Sep 9, 2025Updated 5 months ago
- (ICCV2025) Official repository of paper "ViSpeak: Visual Instruction Feedback in Streaming Videos"☆45Jul 1, 2025Updated 8 months ago
- PyTorch implementation for "Rethinking Low-quality Optical Flow in Unsupervised Surgical Instrument Segmentation"☆10Apr 11, 2024Updated last year
- Repository of paper: Position-Enhanced Visual Instruction Tuning for Multimodal Large Language Models☆37Sep 19, 2023Updated 2 years ago
- UniTAB: Unifying Text and Box Outputs for Grounded VL Modeling, ECCV 2022 (Oral Presentation)☆89Jun 12, 2023Updated 2 years ago
- Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning☆141Aug 21, 2025Updated 6 months ago
- Code for paper, "TL;DW? Summarizing Instructional Videos with Task Relevance & Cross-Modal Saliency" ECCV 2022☆39Feb 17, 2023Updated 3 years ago
- [NeurIPS 2024] Calibrated Self-Rewarding Vision Language Models☆86Oct 26, 2025Updated 4 months ago
- A Google Chrome Extension that replaces the official New Tab page with a beautiful to-do list.☆12Mar 7, 2018Updated 7 years ago
- ☆10Nov 17, 2022Updated 3 years ago
- ECG analysis to classify anterior myocardial infarction cases.☆10May 17, 2017Updated 8 years ago
- Disable YubiKey output on MacOS without a modifier key pressed☆10Aug 10, 2022Updated 3 years ago
- AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence☆10Mar 2, 2025Updated last year
- ☆11Nov 21, 2022Updated 3 years ago
- ☆38Feb 20, 2026Updated last week
- Self-Supervised Learning with Multi-View Rendering for 3D Point Cloud Analysis (ACCV 2022)☆10Jul 22, 2024Updated last year
- TARS: MinMax Token-Adaptive Preference Strategy for Hallucination Reduction in MLLMs☆23Sep 21, 2025Updated 5 months ago
- ☆18Aug 7, 2025Updated 6 months ago
- Ready to run PyTorch implementation of Data2Vec 2.0: Highly efficient self-supervised representation learning for vision, speech and text…☆16Mar 29, 2023Updated 2 years ago
- [CHI24] AI-Assisted In-Context Writing on OHMD During Travels☆11Dec 19, 2024Updated last year
- A few TensorFlow techniques I'm saving for future reference.☆13Oct 4, 2016Updated 9 years ago
- The official implementation of the paper SAEdit: Token-level control for continuous image editing via Sparse AutoEncoder☆18Oct 19, 2025Updated 4 months ago
- paper爬取+多agent分析(Polaris)☆26Updated this week
- Human-centric environment representations from egocentric video☆14Feb 5, 2026Updated 3 weeks ago
- 基于langchain和chatglm6b构建的智能问答系统,支持自定义语料☆10Jun 25, 2023Updated 2 years ago
- ☆13Jul 22, 2022Updated 3 years ago
- ☆23Feb 12, 2026Updated 3 weeks ago
- This repository contains code used for our Multi Sentence Inference NAACL'22 paper.☆12Mar 6, 2023Updated 2 years ago
- [ICML 2025] Repository for M3-JEPA: Multimodal Alignment via Multi-gate MoE based on the Joint-Predictive Embedding Architecture☆20Nov 4, 2025Updated 4 months ago
- ☆109Dec 30, 2024Updated last year