ioanacroi / longmoment-detr
Moment Detection in Long Tutorial Videos
☆20May 8, 2024Updated last year
Alternatives and similar repositories for longmoment-detr
Users who are interested in longmoment-detr are comparing it to the repositories listed below
- ☆15Jan 16, 2024Updated 2 years ago
- Repository for 3 papers on Summarization and Entailment for Medical User-Generated Questions.☆13Jun 7, 2022Updated 3 years ago
- VisualGPTScore for visio-linguistic reasoning☆27Oct 7, 2023Updated 2 years ago
- Scanning Only Once: An End-to-end Framework for Fast Temporal Grounding in Long Videos☆27Jun 24, 2024Updated last year
- ☆33Sep 22, 2024Updated last year
- The implementation codes of paper: Multimodal Sentiment Analysis with Mutual Information-based Disentangled Representation Learning☆18May 8, 2025Updated 9 months ago
- Official Tensorflow implementation of ISCL (Under review)☆10Oct 29, 2021Updated 4 years ago
- A framework for steering MoE models by detecting and controlling behavior-linked experts.☆29Sep 12, 2025Updated 5 months ago
- [ECCV2024] Official code implementation of Merlin: Empowering Multimodal LLMs with Foresight Minds☆96Jul 4, 2024Updated last year
- [NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models.☆51Oct 14, 2024Updated last year
- Awesome Multimodal Fusion in Speech Emotion Recognition☆13Nov 11, 2025Updated 3 months ago
- Implementation of Mutan+ArticleNet on OKVQA☆10Jan 11, 2021Updated 5 years ago
- ☆13Oct 17, 2020Updated 5 years ago
- Code repository for Blackbox Attacks via Surrogate Ensemble Search (BASES), NeurIPS 2022☆13Aug 6, 2024Updated last year
- ☆10Jul 16, 2024Updated last year
- Video Summarization Transformer: Implementation in PyTorch of the Transformer model for video summarisation☆10Oct 27, 2020Updated 5 years ago
- Substitute alternative spellings of special characters (e.g. German umlauts [ae, oe, ue] and [ss]) with their correct versions (ä, ö, ü, …☆11Nov 24, 2024Updated last year
- IIRC baseline☆10Jan 13, 2021Updated 5 years ago
- [ICCV 2025] "Fine-grained Spatiotemporal Grounding on Egocentric Videos"☆22Nov 23, 2025Updated 2 months ago
- We present a study of a neural network based method for speech emotion recognition, using audio-only features. In the studied scheme, the…☆11Jul 24, 2024Updated last year
- ☆13Feb 2, 2026Updated 2 weeks ago
- ☆10Oct 16, 2025Updated 4 months ago
- ☆11Oct 24, 2022Updated 3 years ago
- Code for paper "Cross-Domain Slot Filling as Machine Reading Comprehension" in IJCAI 2021☆11Aug 24, 2021Updated 4 years ago
- Multimodal SER Model meant to be trained on recognising emotions from speech (text + acoustic data). Fine-tuned the DeBERTaV3 model, resp…☆11Jun 19, 2024Updated last year
- ☆10Jan 18, 2024Updated 2 years ago
- ☆51May 11, 2025Updated 9 months ago
- [ECCV2024] The official implementation of "Listen to Look into the Future: Audio-Visual Egocentric Gaze Anticipation".☆13Feb 24, 2025Updated 11 months ago
- [ICCV 2023] Accurate and Fast Compressed Video Captioning☆52Jul 28, 2025Updated 6 months ago
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition☆11Mar 14, 2025Updated 11 months ago
- Solutions to "A First Course in Bayesian Statistical Methods" Peter D. Hoff☆15Jan 5, 2018Updated 8 years ago
- PyTorch code for the CVPR'23 paper: "ConStruct-VL: Data-Free Continual Structured VL Concepts Learning"☆14Feb 5, 2024Updated 2 years ago
- ☆11Aug 10, 2022Updated 3 years ago
- ☆23Dec 6, 2025Updated 2 months ago
- This is the official code for the paper "Reconstruct before Query: Continual Missing Modality Learning with Decomposed Prompt Collaborati…☆12Aug 13, 2024Updated last year
- Domain Adaptation and Adapters☆16Feb 28, 2023Updated 2 years ago
- Source code to "SliTraNet: Automatic Detection of Slide Transitions in Lecture Videos using Convolutional Neural Networks"☆10Dec 17, 2023Updated 2 years ago
- This is a repository for the ACL 2020 paper: "Let Me Choose: From Verbal Context to Font Selection"☆12Nov 21, 2022Updated 3 years ago
- In this codebase we establish a benchmark for egocentric user adaptation based on Ego4d. First, we start from a population model which ha…☆15Jan 16, 2025Updated last year