jhCOR / EgoOrientBenchLinks
The Official Code Repo for EgoOrientBench [CVPR25]
☆11Updated 2 months ago
Alternatives and similar repositories for EgoOrientBench
Users that are interested in EgoOrientBench are comparing it to the libraries listed below
Sorting:
- ☆22Updated 2 weeks ago
- Welcome to AudioCIL, the toolbox for audio class-incremental learning with the most implemented methods.☆32Updated 6 months ago
- Benchmarking for Audio-Text and Audio-Visual Generation; Supports FAD, FD_VGG, FD_PANNs, FD_PaSST, IS_PaSST, IS_PANNs, KL_PaSST, KL_PANNs…☆21Updated 4 months ago
- KV cache compression via sparse coding☆11Updated 2 months ago
- [ICLR 2025] Causal Graphical Models for Vision-Language Compositional Understanding☆9Updated 3 months ago
- ☆34Updated last month
- [⭐️ WACV 2025 Oral ⭐️] PETALface: Parameter Efficient Transfer Learning for Low-resolution Face Recognition☆13Updated last month
- HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard Models☆12Updated 4 months ago
- ☆78Updated 3 weeks ago
- OpenS2S : Advancing Fully Open-Source End-to-End Empathetic Large Speech Language Model☆20Updated last week
- Official PyTorch implementation of "Paralinguistics-Aware Speech-Empowered LLMs for Natural Conversation" (NeurIPS 2024)☆90Updated 7 months ago
- [ACL 2024 Findings] Official PyTorch Implementation code for realizing the technical part of CoLLaVO: Crayon Large Language and Vision mO…☆96Updated last year
- SAVEn-Vid: Synergistic Audio-Visual Integration for Enhanced Understanding in Long Video Context☆5Updated 6 months ago
- ☆8Updated 7 months ago
- ☆11Updated 3 months ago
- LLaVA-MR: Large Language-and-Vision Assistant for Video Moment Retrieval☆8Updated 7 months ago
- ☆31Updated last year
- The official implement of Freeze-Omni.☆13Updated this week
- The official implementation of MAGVLT: Masked Generative Vision-and-Language Transformer (CVPR'23)☆26Updated last year
- ☆32Updated last month
- Official PyTorch implementation of ReWaS (AAAI'25) "Read, Watch and Scream! Sound Generation from Text and Video"☆42Updated 7 months ago
- Towards Fine-grained Audio Captioning with Multimodal Contextual Cues☆75Updated last month
- UnifiedMLLM: Enabling Unified Representation for Multi-modal Multi-tasks With Large Language Model☆22Updated 11 months ago
- Implementation of PatchSAE as presented in "Sparse autoencoders reveal selective remapping of visual concepts during adaptation"☆18Updated 2 months ago
- Official Pytorch Implementation of "Cross-Attention Head Position Patterns Can Align with Human Visual Concepts in Text-to-Image Generati…☆9Updated 7 months ago
- [Official Implementation] Acoustic Autoregressive Modeling 🔥☆70Updated 10 months ago
- ☆12Updated 6 months ago
- Rare-to-Frequent (R2F), ICLR'25, Spotlight☆47Updated 2 months ago
- ☆17Updated last year
- AudioBERT 📢 : Audio Knowledge Augmented Language Model (ICASSP 2025)☆41Updated 5 months ago