itsnotacie / CVPR2023-EPIC-SOUNDS-Audio-Based-Interaction-Recognition-3rd-place-solution
☆30Updated last year
Alternatives and similar repositories for CVPR2023-EPIC-SOUNDS-Audio-Based-Interaction-Recognition-3rd-place-solution
Users who are interested in CVPR2023-EPIC-SOUNDS-Audio-Based-Interaction-Recognition-3rd-place-solution are comparing it to the repositories listed below
- itsnotacie / ICCV2023-OOD-CV-Challenge-Classification-Track-Self-supervised-pretrain-3rd-place-solution☆28Updated last year
- Use clash on Linux without sudo privileges☆125Updated 8 months ago
- This is for ACL 2025 Findings Paper: From Specific-MLLMs to Omni-MLLMs: A Survey on MLLMs Aligned with Multi-modalities☆41Updated last week
- [ICLR 2024] SemiReward: A General Reward Model for Semi-supervised Learning☆70Updated last year
- [ICLR 2024 (Spotlight)] "Frozen Transformers in Language Models are Effective Visual Encoder Layers"☆241Updated last year
- [ICLR 23 oral] The Modality Focusing Hypothesis: Towards Understanding Crossmodal Knowledge Distillation☆45Updated 2 years ago
- ICLR 2025☆27Updated 2 months ago
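- A bilingual academic CV template covering key sections such as education, publications, project experience, competition experience, and personal statement; suitable for applications to graduate programs, academic positions, or related industry roles.☆93Updated 3 weeks ago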
- ☆80Updated 8 months ago
- The repo for "Balanced Multimodal Learning via On-the-fly Gradient Modulation", CVPR 2022 (ORAL)☆277Updated 6 months ago
- The official implementation of "2024NeurIPS Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation"☆46Updated 6 months ago
- [ICLR2025] γ -MOD: Mixture-of-Depth Adaptation for Multimodal Large Language Models☆37Updated 5 months ago
- [CVPR 2024 Highlight] Official implementation of the paper: Cooperation Does Matter: Exploring Multi-Order Bilateral Relations for Audio-…☆39Updated 3 months ago
- [ECCV 2024] Official repository of "GenView: Enhancing View Quality with Pretrained Generative Model for Self-Supervised Learning".☆29Updated 7 months ago
- ☆41Updated 2 months ago
- [ICML 2025] Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in…☆141Updated last week
- Beginner tasks for the vision lab☆154Updated last year
- Official code for WACV 2024 paper, "Annotation-free Audio-Visual Segmentation"☆31Updated 9 months ago
- A convenient script for grabbing GPUs☆339Updated 6 months ago
- Official repository of the “Mask Again: Masked Knowledge Distillation for Masked Video Modeling” (ACM MM 2023)☆27Updated last year
- [ICCV2023] The repo for "Boosting Multi-modal Model Performance with Adaptive Gradient Modulation".☆25Updated last year
- [ICCV 2025] ONLY: One-Layer Intervention Sufficiently Mitigates Hallucinations in Large Vision-Language Models☆19Updated 2 weeks ago
- [AAAI 2023] Official PyTorch Code for "Curriculum Temperature for Knowledge Distillation"☆176Updated 7 months ago
- [NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning☆218Updated 7 months ago
- [ICLR 2025] The official pytorch implement of "Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Cont…☆46Updated 7 months ago
- [CVPR 2025] DyCoke: Dynamic Compression of Tokens for Fast Video Large Language Models☆60Updated 3 weeks ago
- 🔥CVPR 2025 Multimodal Large Language Models Paper List☆147Updated 4 months ago
- ☆129Updated 5 months ago
- ☆31Updated last year
- CorDA: Context-Oriented Decomposition Adaptation of Large Language Models for task-aware parameter-efficient fine-tuning(NeurIPS 2024)☆47Updated 6 months ago