dynamic-superb / multimodal-llamaView external linksLinks
The official implementation of ImageBind-LLM and Whisper-LLM from the paper "Dynamic-SUPERB: Towards A Dynamic, Collaborative, and Comprehensive Instruction-Tuning Benchmark for Speech".
☆21Oct 30, 2023Updated 2 years ago
Alternatives and similar repositories for multimodal-llama
Users that are interested in multimodal-llama are comparing it to the libraries listed below
Sorting:
- Comprehensive quantitative comparison of lossless and lossy audio codecs☆39Feb 11, 2023Updated 3 years ago
- A casual and simple ChatGPT Python script that can run using terminal (as long as you have an API). Support Azure API.☆21May 3, 2025Updated 9 months ago
- Official implementation of the ICASSP 2023 paper "HRTF Field: Unifying Measured HRTF Magnitude Representation with Neural Fields"☆25Dec 3, 2023Updated 2 years ago
- Official repository for the paper Singing Voice Graph Modeling for SingFake Detection (Interspeech 2024).☆25Sep 19, 2025Updated 4 months ago
- A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models (ICASSP 2024)☆58Apr 17, 2024Updated last year
- Official repository for the paper Multimodal Transformer Distillation for Audio-Visual Synchronization (ICASSP 2024).☆28Apr 3, 2024Updated last year
- "Tail-Aware Sperm Analysis for Transparent Tracking of Spermatozoa" Official Implementation☆10Jan 21, 2026Updated 3 weeks ago
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆34Jun 25, 2021Updated 4 years ago
- This is an Ai based Yoga Pose Detection System☆10Jul 5, 2022Updated 3 years ago
- Primus-SaFE(Stability and Fault Endurance)☆50Updated this week
- Super Flappy Bird in p5.js☆10Mar 8, 2021Updated 4 years ago
- EMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction☆267May 19, 2024Updated last year
- [KDD24-ADS] R-Eval: A Unified Toolkit for Evaluating Domain Knowledge of Retrieval Augmented Large Language Models☆11Apr 9, 2024Updated last year
- ☆11Sep 26, 2022Updated 3 years ago
- awesome-audio-visual-robustness☆11Jan 27, 2024Updated 2 years ago
- ☆11Feb 16, 2025Updated 11 months ago
- UW DigiPsych Prosody Feature Extraction Repository☆13May 16, 2019Updated 6 years ago
- python爬取历年天气并用pyecharts可视化分析☆10Dec 23, 2021Updated 4 years ago
- ☆10Oct 17, 2022Updated 3 years ago
- The official repo/implementation of the paper "Training a Singing Transcription Model Using Connectionist Temporal Classification Loss an…☆12Mar 25, 2025Updated 10 months ago
- The demo for "Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem".☆12Oct 25, 2021Updated 4 years ago
- Code the ICML 2024 paper: "Variance-reduced Zeroth-Order Methods for Fine-Tuning Language Models"☆11Jun 25, 2024Updated last year
- Repository for the WACV 2024 paper "PsyMo: A Dataset for Estimating Self-Reported Psychological Traits from Gait"☆13Feb 22, 2024Updated last year
- Latex template for CUHK PhD Thesis☆11Jun 29, 2025Updated 7 months ago
- ☆10Sep 6, 2020Updated 5 years ago
- ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing'☆44Oct 31, 2022Updated 3 years ago
- The official repository of Dynamic-SUPERB.☆197Jun 24, 2025Updated 7 months ago
- ☆11Jul 14, 2023Updated 2 years ago
- ☆27Sep 13, 2025Updated 5 months ago
- Hung-Yi Lee Linear Algebra 2018 Fall Homework☆10May 5, 2019Updated 6 years ago
- A dataset of real-world robocall audio recordings☆14Jul 25, 2024Updated last year
- 购物车数字加减框(HTML+CSS+JS一条龙)☆13Mar 21, 2018Updated 7 years ago
- ☆10Feb 24, 2022Updated 3 years ago
- EMO-SUPERB submission☆50Oct 13, 2025Updated 4 months ago
- Automatic diagnosis of alzheimer☆11Apr 19, 2019Updated 6 years ago
- ☆13Nov 16, 2020Updated 5 years ago
- Muse 2/S EEG Headset Vanilla Javascript Library☆12Jun 24, 2022Updated 3 years ago
- A PyTorch native platform for training generative AI models☆15Nov 18, 2025Updated 2 months ago
- ☆15May 16, 2024Updated last year