drumpt / SGEMLinks
Official PyTorch implementation of SGEM: Test-Time Adaptation for Automatic Speech Recognition via Sequential-Level Generalized Entropy Minimization (INTERSPEECH 2023 Oral Presentation)
☆35Updated 10 months ago
Alternatives and similar repositories for SGEM
Users that are interested in SGEM are comparing it to the libraries listed below
Sorting:
- Test-time adaptation for speech recognition model by single utterance. The official implementation of "Listen, Adapt, Better WER: Source-…☆19Updated 3 years ago
- Details of the datasets for Few-shot class-incremental audio classification☆10Updated last year
- ☆11Updated last year
- A curated list of awesome adversarial reprogramming and input prompting methods for neural networks since 2022☆36Updated last year
- [ICASSP 2023] FedAudio: A Federated Learning Benchmark for Audio and Speech Tasks☆49Updated last year
- A Pytorch implementation of "Rare Tokens Degenerate All Tokens: Improving Neural Text Generation via Adaptive Gradient Gating for Rare To…☆10Updated 3 years ago
- Official Codebase of "A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition" (ICML 2023)☆12Updated 2 years ago
- Official Implementation of "Prefix tuning for Automated Audio Captioning(ICASSP 2023)"☆30Updated last year
- ☆12Updated 6 months ago
- WildDESED: A LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection☆15Updated 8 months ago
- ☆9Updated 10 months ago
- Pytorch implementation of "Towards Practical and Efficient Image-to-Speech Captioning with Vision-Language Pre-training and Multi-modal T…☆12Updated last year
- [NeurIPS 2022] "Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Spee…☆16Updated last year
- ☆17Updated last year
- Rainbow Keywords - Official PyTorch Implementation☆13Updated last year
- Continual Learning Method RWM for AAAI 2024☆22Updated 9 months ago
- Can audio-visual integration strengthen robustness under multimodal attacks?☆28Updated 3 years ago
- ☆28Updated 9 months ago
- (SLT 2024) Learning Video Temporal Dynamics with Cross-Modal Attention for Robust Audio-Visual Speech Recognition☆12Updated 8 months ago
- Self-Contrastive Learning: Single-viewed Supervised Contrastive Framework using Sub-network (AAAI 2023)☆20Updated last year
- Source code for publication: "Spectrum Correction: Acoustic Scene Classification with Mismatched Recording Devices"☆12Updated 3 years ago
- The code of the paper "Minimizing the Accumulated Trajectory Error to Improve Dataset Distillation" (CVPR2023)☆18Updated 2 years ago
- ☆12Updated 2 years ago
- (Hybrid) BYOL-S feature extractor using serab-byols package in pytorch.☆27Updated last year
- Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations☆92Updated last year
- (ICASSP 2024) Official Implementation of "Stethoscope-guided Supervised Contrastive Learning for Cross-domin Adaptation on Respiratory So…