Google-Health / hearLinks
☆22Updated last month
Alternatives and similar repositories for hear
Users that are interested in hear are comparing it to the libraries listed below
Sorting:
- This is the official code release for OPERA: OPEn Respiratory Acoustic foundation models☆65Updated 7 months ago
- This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" and "Efficient Fi…☆38Updated last year
- This repository contains the SpeechBrain Benchmarks☆128Updated 3 months ago
- SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆88Updated 5 years ago
- Official Implementation of the work "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning"☆155Updated 10 months ago
- Official implementation of FOP method as described in "Fusion and Orthogonal Projection for Improved Face-Voice Association"☆19Updated last year
- This repo contains the code for "Voice Disorder Analysis: A Transformer-based Approach", accepted at Interspeech 2024☆13Updated last year
- ☆85Updated last year
- [SLT'24] The official implementation of SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model☆126Updated last year
- COLA contrastive pre-training method implemented in PyTorch☆43Updated 4 years ago
- Dataset and baseline code for the VocalSound dataset (ICASSP2022).☆152Updated 2 years ago
- Emotion recognition library for PyTorch☆22Updated 4 years ago
- Official implementation of RAVEn (ICLR 2023) and BRAVEn (ICASSP 2024)☆71Updated 7 months ago
- Patch-Mix Contrastive Learning with Audio Spectrogram Transformer on Respiratory Sound Classification (INTERSPEECH 2023)☆71Updated 7 months ago
- ICSD Dataset☆35Updated 4 months ago
- Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations☆95Updated last year
- Implementation of the paper "wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations" in Pytorch.☆50Updated 2 years ago
- Emotion Recognition ToolKit (ERTK): tools for emotion recognition. Dataset processing, feature extraction, experiments,☆56Updated 11 months ago
- (Hybrid) BYOL-S feature extractor using serab-byols package in pytorch.☆27Updated last year
- PyTorch transcribed audioset classifier, including VGGish and YAMNet, along with utils to manipulate autioset category ontology.☆91Updated 6 months ago
- Masked Modeling Duo: Towards a Universal Audio Pre-training Framework☆118Updated 3 weeks ago
- Repo for Visual Acoustic Matching, CVPR 2022☆68Updated 2 years ago
- (INTERSPEECH 2024) Official Implementation of "BTS: Bridging Text and Sound Modalities for Metadata-Aided Respiratory Sound Classificatio…☆25Updated 3 months ago
- The repo host the code and model of MAViL.☆44Updated 2 years ago
- Code for voicing silent speech from EMG. Official repository for the papers "Digital Voicing of Silent Speech" at EMNLP 2020 and "An Imp…☆140Updated last year
- Zafar's Audio Functions in Python for audio signal analysis: STFT, inverse STFT, mel filterbank, mel spectrogram, MFCC, CQT kernel, CQT s…☆56Updated 2 months ago
- Official PyTorch implementation for "MMS-LLaMA: Efficient LLM-based Audio-Visual Speech Recognition with Minimal Multimodal Speech Tokens…☆34Updated 4 months ago
- The official implementation of the IJCAI 2024 paper "MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models".☆45Updated last year
- Whisper-Flamingo [Interspeech 2024] and mWhisper-Flamingo [IEEE SPL 2025] for Audio-Visual Speech Recognition and Translation☆182Updated 2 months ago
- Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".☆149Updated 2 years ago