aliencaocao / TIL-2023
Champion at Brainhack TIL 2023: Team 10000SGDMRT
☆16Updated 11 months ago
Alternatives and similar repositories for TIL-2023
Users that are interested in TIL-2023 are comparing it to the libraries listed below
Sorting:
- Champion at Brainhack TIL 2022: Team 8000SGD_CAT☆13Updated last year
- ☆84Updated last year
- ☆287Updated 11 months ago
- OneService Hotline is a helpful AI assistant, responsible for helping users (primarily elderly) to submit a case or feedback on municipal…☆12Updated 4 months ago
- Whisper-Flamingo [Interspeech 2024] and mWhisper-Flamingo [IEEE SPL 2025] for Audio-Visual Speech Recognition and Translation☆155Updated last week
- Pytorch implementation of BigVSAN☆204Updated last year
- ☆165Updated 5 months ago
- Official Implementation of the work "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning"☆139Updated 5 months ago
- The official implementation of our paper "Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tu…☆84Updated 8 months ago
- 😎 Awesome lists about Speech Emotion Recognition☆88Updated 4 months ago
- Inference code for PaSST, using the HEAR API.☆33Updated last year
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.☆95Updated 8 months ago
- Repository contains code to fine-tune WhisperASR model☆23Updated 2 years ago
- Base repository for BrainHack TIL-AI Competition 2024☆13Updated 11 months ago
- Package pymcd☆35Updated 2 years ago
- Audiogen Codec☆135Updated 10 months ago
- Real-time binaural target sound extraction model.☆84Updated last year
- Real-time Speech-Text Foundation Model Toolkit (wip)☆228Updated last month
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆96Updated 9 months ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆83Updated last year
- Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".☆431Updated last year
- Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings…☆88Updated 4 months ago
- A simple, hackable text-to-speech system in PyTorch and MLX☆159Updated 2 months ago
- a curated list of speech datasets (110+ datasets, 75+ easy to download)☆131Updated 2 years ago
- This repository implements SummaryMixing, a simpler, faster and much cheaper replacement to self-attention for automatic speech recogniti…☆117Updated 8 months ago
- EVAR ~ Evaluation package for Audio Representations☆54Updated 2 weeks ago
- Masked Modeling Duo: Towards a Universal Audio Pre-training Framework☆99Updated 9 months ago
- XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)☆320Updated 9 months ago
- ☆214Updated last month
- ☆46Updated 8 months ago