aliencaocao / TIL-2023
Champion at Brainhack TIL 2023: Team 10000SGDMRT
☆15Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for TIL-2023
- Champion at Brainhack TIL 2022: Team 8000SGD_CAT☆13Updated 7 months ago
- Base repository for BrainHack TIL-AI Competition 2024☆13Updated 5 months ago
- ☆257Updated 5 months ago
- Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation☆134Updated 10 months ago
- ☆152Updated last year
- ☆54Updated this week
- A list of speech recognition learning resources including courses, books, tutorials, papers and toolkits.☆48Updated 6 months ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆221Updated 3 weeks ago
- Speaker Diarization with Transformers☆59Updated 6 months ago
- This repository implements SummaryMixing, a simpler, faster and much cheaper replacement to self-attention for automatic speech recogniti…☆112Updated 2 months ago
- Solutions provided to Chip Huyen's Machine Learning Interview Book with GPT☆34Updated 11 months ago
- [Interspeech 2024] Whisper-Flamingo: Integrating Visual Features into Whisper for Audio-Visual Speech Recognition and Translation☆80Updated this week
- Scripts for computing the Intelligibility and CLVP scores for evaluating TTS models☆141Updated 11 months ago
- Official repository of SepReformer for speech separation☆135Updated 2 weeks ago
- Collection of Open Source Speech Data☆147Updated 2 weeks ago
- EMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction☆232Updated 6 months ago
- Reference implementation of Mistral AI 7B v0.1 model.☆27Updated 10 months ago
- ☆85Updated 7 months ago
- The Gridspace-Stanford Harper Valley speech dataset. Created in support of CS224S.☆42Updated 3 years ago
- Joint speech-language model - respond directly to audio!☆30Updated 6 months ago
- Finetune VITS and MMS using HuggingFace's tools☆123Updated 7 months ago
- The Hugging Face Course on Transformers for Audio☆338Updated this week
- Zero-shot Domain-sensitive Speech Recognition with Prompt-conditioning Fine-tuning (ASRU2023)☆26Updated last year
- Joint speech-language model - respond directly to audio!☆356Updated 4 months ago
- This is the audio sample repository for speech separation model "MossFormer2".☆108Updated 8 months ago
- ☆347Updated 8 months ago
- ☆24Updated 10 months ago
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.☆259Updated last year
- ☆307Updated 2 months ago
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration☆86Updated last month