Hume AI ML Competitions
☆28Apr 7, 2026Updated this week
Alternatives and similar repositories for competitions
Users that are interested in competitions are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Accompany code to reproduce the baselines of the International Multimodal Sentiment Analysis Challenge (MuSe 2020).☆16Dec 8, 2022Updated 3 years ago
- ☆31Jun 30, 2023Updated 2 years ago
- ☆28May 13, 2022Updated 3 years ago
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit☆13Nov 18, 2022Updated 3 years ago
- 56 language, 1 model Multilingual ASR☆24Jul 25, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- UW DigiPsych Prosody Feature Extraction Repository☆13May 16, 2019Updated 6 years ago
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆15Feb 28, 2026Updated last month
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆16Mar 28, 2023Updated 3 years ago
- Dynamic vision-guided speaker embedding for audio-visual speaker diarization☆12Jul 5, 2022Updated 3 years ago
- ☆12Aug 24, 2020Updated 5 years ago
- Reproducing the baselines of the 2nd Multimodal Sentiment Analysis Challenge (MuSe 2021)☆40Nov 28, 2021Updated 4 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Jun 17, 2022Updated 3 years ago
- ☆18Apr 21, 2023Updated 2 years ago
- ☆43Jan 13, 2022Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s…☆28Mar 14, 2023Updated 3 years ago
- ☆16Jun 13, 2022Updated 3 years ago
- A unified dataset of multilingual emotional human utterances☆28Jan 16, 2026Updated 2 months ago
- The Kyoyo Language Modeling Toolkit☆27Nov 27, 2014Updated 11 years ago
- The case study and multilingfual performance of ICASSP submission☆24Sep 24, 2022Updated 3 years ago
- MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation☆16Sep 2, 2024Updated last year
- The code for DCASE2021 task5 submission.☆20Feb 21, 2022Updated 4 years ago
- Changes to QEMU to accomodate the teensy3.x arm platform (Cortex-m4)☆16Oct 13, 2019Updated 6 years ago
- Poetry binary builds☆22May 27, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆20Feb 27, 2024Updated 2 years ago
- TheDeepChecker: Dynamic Debugger for Neural Networks Training Programs☆10Nov 2, 2022Updated 3 years ago
- Speech Recognition Scoring Toolkit☆13Sep 30, 2015Updated 10 years ago
- ☆17Jan 30, 2023Updated 3 years ago
- The SEILS Dataset☆17Oct 24, 2021Updated 4 years ago
- This repository presents an evaluation framework for speech-to-speech (S2S) models, following the methodology described in the EmphAsses …☆25Jan 9, 2024Updated 2 years ago
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE☆15Nov 30, 2022Updated 3 years ago
- End-to-end MOdeling of ASR (Automatic Speech Recognition)☆33Feb 16, 2023Updated 3 years ago
- multilingual speech aligner☆76Nov 19, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Code to reproduce LREC Paper Simplifying Semantic Annotations of SMCalFlow☆25Mar 28, 2024Updated 2 years ago
- Trying to build an all in one speech-text language model - a bit like GPT-4o☆22Jun 1, 2024Updated last year
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Jun 5, 2020Updated 5 years ago
- Whisper fine-tuning event script to use multiple hf datasets☆32Dec 20, 2022Updated 3 years ago
- Yet another list of ML resources☆25Mar 27, 2025Updated last year
- Wenet speech to text for react native☆10Nov 1, 2022Updated 3 years ago
- The History of Speech Recognition to the Year 2030☆13Aug 14, 2021Updated 4 years ago