Vaibhavs10/ml-with-audio

HF's ML for Audio study group

☆201

Alternatives and similar repositories for ml-with-audio

Users that are interested in ml-with-audio are comparing it to the libraries listed below.

Vaibhavs10 / how-to-asr
☆18 · Aug 29, 2022 · Updated 3 years ago
Vaibhavs10 / ml-with-text
[Tutorial] Demystifying Natural Language Processing with Python
☆23 · Sep 7, 2019 · Updated 6 years ago
anton-l / wav2vec-toolkit
A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models
☆30 · Apr 21, 2021 · Updated 5 years ago
patrickvonplaten / Wav2Vec2_PyCTCDecode
Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode
☆110 · Aug 31, 2022 · Updated 3 years ago
jqueguiner / wav2vec2-sprint
docker for HF wav2vec2-sprint
☆13 · Mar 26, 2021 · Updated 5 years ago
Speech-Lab-IITM / CCC-wav2vec-2.0
Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres…
☆23 · Mar 18, 2024 · Updated 2 years ago
Hamtech-ai / Persian-Image-Captioning
A Persian Image Captioning model based on Vision Encoder Decoder Models of the transformers🤗.
☆20 · Feb 27, 2022 · Updated 4 years ago
eloimoliner / unconditional-diff-STFT
Unconditional music synthesis using a diffusion model in the STFT domain
☆12 · May 31, 2022 · Updated 4 years ago
gbegus / DeepPhonologyTool
Train a fiwGAN or ciwGAN model using your own training data
☆14 · Oct 13, 2022 · Updated 3 years ago
msalhab96 / Conformer
An implementation for "Conformer: Convolution-augmented Transformer for Speech Recognition" Paper
☆20 · Aug 16, 2022 · Updated 3 years ago
HamedHemati / Tacotron-2-Persian
Tacotron 2 - Persian
☆37 · Dec 28, 2021 · Updated 4 years ago
skakouros / s3prl_attentive_correlation
Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit
☆13 · Nov 18, 2022 · Updated 3 years ago
jonatasgrosman / huggingsound
HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools
☆470 · Sep 20, 2023 · Updated 2 years ago
AyumuKasuga / MoneyTrackerBot
Simple interface between telegram and google spread sheet to track money spending
☆11 · May 19, 2017 · Updated 9 years ago
huggingface / community-events
Place where folks can contribute to 🤗 community events
☆427 · Dec 7, 2023 · Updated 2 years ago
clam004 / unsupervised-speech-representation-learning
This is an intuitive explanation of Representation Learning with Contrastive Predictive Coding using code provided by jefflai108 that use…
☆10 · Jan 25, 2021 · Updated 5 years ago
jatinchowdhury18 / RNNAudioEffects
Real-time audio effects using single sample recurrent neural networks
☆24 · Jul 14, 2020 · Updated 6 years ago
kensho-technologies / pyctcdecode
A fast and lightweight python-based CTC beam search decoder for speech recognition.
☆469 · Jul 13, 2023 · Updated 3 years ago
huggingface / speechbox
☆358 · Mar 17, 2024 · Updated 2 years ago
emiljoswin / Deep-Humor-Generation-Analysis-and-Classification-of-Humor-using-Transformers
Analyse the self-attention patterns in BERT for humor classification and verify the linguistic theory of humor, use GPT-2 to create humor…
☆11 · Apr 30, 2020 · Updated 6 years ago
reniew / NSMC_Sentimental-Analysis
Korean text sentiment analysis using Naver movie review data
☆12 · Aug 22, 2018 · Updated 7 years ago
nkandpa2 / music_enhancement
Implementation for "Music Enhancement via Image Translation and Vocoding"
☆54 · Apr 28, 2022 · Updated 4 years ago
noajshu / scotus-speech
Corpus of oral arguments (recorded speech + official transcripts) of the United States Supreme Court
☆22 · Dec 8, 2022 · Updated 3 years ago
jeongukjae / namuwiki-corpus
Namuwiki dataset segmented into sentences. Download it from Releases or through tfds-korean.
☆19 · Jun 16, 2021 · Updated 5 years ago
EdwinYam / J-Net
J-Net is aimed at audio separation with a randomly weighted encoder.
☆12 · Oct 23, 2019 · Updated 6 years ago
HLasse / multidiagnosis-speech
☆10 · Jun 23, 2023 · Updated 3 years ago
mlcoursemm / ml2020autumn
Machine Learning, Course 2020 Autumn (lectures + seminars)
☆10 · Sep 4, 2021 · Updated 4 years ago
mallahyari / Farsi-datasets
A collection of Farsi (Persian) datasets
☆27 · Jul 15, 2021 · Updated 5 years ago
Sreyan88 / RECAP
Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning
☆16 · Jun 23, 2024 · Updated 2 years ago
aliabd / fastgradio
Build fast gradio demos of fastai learners
☆35 · Sep 23, 2021 · Updated 4 years ago
nathantspencer / DMC-ColorCodes
A simple Python scraper and the resulting CSV file, which contains RGB hex color codes for each of the DMC embroidery floss colors.
☆13 · Dec 13, 2017 · Updated 8 years ago
korean-named-entity / konne
Korean Nested Named Entity Corpus
☆20 · May 13, 2023 · Updated 3 years ago
seanockert / rgb-to-dmc
Convert an RGB colour to its closest matching DMC thread for embroidery and cross-stitching
☆16 · Jun 9, 2021 · Updated 5 years ago
Kazuhito00 / onnx-model-encrypt-sample
A sample that encrypts/decrypts ONNX models using pyca/cryptography
☆16 · Mar 19, 2022 · Updated 4 years ago
Vaibhavs10 / notebooks
☆127 · Mar 19, 2025 · Updated last year
Vaibhavs10 / translate-with-whisper
☆157 · Jun 26, 2023 · Updated 3 years ago
csteinmetz1 / auraloss
Collection of audio-focused loss functions in PyTorch
☆874 · Jul 30, 2024 · Updated last year
shoegazerstella / instruments_activity_detection
Detect individual instruments activity in an audio file. 🎤🎹🎸🥁
☆17 · Jun 29, 2021 · Updated 5 years ago
msalhab96 / AraSpell
A framework for Arabic spelling correction using different seq2seq model architectures such as transformers and RNNs
☆25 · Jul 21, 2024 · Updated 2 years ago