☆18 · Aug 29, 2022 · Updated 3 years ago
Alternatives and similar repositories for how-to-asr
Users that are interested in how-to-asr are comparing it to the libraries listed below.
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit ☆13 · Nov 18, 2022 · Updated 3 years ago
- Tutorial at EuroSciPy 2019/2022 ☆11 · Aug 15, 2023 · Updated 2 years ago
- ☆14 · Sep 20, 2023 · Updated 2 years ago
- Speech Emotion Recognition using transfer learning with wav2vec on IEMOCAP. ☆17 · Aug 8, 2021 · Updated 4 years ago
- A repo with scripts to test and play around with Facebook's recent llama models! 🤗 ☆28 · Jul 25, 2023 · Updated 2 years ago
- A unified dataset of multilingual emotional human utterances ☆28 · Jan 16, 2026 · Updated 3 months ago
- ☆10 · Nov 15, 2025 · Updated 5 months ago
- ☆13 · Jul 14, 2018 · Updated 7 years ago
- This repository includes the code to reproduce our paper [Explainable deepfake and spoofing detection: an attack analysis using SHapley A… ☆12 · Jan 24, 2024 · Updated 2 years ago
- MCW Continuous delivery in VSTS and Azure Cloud Workshop ☆11 · Jan 15, 2021 · Updated 5 years ago
- An implementation of Neural Style Transfer for Audio using Pytorch. ☆11 · Dec 14, 2017 · Updated 8 years ago
- ☆11 · Nov 5, 2021 · Updated 4 years ago
- ☆10 · May 25, 2023 · Updated 2 years ago
- Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers" ☆14 · Feb 24, 2025 · Updated last year
- The simplest repository for training medium-sized BackpackLM for cs224n ☆25 · Aug 13, 2023 · Updated 2 years ago
- Score Normalization for NIST 2019 Speaker Recognition Evaluation ☆10 · Nov 8, 2019 · Updated 6 years ago
- This repository provides a small Python wrapper for the Matlab tool SNR Eval provided by Labrosa: https://labrosa.ee.columbia.edu/project… ☆12 · Jun 22, 2022 · Updated 3 years ago
- Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning ☆15 · Jun 23, 2024 · Updated last year
- Pretrained spoken language classifiers from audio. ☆10 · Jan 21, 2021 · Updated 5 years ago
- Demos and exercises for skills-back-end-java ☆10 · Jun 3, 2020 · Updated 5 years ago
- A PyTorch implementation of "Self-Supervised GNN that Jointly Learns to Augment" or "Jointly Learnable Data Augmentations for Self-Superv… ☆13 · Dec 13, 2021 · Updated 4 years ago
- This repository contains the contents of Session 1 of the 'Practically ML' Workshop. ☆12 · Apr 28, 2019 · Updated 6 years ago
- Unofficial instructions for changing Python kernel version on Google Colab. ☆25 · Apr 21, 2025 · Updated 11 months ago
- Buscombe & Ritchie (2018) Landscape Classification with Deep Neural Networks. Geosciences 2018, 8(7), 244 ☆22 · Jul 6, 2018 · Updated 7 years ago
- [NeurIPS 2022] "Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Spee… ☆17 · Sep 19, 2023 · Updated 2 years ago
- This is an intuitive explanation of Representation Learning with Contrastive Predictive Coding using code provided by jefflai108 that use… ☆10 · Jan 25, 2021 · Updated 5 years ago
- ☆10 · Sep 19, 2022 · Updated 3 years ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units ☆18 · Oct 2, 2024 · Updated last year
- A mini, simple, and fast end-to-end automatic speech recognition toolkit. ☆53 · Dec 6, 2022 · Updated 3 years ago
- steps to perform text-based speaker diarization with kaldi toolkit ☆12 · Nov 2, 2018 · Updated 7 years ago
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition" ☆10 · Mar 15, 2023 · Updated 3 years ago
- AI-powered yoga pose correction web application ☆17 · Jul 6, 2019 · Updated 6 years ago
- Pure-PyTorch Parakeet TDT inference ☆36 · Mar 10, 2026 · Updated last month
- Objective metrics used in several text-to-speech (TTS) papers. ☆53 · Jun 17, 2025 · Updated 10 months ago
- stratigraphic machine-learning - active work moved to Predictatops ☆19 · Apr 16, 2019 · Updated 7 years ago
- visual question answering prompting recipes for large vision-language models ☆28 · Sep 14, 2024 · Updated last year
- Analyzing the tree of imports of running Python code. ☆12 · Feb 17, 2023 · Updated 3 years ago
- ☆16 · Oct 29, 2023 · Updated 2 years ago
- Welcome to the Real-Time Voice Activity Detection (VAD) program, powered by Silero-VAD model! 🚀 This program allows you to perform live … ☆12 · Jul 9, 2023 · Updated 2 years ago