☆18Aug 29, 2022Updated 3 years ago
Alternatives and similar repositories for how-to-asr
Users that are interested in how-to-asr are comparing it to the libraries listed below.
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit☆13Nov 18, 2022Updated 3 years ago
- ☆20Apr 19, 2023Updated 2 years ago
- Tutorial at EuroSciPy 2019/2022☆11Aug 15, 2023Updated 2 years ago
- ☆14Sep 20, 2023Updated 2 years ago
- Speech Emotion Recognition using transfer learning with wav2vec on IEMOCAP.☆17Aug 8, 2021Updated 4 years ago
- A repo with scripts to test and play around with Facebook's recent llama models! 🤗☆28Jul 25, 2023Updated 2 years ago
- HF's ML for Audio study group☆202Feb 27, 2023Updated 3 years ago
- A unified dataset of multilingual emotional human utterances☆29Jan 16, 2026Updated 2 months ago
- A Rust library and command-line tool to manage LDraw files (.ldr)☆15May 11, 2024Updated last year
- ☆14Mar 28, 2025Updated last year
- This repository includes the code to reproduce our paper [Explainable deepfake and spoofing detection: an attack analysis using SHapley A…☆12Jan 24, 2024Updated 2 years ago
- Lego Machine Learning Dataset☆12Oct 30, 2020Updated 5 years ago
- ☆18Mar 10, 2026Updated 2 weeks ago
- Polish datasets for grammatical error correction☆12Oct 13, 2023Updated 2 years ago
- An implementation of Neural Style Transfer for Audio using Pytorch.☆10Dec 14, 2017Updated 8 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Mar 30, 2021Updated 5 years ago
- ☆11Nov 5, 2021Updated 4 years ago
- Artificial Intelligence/Machine Learning for programming LEGO with Pybricks☆17Feb 22, 2026Updated last month
- Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers"☆14Feb 24, 2025Updated last year
- Repository to identify Lego bricks automatically only using images☆15Nov 13, 2021Updated 4 years ago
- Score Normalization for NIST 2019 Speaker Recognition Evaluation☆10Nov 8, 2019Updated 6 years ago
- This repository provides a small Python wrapper for the Matlab tool SNR Eval provided by Labrosa: https://labrosa.ee.columbia.edu/project…☆12Jun 22, 2022Updated 3 years ago
- Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning☆15Jun 23, 2024Updated last year
- Pretrained spoken language classifiers from audio.☆10Jan 21, 2021Updated 5 years ago
- A curated list of events, hackathons, and communities focused on AI and tech in Poland☆29Aug 21, 2025Updated 7 months ago
- A PyTorch implementation of "Self-Supervised GNN that Jointly Learns to Augment" or "Jointly Learnable Data Augmentations for Self-Superv…☆13Dec 13, 2021Updated 4 years ago
- [NeurIPS 2022] "Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Spee…☆17Sep 19, 2023Updated 2 years ago
- This is an intuitive explanation of Representation Learning with Contrastive Predictive Coding using code provided by jefflai108 that use…☆10Jan 25, 2021Updated 5 years ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆18Oct 2, 2024Updated last year
- ☆10Sep 19, 2022Updated 3 years ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆53Dec 6, 2022Updated 3 years ago
- Steps to perform text-based speaker diarization with the Kaldi toolkit☆12Nov 2, 2018Updated 7 years ago
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"☆10Mar 15, 2023Updated 3 years ago
- Python scripts for generating images of Lego structures (masks, normals, Blender Cycles renders) for training ML models.☆13Jan 20, 2022Updated 4 years ago
- Objective metrics used in several text-to-speech (TTS) papers.☆52Jun 17, 2025Updated 9 months ago
- Analyzing the tree of imports of running Python code.☆12Feb 17, 2023Updated 3 years ago
- Evaluation of Sentence Representations in Polish☆23Dec 29, 2022Updated 3 years ago
- Welcome to the Real-Time Voice Activity Detection (VAD) program, powered by Silero-VAD model! 🚀 This program allows you to perform live …☆12Jul 9, 2023Updated 2 years ago
- A Web app demonstrating multimodal image search using Visualized-BGE model☆15Dec 1, 2024Updated last year