Speech to Text with Hugging Face and Wav2vec 2.0
☆35 · Feb 13, 2021 · Updated 5 years ago
Alternatives and similar repositories for speech-to-text
Users who are interested in speech-to-text are comparing it to the libraries listed below.
- Creating super-parallel corpora of more than 1,500 unique languages for NLP research ☆34 · Dec 8, 2022 · Updated 3 years ago
- Record GPU memory accesses of a CUDA program and visualize the access pattern in a browser ☆13 · Nov 17, 2020 · Updated 5 years ago
- Creating a hybrid recommender system using LightFM. Learn how to tackle the cold start problem. ☆13 · Feb 14, 2022 · Updated 4 years ago
- Universal differential equations for ecologists ☆15 · Apr 24, 2026 · Updated last month
- Accelerating CNN's convolution operation on GPUs by using memory-efficient data access patterns. ☆14 · Dec 8, 2017 · Updated 8 years ago
- [AAAI 2026] This is the official implementation of the paper "ExtendAttack: Attacking Servers of LRMs via Extending Reasoning". ☆23 · Mar 18, 2026 · Updated 2 months ago
- ☆25 · Aug 29, 2025 · Updated 9 months ago
- a simplified version of wav2vec(1.0, vq, 2.0) in fairseq ☆170 · Sep 21, 2020 · Updated 5 years ago
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit ☆13 · Nov 18, 2022 · Updated 3 years ago
- Minimal implementation of Contrastive Predictive Coding for audio. ☆18 · Nov 17, 2019 · Updated 6 years ago
- ☆18 · Nov 15, 2021 · Updated 4 years ago
- ☆13 · Feb 5, 2022 · Updated 4 years ago
- This repository will contain links to the most famous available books of ML that are online ☆13 · Oct 15, 2024 · Updated last year
- ☆13 · Feb 5, 2018 · Updated 8 years ago
- This is the public repository for SALSA-Lite features for polyphonic sound event localization and detection using microphone arrays. ☆15 · Dec 3, 2021 · Updated 4 years ago
- Hierarchical annotation - line (phrase), syllable, phoneme annotations of the jingju (Beijing opera) a-cappella singing dataset ☆21 · Mar 1, 2017 · Updated 9 years ago
- Code for the paper: Image Denoising and the Generative Accumulation of Photons ☆22 · Aug 2, 2023 · Updated 2 years ago
- This repository includes the code to reproduce our paper [Explainable deepfake and spoofing detection: an attack analysis using SHapley A… ☆12 · Jan 24, 2024 · Updated 2 years ago
- Microsoft Teams sample app for tabs Azure AD SSO in Node.js ☆25 · Nov 8, 2024 · Updated last year
- A simple Python script to convert FOA audio to binaural. ☆16 · Nov 29, 2022 · Updated 3 years ago
- Voice activity detection and speaker gender segmentation audiovisual corpus ☆16 · Jan 20, 2025 · Updated last year
- An implementation of Neural Style Transfer for Audio using PyTorch. ☆11 · Dec 14, 2017 · Updated 8 years ago
- GitHub repository for inzva-ai project Audio Style Transfer ☆56 · Oct 13, 2018 · Updated 7 years ago
- ☆45 · Dec 15, 2022 · Updated 3 years ago
- Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers" ☆14 · Feb 24, 2025 · Updated last year
- Score Normalization for NIST 2019 Speaker Recognition Evaluation ☆10 · Nov 8, 2019 · Updated 6 years ago
- Streamlit Dashboard over Superstore Data stored in Postgres Docker container. With SQLAlchemy + Plotly Express ☆12 · Oct 16, 2024 · Updated last year
- Deploying Models to Production with MLflow and AWS SageMaker ☆24 · Sep 15, 2021 · Updated 4 years ago
- Official implementation of "sound distance estimation" WASPAA 23 ☆20 · Dec 31, 2023 · Updated 2 years ago
- Visual Hash for matching copies of visually similar images. ☆16 · Mar 17, 2025 · Updated last year
- Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning ☆15 · Jun 23, 2024 · Updated last year
- Pretrained spoken language classifiers from audio. ☆10 · Jan 21, 2021 · Updated 5 years ago
- Code for AccentDB. ☆24 · May 28, 2021 · Updated 5 years ago
- A Haskell wrapper for Neo4j's Cypher REST API. ☆20 · Jul 31, 2012 · Updated 13 years ago
- ☆14 · Sep 20, 2023 · Updated 2 years ago
- [NeurIPS 2022] "Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Spee… ☆17 · Sep 19, 2023 · Updated 2 years ago
- This is an intuitive explanation of Representation Learning with Contrastive Predictive Coding using code provided by jefflai108 that use… ☆10 · Jan 25, 2021 · Updated 5 years ago
- simple kv store for streams ☆36 · Mar 14, 2013 · Updated 13 years ago
- Tr-VAD: An Efficient Transformer based Voice Activity Detection Model ☆18 · Aug 1, 2024 · Updated last year