Speech to Text with Hugging Face and Wav2vec 2.0
☆35Feb 13, 2021Updated 5 years ago
Alternatives and similar repositories for speech-to-text
Users that are interested in speech-to-text are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Creating super-parallel corpora of more than 1500+ unique languages for NLP research☆34Dec 8, 2022Updated 3 years ago
- Rust crate to produce and consume Web Of Things Thing Descriptions☆13Apr 1, 2024Updated 2 years ago
- Accelerating CNN's convolution operation on GPUs by using memory-efficient data access patterns.☆14Dec 8, 2017Updated 8 years ago
- ☆32Dec 30, 2025Updated 4 months ago
- This repository will contain links to the most famous available books of ML that are online☆13Oct 15, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Hierarchical annotation - line (phrase), syllable, phoneme annotations of the jingju (Beijing opera) a-cappella singing dataset☆21Mar 1, 2017Updated 9 years ago
- Analysis and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models.☆15Jun 27, 2020Updated 5 years ago
- Local Action, Global Impact (Selected as Top 50 in the 2022 Solution Challenge.)☆17Jan 18, 2024Updated 2 years ago
- Voice activity detection and speaker gender segmentation audiovisual corpus☆16Jan 20, 2025Updated last year
- In this repo I show how to simple create an API for your machine learning models in Python☆12Nov 28, 2018Updated 7 years ago
- An implementation of Neural Style Transfer for Audio using Pytorch.☆11Dec 14, 2017Updated 8 years ago
- ☆12Sep 19, 2021Updated 4 years ago
- A Javascript Chatbot built with the Gemini AI☆10Jan 26, 2024Updated 2 years ago
- Github repository for inzva-ai project Audio Style Transfer☆56Oct 13, 2018Updated 7 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Data generator for stereo sound event localization and detection task of DCASE 2025 challenge☆16Jul 17, 2025Updated 10 months ago
- Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers"☆14Feb 24, 2025Updated last year
- Enhanced sound event localization and detection in real 360-degree audio-visual soundscapes (DCASE task3 format)☆14Mar 21, 2025Updated last year
- Score Normalization for NIST 2019 Speaker Recognition Evaluation☆10Nov 8, 2019Updated 6 years ago
- Streamlit Dashboard over Superstore Data stored in Postgres Docker container. With SQLAlchemy + Plotly Express☆12Oct 16, 2024Updated last year
- Official implementation of "sound distance estimation" WASPAA 23☆20Dec 31, 2023Updated 2 years ago
- Visual Hash for matching copies of visually similar images.☆16Mar 17, 2025Updated last year
- Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning☆15Jun 23, 2024Updated last year
- Official Repo for the Paper "AI as Humanity's Salieri: Quantifying Linguistic Creativity of Language Models via Systematic Attribution o…☆26Jan 12, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code for AccentDB.☆24May 28, 2021Updated 4 years ago
- A haskell wrapper for neo4j's Cypher REST API.☆20Jul 31, 2012Updated 13 years ago
- ☆14Sep 20, 2023Updated 2 years ago
- This is a intuitive explanation of Representation Learning with Contrastive Predictive Coding using code provided by jefflai108 that use…☆10Jan 25, 2021Updated 5 years ago
- Tutorial for Brats 2024 BraSyn (Missing Modality Synthesis) Challenge☆20Sep 30, 2024Updated last year
- Official repository for "3D MRI Synthesis with Slice-Based Latent Diffusion Models: Improving Tumor Segmentation Tasks in Data-Scarce Reg…☆16Jun 14, 2024Updated last year
- Various algorithms for voice activity detection☆22Jan 31, 2017Updated 9 years ago
- A python implementation of isolated word recognition using Hidden Markov Model☆41Mar 22, 2017Updated 9 years ago
- ☆12Sep 14, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆24Mar 24, 2023Updated 3 years ago
- Welcome to the Real-Time Voice Activity Detection (VAD) program, powered by Silero-VAD model! 🚀 This program allows you to perform live …☆12Jul 9, 2023Updated 2 years ago
- statically generated weekly digest of articles read in Pocket☆10May 14, 2019Updated 7 years ago
- Speech Recognition for speakers with speech disorders due to diseases like Cerebral Palsy, Parkinson or Amyotrophic Lateral Sclerosis ALS…☆23Mar 26, 2017Updated 9 years ago
- ☆10Aug 5, 2023Updated 2 years ago
- Use Jiotv In Kodi . Free Live Tv Streamin Repo .☆11Oct 22, 2025Updated 6 months ago
- Official code for our paper "Reasoning Models Hallucinate More: Factuality-Aware Reinforcement Learning for Large Reasoning Models"☆24Oct 31, 2025Updated 6 months ago