Speech to Text with Hugging Face and Wav2vec 2.0
☆35Feb 13, 2021Updated 5 years ago
Alternatives and similar repositories for speech-to-text
Users who are interested in speech-to-text are comparing it to the libraries listed below.
- Accelerating CNN's convolution operation on GPUs by using memory-efficient data access patterns.☆14Dec 8, 2017Updated 8 years ago
- Speech recognition with federated learning☆11Jan 9, 2020Updated 6 years ago
- Google Earth Pro image extractor and alignment☆13Feb 9, 2018Updated 8 years ago
- The program ranked first in Audio-only track of DCASE2024 Challenge task3.☆22Mar 2, 2026Updated 3 months ago
- Minimal implementation of Contrastive Predictive Coding for audio.☆18Nov 17, 2019Updated 6 years ago
- Eleventy 11ty e-commerce online shop ecwid free download☆12Aug 28, 2024Updated last year
- ☆18Nov 15, 2021Updated 4 years ago
- ☆34Dec 30, 2025Updated 5 months ago
- A sample project demonstrating how to use DotNetOpenAuth and ServiceStack to create an OAuth2 resource server.☆31Aug 27, 2013Updated 12 years ago
- ☆13Feb 5, 2022Updated 4 years ago
- This repository will contain links to the most famous available books of ML that are online☆13Oct 15, 2024Updated last year
- This is the public repository for SALSA-Lite features for polyphonic sound event localization and detection using microphone arrays.☆15Dec 3, 2021Updated 4 years ago
- Ecoacoustic analysis platform empowering conservationists to analyze acoustic data and to derive insights about the ecosystem at scale☆19Jun 19, 2026Updated last week
- ☆10Feb 18, 2022Updated 4 years ago
- Local Action, Global Impact (Selected as Top 50 in the 2022 Solution Challenge.)☆17Jan 18, 2024Updated 2 years ago
- Deep Complex UNet for speech enhancement, init from "https://github.com/chanil1218/DCUnet.pytorch"☆13Feb 21, 2020Updated 6 years ago
- A simple Python script to convert FOA audio to binaural.☆17Nov 29, 2022Updated 3 years ago
- Presenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.☆23Dec 27, 2019Updated 6 years ago
- In this repo I show how to simply create an API for your machine learning models in Python☆12Nov 28, 2018Updated 7 years ago
- My system for the DCASE 2022 Task 3 Sound Event Localization and Detection.☆12Nov 12, 2022Updated 3 years ago
- UNC How to Learn to Code Python Lectures☆13Jun 11, 2025Updated last year
- A Javascript Chatbot built with the Gemini AI☆10Jan 26, 2024Updated 2 years ago
- Data generator for stereo sound event localization and detection task of DCASE 2025 challenge☆17Jul 17, 2025Updated 11 months ago
- ☆45Dec 15, 2022Updated 3 years ago
- Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers"☆14Feb 24, 2025Updated last year
- ☆23Aug 31, 2022Updated 3 years ago
- Score Normalization for NIST 2019 Speaker Recognition Evaluation☆10Nov 8, 2019Updated 6 years ago
- Streamlit Dashboard over Superstore Data stored in Postgres Docker container. With SQLAlchemy + Plotly Express☆12Oct 16, 2024Updated last year
- Visual Hash for matching copies of visually similar images.☆16Mar 17, 2025Updated last year
- A PyTorch implementation of "Self-Supervised GNN that Jointly Learns to Augment" or "Jointly Learnable Data Augmentations for Self-Superv…☆13Dec 13, 2021Updated 4 years ago
- Code for AccentDB.☆24May 28, 2021Updated 5 years ago
- A haskell wrapper for neo4j's Cypher REST API.☆20Jul 31, 2012Updated 13 years ago
- [NeurIPS 2022] "Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Spee…☆17Sep 19, 2023Updated 2 years ago
- This is an intuitive explanation of Representation Learning with Contrastive Predictive Coding using code provided by jefflai108 that use…☆10Jan 25, 2021Updated 5 years ago
- simple kv store for streams☆36Mar 14, 2013Updated 13 years ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆17Oct 2, 2024Updated last year
- Tr-VAD: An Efficient Transformer based Voice Activity Detection Model☆18Aug 1, 2024Updated last year
- Python Turtle - A selection of geometric patterns☆12Apr 13, 2018Updated 8 years ago
- A comprehensive framework to test audio comprehension of Large Audio Language Models.☆66Jun 9, 2026Updated 2 weeks ago