Speech to Text with Hugging Face and Wav2vec 2.0
☆35Feb 13, 2021Updated 5 years ago
Alternatives and similar repositories for speech-to-text
Users that are interested in speech-to-text are comparing it to the libraries listed below.
- Record GPU memory accesses of a CUDA program and visualize the access pattern in a browser☆13Nov 17, 2020Updated 5 years ago
- Accelerating CNN's convolution operation on GPUs by using memory-efficient data access patterns.☆14Dec 8, 2017Updated 8 years ago
- Google Earth Pro image extractor and alignment☆13Feb 9, 2018Updated 8 years ago
- ☆18Nov 15, 2021Updated 4 years ago
- ☆13Feb 5, 2022Updated 4 years ago
- This repository will contain links to the most famous available books of ML that are online☆12Oct 15, 2024Updated last year
- ☆14Feb 5, 2018Updated 8 years ago
- Analysis and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models.☆15Jun 27, 2020Updated 5 years ago
- In this repo I show how to simply create an API for your machine learning models in Python☆12Nov 28, 2018Updated 7 years ago
- An implementation of Neural Style Transfer for Audio using Pytorch.☆11Dec 14, 2017Updated 8 years ago
- Protecting Real-Time GPU Kernels on Integrated CPU-GPU SoC Platforms☆12Apr 9, 2018Updated 8 years ago
- Streamlit Dashboard over Superstore Data stored in Postgres Docker container. With SQLAlchemy + Plotly Express☆12Oct 16, 2024Updated last year
- Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning☆15Jun 23, 2024Updated last year
- Pretrained spoken language classifiers from audio.☆10Jan 21, 2021Updated 5 years ago
- Official Repo for the Paper "AI as Humanity's Salieri: Quantifying Linguistic Creativity of Language Models via Systematic Attribution o…☆26Jan 12, 2025Updated last year
- A PyTorch implementation of "Self-Supervised GNN that Jointly Learns to Augment" or "Jointly Learnable Data Augmentations for Self-Superv…☆13Dec 13, 2021Updated 4 years ago
- Code for AccentDB.☆23May 28, 2021Updated 4 years ago
- ☆14Sep 20, 2023Updated 2 years ago
- [NeurIPS 2022] "Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Spee…☆17Sep 19, 2023Updated 2 years ago
- simple kv store for streams☆36Mar 14, 2013Updated 13 years ago
- This is an intuitive explanation of Representation Learning with Contrastive Predictive Coding using code provided by jefflai108 that use…☆10Jan 25, 2021Updated 5 years ago
- Tr-VAD: An Efficient Transformer based Voice Activity Detection Model☆17Aug 1, 2024Updated last year
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆18Oct 2, 2024Updated last year
- Example project showing how we can compare TensorFlow and TensorFlow Lite models☆26Apr 3, 2020Updated 6 years ago
- Welcome to the Real-Time Voice Activity Detection (VAD) program, powered by Silero-VAD model! 🚀 This program allows you to perform live …☆12Jul 9, 2023Updated 2 years ago
- statically generated weekly digest of articles read in Pocket☆10May 14, 2019Updated 6 years ago
- Speech Recognition for speakers with speech disorders due to diseases like Cerebral Palsy, Parkinson or Amyotrophic Lateral Sclerosis ALS…☆23Mar 26, 2017Updated 9 years ago
- A nuxt module to expose Vuex state in the browser URL for easy sharing☆12Aug 28, 2017Updated 8 years ago
- The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix (work accepted at I…☆18Feb 17, 2023Updated 3 years ago
- Automatic Speech Recognition (ASR) model QuartzNet trained on English CommonVoice. In PyTorch with CTC loss and beam search.☆16Nov 5, 2020Updated 5 years ago
- ☆28Nov 16, 2017Updated 8 years ago
- ☆14Mar 25, 2023Updated 3 years ago
- Library for fast text representation and classification.☆10Apr 17, 2022Updated 3 years ago
- How to classify spam SMS by using NLP☆11Aug 20, 2024Updated last year
- A menu and CLI based console program to play and write songs for the PC Speaker☆15Aug 1, 2019Updated 6 years ago
- Dynamic time warping (DTW) functions specifically for speech alignment.☆30May 6, 2024Updated last year
- Using Extractive summarization to summarize medium posts☆11Nov 17, 2019Updated 6 years ago
- Natural Language Processing Project☆11Jul 6, 2021Updated 4 years ago
- Scheduled, asynchronous JSON fetching for Node.js applications☆12Apr 2, 2026Updated last week