Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downstream tasks like translation and summarisation.
☆32Mar 20, 2021Updated 5 years ago
Alternatives and similar repositories for wav2vec2_transformers
Users that are interested in wav2vec2_transformers are comparing it to the libraries listed below.
- ☆11Nov 5, 2021Updated 4 years ago
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆11May 14, 2025Updated last year
- Live speech recognition using Facebook's wav2vec 2.0 model.☆379Feb 4, 2024Updated 2 years ago
- ☆27Mar 29, 2021Updated 5 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆75Oct 11, 2021Updated 4 years ago
- ☆13Sep 8, 2020Updated 5 years ago
- This project takes the arXiv dataset and builds an automatic tag classifier from the arXiv article/paper titles☆13Aug 18, 2021Updated 4 years ago
- This repository hosts code for converting the original MLP Mixer models (JAX) to TensorFlow.☆15Sep 29, 2021Updated 4 years ago
- Various test models in WNNX format. They can be viewed with `pip install wnetron && wnetron`☆12Jun 22, 2022Updated 3 years ago
- Tensorflow Node.js Examples☆25Mar 4, 2023Updated 3 years ago
- ☆12Oct 9, 2025Updated 8 months ago
- Experiments with generating GPT-2 fanfiction on specified topics.☆11Jun 2, 2019Updated 7 years ago
- Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena.☆44Nov 2, 2022Updated 3 years ago
- Code for ACL 2023 main conference paper "Back Translation for Speech-to-text Translation Without Transcripts".☆11Oct 25, 2023Updated 2 years ago
- Improving Low-Resource Neural Machine Translation of Related Languages by Transfer Learning☆19Nov 3, 2022Updated 3 years ago
- Official repository for U-SAM (Interspeech 2025)☆28Jun 3, 2025Updated last year
- ☆11Oct 14, 2023Updated 2 years ago
- An end to end ASR Transformer model training repo☆13Dec 8, 2021Updated 4 years ago
- Wav2Vec 2.0 catalan training scripts and models☆12Jun 18, 2021Updated 4 years ago
- PyTorch implementation of DeepMind's WaveRNN model☆13Apr 5, 2020Updated 6 years ago
- ☆13Nov 26, 2019Updated 6 years ago
- A simple and humble image captioning application, based on a neural network built with Keras☆10Sep 23, 2022Updated 3 years ago
- A TensorFlow LSTM spam detector utilizing GloVe word embeddings.☆12Nov 9, 2019Updated 6 years ago
- ☆206Feb 22, 2022Updated 4 years ago
- ☆15Mar 31, 2022Updated 4 years ago
- ☆14Jul 13, 2021Updated 4 years ago
- Uncertainty-Aware Curriculum Learning for Neural Machine Translation (ACL 2020)☆11Jun 12, 2020Updated 6 years ago
- Code for ACL 2023 main conference paper "CMOT: Cross-modal Mixup via Optimal Transport for Speech Translation"☆17Oct 29, 2024Updated last year
- ☆14Dec 3, 2019Updated 6 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆111Aug 31, 2022Updated 3 years ago
- Urban Sound Classification: striving towards a fair comparison☆17Dec 11, 2020Updated 5 years ago
- Contains code to demonstrate distributed training in TensorFlow 2 with AI Platform and custom Docker containers.☆20Apr 28, 2021Updated 5 years ago
- Quran Offline powered by Electron, React, NeDB☆13Apr 9, 2018Updated 8 years ago
- Source code for ACL 2023 paper "End-to-End Simultaneous Speech Translation with Differentiable Segmentation"☆37Dec 6, 2023Updated 2 years ago
- Build a Movie Reviews Sentiment Classifier with Google's BERT Language Model☆13Oct 23, 2019Updated 6 years ago
- Some Flutter applications to interact with all the home automation features in my home.☆16Jan 25, 2021Updated 5 years ago
- Mellotron singing synthesizer using CPU☆13Mar 24, 2023Updated 3 years ago
- Simple Telegram bot to annotate and verify automatic speech recognition datasets☆12Mar 30, 2021Updated 5 years ago
- Code for ProtAugment: Unsupervised diverse short-texts paraphrasing for intent detection meta-learning☆25Aug 22, 2022Updated 3 years ago