Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downstream tasks like translation and summarisation.
☆32Mar 20, 2021Updated 5 years ago
Alternatives and similar repositories for wav2vec2_transformers
Users that are interested in wav2vec2_transformers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆11Nov 5, 2021Updated 4 years ago
- ☆14Jan 24, 2022Updated 4 years ago
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆11May 14, 2025Updated 11 months ago
- A live speech recognition using Facebooks wav2vec 2.0 model.☆378Feb 4, 2024Updated 2 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆75Oct 11, 2021Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆13Sep 8, 2020Updated 5 years ago
- This project takes the arXiv dataset and builds an automatic tag classifier from the arXiv article/paper titles☆13Aug 18, 2021Updated 4 years ago
- This repository hosts code for converting the original MLP Mixer models (JAX) to TensorFlow.☆15Sep 29, 2021Updated 4 years ago
- Various test models in WNNX format. It can view with `pip install wnetron && wnetron`☆12Jun 22, 2022Updated 3 years ago
- Tensorflow Node.js Examples☆25Mar 4, 2023Updated 3 years ago
- This is an implementation of the audio source separation model as well as the evaluation metrics proposed in the paper "Weakly Informed A…☆11Nov 26, 2019Updated 6 years ago
- Experiments with generating GPT-2 fanfiction on specified topics.☆11Jun 2, 2019Updated 6 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆26Sep 23, 2020Updated 5 years ago
- Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena.☆44Nov 2, 2022Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Improving Low-Resource Neural Machine Translation of Related Languages by Transfer Learning☆19Nov 3, 2022Updated 3 years ago
- Official repository for U-SAM (Interspeech 2025)☆26Jun 3, 2025Updated 10 months ago
- A complete training recipe for kaldi-based Automatic Lyrics Transcription.☆31Nov 30, 2021Updated 4 years ago
- An end to end ASR Transformer model training repo☆13Dec 8, 2021Updated 4 years ago
- Wav2Vec 2.0 catalan training scripts and models☆12Jun 18, 2021Updated 4 years ago
- Pytorch implementation of Deepmind's WaveRNN model☆13Apr 5, 2020Updated 6 years ago
- ☆13Nov 26, 2019Updated 6 years ago
- A simple and humble image captioning application, based on a neural network built with Keras☆10Sep 23, 2022Updated 3 years ago
- ☆18Dec 29, 2018Updated 7 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Code for "Jukebox: A Generative Model for Music"☆18Dec 15, 2020Updated 5 years ago
- ☆203Feb 22, 2022Updated 4 years ago
- ☆15Mar 31, 2022Updated 4 years ago
- ☆15Sep 26, 2022Updated 3 years ago
- A simple program on how you can use tensor-board for visualization and how you can freeze your model graph and later use if for testing☆14Nov 6, 2018Updated 7 years ago
- Code for ACL 2023 main conference paper "CMOT: Cross-modal Mixup via Optimal Transport for Speech Translation"☆17Oct 29, 2024Updated last year
- ☆16Jan 4, 2022Updated 4 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆111Aug 31, 2022Updated 3 years ago
- Urban Sound Classification : striving towards a fair comparison☆17Dec 11, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Contains code to demonstrate distributed training in TensorFlow 2 with AI Platform and custom Docker contains.☆20Apr 28, 2021Updated 4 years ago
- Quran Offline powered by Electron, React, NeDB☆14Apr 9, 2018Updated 8 years ago
- PyTorch implementation of Robust Subspace Recovery Layer for Unsupervised Anomaly Detection https://arxiv.org/abs/1904.00152☆15Apr 26, 2021Updated 4 years ago
- Source code for ACL 2023 paper "End-to-End Simultaneous Speech Translation with Differentiable Segmentation"☆37Dec 6, 2023Updated 2 years ago
- Discussion of Islamic projects and tools that should be developed (see issues).☆12Dec 1, 2019Updated 6 years ago
- Build a Movie Reviews Sentiment Classifier with Google's BERT Language Model☆13Oct 23, 2019Updated 6 years ago
- Mellotron singing synthesizer using CPU☆13Mar 24, 2023Updated 3 years ago