Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downstream tasks like translation and summarisation.
☆32Mar 20, 2021Updated 4 years ago
Alternatives and similar repositories for wav2vec2_transformers
Users that are interested in wav2vec2_transformers are comparing it to the libraries listed below
Sorting:
- ☆11Nov 5, 2021Updated 4 years ago
- ☆10Oct 9, 2025Updated 4 months ago
- Tensorflow Node.js Examples☆25Mar 4, 2023Updated 3 years ago
- Various test models in WNNX format. It can view with `pip install wnetron && wnetron`☆12Jun 22, 2022Updated 3 years ago
- A Tensorflow LSTM spam detector utilizing GloVe word embeddings.☆12Nov 9, 2019Updated 6 years ago
- Pytorch implementation of Deepmind's WaveRNN model☆13Apr 5, 2020Updated 5 years ago
- A simple and humble image captioning application, based on a neural network built with Keras☆10Sep 23, 2022Updated 3 years ago
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆11May 14, 2025Updated 9 months ago
- Audio processing using deep neural networks. Speaker identification using voice embeddings.☆13Dec 8, 2022Updated 3 years ago
- ☆15Mar 31, 2022Updated 3 years ago
- This repository hosts code for converting the original MLP Mixer models (JAX) to TensorFlow.☆15Sep 29, 2021Updated 4 years ago
- ☆14Jan 24, 2022Updated 4 years ago
- Wave2vec 2.0 Recognize pipeline☆33Dec 22, 2020Updated 5 years ago
- ☆14Dec 3, 2019Updated 6 years ago
- Voice conversion training with 109 speakers with limited training samples☆35Dec 21, 2020Updated 5 years ago
- ☆18Jan 26, 2023Updated 3 years ago
- An ongoing series of notebooks aimed at helping fellow NLP enthusiasts think about applying new tools and techniques to practical tasks.☆18Dec 29, 2020Updated 5 years ago
- ☆15Sep 26, 2022Updated 3 years ago
- A complete training recipe for kaldi-based Automatic Lyrics Transcription.☆31Nov 30, 2021Updated 4 years ago
- generate lyrics with GPT-2☆38Mar 14, 2019Updated 6 years ago
- Code for "Jukebox: A Generative Model for Music"☆18Dec 15, 2020Updated 5 years ago
- Build a Movie Reviews Sentiment Classifier with Google's BERT Language Model☆13Oct 23, 2019Updated 6 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆75Oct 11, 2021Updated 4 years ago
- Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena.☆44Nov 2, 2022Updated 3 years ago
- ☆22Sep 26, 2022Updated 3 years ago
- Contains code to demonstrate distributed training in TensorFlow 2 with AI Platform and custom Docker contains.☆20Apr 28, 2021Updated 4 years ago
- Conditioned U-Net for Music Source Separation☆20May 15, 2021Updated 4 years ago
- Code for "Mixed Cross Entropy Loss for Neural Machine Translation"☆20Jul 23, 2021Updated 4 years ago
- Get an OpenCV video capture from an YouTube video URL☆27Aug 26, 2024Updated last year
- ☆37Sep 21, 2025Updated 5 months ago
- Implementations of Amazon SageMaker-compatible custom containers for training.☆25Jan 3, 2021Updated 5 years ago
- Wanwu models release, code will be released soon☆24Aug 24, 2022Updated 3 years ago
- MSDS593 -- Exploratory data analysis (EDA) at the University of San Francisco☆25Jun 24, 2021Updated 4 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆26Sep 23, 2020Updated 5 years ago
- Python module to load and use native Vamp plugins for audio feature analysis.☆31Aug 12, 2023Updated 2 years ago
- ☆31Feb 28, 2021Updated 5 years ago
- R Ultimate 2023 - R for Data Science and Machine Learning, by Packt Publishing☆15Dec 15, 2025Updated 2 months ago
- Library for translating between 200 languages. Built on 🤗 transformers.☆497Sep 2, 2024Updated last year
- "Neural Loop Combiner: Neural Network Models For Assessing The Compatibility of Loops", ISMIR 2020☆33Nov 8, 2020Updated 5 years ago