chuachinhon / wav2vec2_transformersView external linksLinks
Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downstream tasks like translation and summarisation.
☆32Mar 20, 2021Updated 4 years ago
Alternatives and similar repositories for wav2vec2_transformers
Users that are interested in wav2vec2_transformers are comparing it to the libraries listed below
Sorting:
- ☆10Oct 9, 2025Updated 4 months ago
- Various test models in WNNX format. It can view with `pip install wnetron && wnetron`☆12Jun 22, 2022Updated 3 years ago
- A Tensorflow LSTM spam detector utilizing GloVe word embeddings.☆12Nov 9, 2019Updated 6 years ago
- A simple and humble image captioning application, based on a neural network built with Keras☆10Sep 23, 2022Updated 3 years ago
- Pytorch implementation of Deepmind's WaveRNN model☆13Apr 5, 2020Updated 5 years ago
- Experiments with generating GPT-2 fanfiction on specified topics.☆11Jun 2, 2019Updated 6 years ago
- ☆13Nov 26, 2019Updated 6 years ago
- ☆12Sep 8, 2020Updated 5 years ago
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆11May 14, 2025Updated 8 months ago
- This project takes the arXiv dataset and builds an automatic tag classifier from the arXiv article/paper titles☆13Aug 18, 2021Updated 4 years ago
- Audio processing using deep neural networks. Speaker identification using voice embeddings.☆13Dec 8, 2022Updated 3 years ago
- ☆14Jan 24, 2022Updated 4 years ago
- This repository hosts code for converting the original MLP Mixer models (JAX) to TensorFlow.☆15Sep 29, 2021Updated 4 years ago
- A live speech recognition using Facebooks wav2vec 2.0 model.☆376Feb 4, 2024Updated 2 years ago
- Wave2vec 2.0 Recognize pipeline☆33Dec 22, 2020Updated 5 years ago
- ☆14Dec 3, 2019Updated 6 years ago
- Mellotron singing synthesizer using CPU☆13Mar 24, 2023Updated 2 years ago
- Voice conversion training with 109 speakers with limited training samples☆35Dec 21, 2020Updated 5 years ago
- An ongoing series of notebooks aimed at helping fellow NLP enthusiasts think about applying new tools and techniques to practical tasks.☆18Dec 29, 2020Updated 5 years ago
- ☆15Sep 26, 2022Updated 3 years ago
- ☆18Jan 26, 2023Updated 3 years ago
- generate lyrics with GPT-2☆38Mar 14, 2019Updated 6 years ago
- Build a Movie Reviews Sentiment Classifier with Google's BERT Language Model☆13Oct 23, 2019Updated 6 years ago
- Code for "Jukebox: A Generative Model for Music"☆18Dec 15, 2020Updated 5 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆75Oct 11, 2021Updated 4 years ago
- Conditioned U-Net for Music Source Separation☆20May 15, 2021Updated 4 years ago
- Contains code to demonstrate distributed training in TensorFlow 2 with AI Platform and custom Docker contains.☆20Apr 28, 2021Updated 4 years ago
- ☆22Sep 26, 2022Updated 3 years ago
- Code for "Mixed Cross Entropy Loss for Neural Machine Translation"☆20Jul 23, 2021Updated 4 years ago
- ☆30Jun 23, 2022Updated 3 years ago
- Implementation of Insertion-deletion Denoising Diffusion Probabilistic Models☆30May 31, 2022Updated 3 years ago
- ☆26Mar 5, 2018Updated 7 years ago
- ☆27Mar 13, 2021Updated 4 years ago
- Wanwu models release, code will be released soon☆24Aug 24, 2022Updated 3 years ago
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis☆55Oct 15, 2021Updated 4 years ago
- Implementations of Amazon SageMaker-compatible custom containers for training.☆25Jan 3, 2021Updated 5 years ago
- MSDS593 -- Exploratory data analysis (EDA) at the University of San Francisco☆25Jun 24, 2021Updated 4 years ago
- ☆31Feb 28, 2021Updated 4 years ago
- R Ultimate 2023 - R for Data Science and Machine Learning, by Packt Publishing☆15Dec 15, 2025Updated last month