chuachinhon/wav2vec2_transformers

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/chuachinhon/wav2vec2_transformers)

chuachinhon / wav2vec2_transformers

Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downstream tasks like translation and summarisation.

☆32

Alternatives and similar repositories for wav2vec2_transformers

Users that are interested in wav2vec2_transformers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

oliverguhr / wav2vec2-live
View on GitHub
A live speech recognition using Facebooks wav2vec 2.0 model.
☆378Feb 4, 2024Updated 2 years ago
helemanc / audio-based-lyrics-matching
View on GitHub
Official Implementation of the paper "Leveraging Whisper Embeddings for Audio-based Lyrics Matching"
☆17Apr 23, 2026Updated 3 months ago
sayakpaul / Generating-categories-from-arXiv-paper-titles
View on GitHub
This project takes the arXiv dataset and builds an automatic tag classifier from the arXiv article/paper titles
☆13Aug 18, 2021Updated 4 years ago
sayakpaul / MLPMixer-jax2tf
View on GitHub
This repository hosts code for converting the original MLP Mixer models (JAX) to TensorFlow.
☆15Sep 29, 2021Updated 4 years ago
nikhil-vartak / json-to-html-converter
View on GitHub
Converts JSON data to HTML table with collapsible details view for nested objects.
☆14May 1, 2021Updated 5 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
loretoparisi / tensorflow-node-examples
View on GitHub
Tensorflow Node.js Examples
☆25Mar 4, 2023Updated 3 years ago
elmahdik / wasabi_project
View on GitHub
☆13Nov 26, 2019Updated 6 years ago
luca-ant / WhatsSee
View on GitHub
A simple and humble image captioning application, based on a neural network built with Keras
☆10Sep 23, 2022Updated 3 years ago
ictnlp / BT4ST
View on GitHub
Code for ACL 2023 main conference paper "Back Translation for Speech-to-text Translation Without Transcripts".
☆11Oct 25, 2023Updated 2 years ago
ashwanitanwar / nmt-transfer-learning-xlm-r
View on GitHub
Improving Low-Resource Neural Machine Translation of Related Languages by Transfer Learning
☆20Nov 3, 2022Updated 3 years ago
MingjieChen / LowResourceVC
View on GitHub
Voice conversion training with 109 speakers with limited training samples
☆35Dec 21, 2020Updated 5 years ago
MegEngine / End-to-end-ASR-Transformer
View on GitHub
An end to end ASR Transformer model training repo
☆13Dec 8, 2021Updated 4 years ago
geneing / WaveRNN
View on GitHub
Pytorch implementation of Deepmind's WaveRNN model
☆13Apr 5, 2020Updated 6 years ago
SMarioMan / jukebox
View on GitHub
Code for "Jukebox: A Generative Model for Music"
☆18Dec 15, 2020Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
elliotvilhelm / Spam-Detector-LSTM
View on GitHub
A Tensorflow LSTM spam detector utilizing GloVe word embeddings.
☆12Nov 9, 2019Updated 6 years ago
jonatasgrosman / wav2vec2-sprint
View on GitHub
☆206Feb 22, 2022Updated 4 years ago
unoti / voice-embeddings
View on GitHub
Audio processing using deep neural networks. Speaker identification using voice embeddings.
☆13Dec 8, 2022Updated 3 years ago
linzhiqiu / continual-learning
View on GitHub
☆15Mar 31, 2022Updated 4 years ago
SijieSong / CVPR21-Cogrounding_semantic_attention
View on GitHub
☆14Jul 13, 2021Updated 5 years ago
nuqayah / deen-projects
View on GitHub
Discussion of Islamic projects and tools that should be developed (see issues).
☆12Dec 1, 2019Updated 6 years ago
chitralekha18 / lyrics-aligned-solo-singing-dataset
View on GitHub
☆15Sep 26, 2022Updated 3 years ago
aframires / freesound-loop-annotator
View on GitHub
A web app for annotating Freesound loops, and the tools to analyse the dataset created.
☆20Jul 6, 2023Updated 3 years ago
philschmid / keras-vision-transformer-huggingface
View on GitHub
☆16Jan 4, 2022Updated 4 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
jayshah19949596 / Tensorboard-Visualization-Freezing-Graph
View on GitHub
A simple program on how you can use tensor-board for visualization and how you can freeze your model graph and later use if for testing
☆14Nov 6, 2018Updated 7 years ago
TuringTrain / lyrics_segmentation
View on GitHub
☆13Dec 3, 2019Updated 6 years ago
ictnlp / CRESS
View on GitHub
Code for ACL 2023 main conference paper "Understanding and Bridging the Modality Gap for Speech Translation".
☆16Oct 25, 2023Updated 2 years ago
patrickvonplaten / Wav2Vec2_PyCTCDecode
View on GitHub
Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode
☆110Aug 31, 2022Updated 3 years ago
ejabu / quran-tron
View on GitHub
Quran Offline powered by Electron, React, NeDB
☆13Apr 9, 2018Updated 8 years ago
sniemi / DataAnalysisPythonCourse
View on GitHub
Data Analysis and Image Processing Python Course
☆12Nov 4, 2014Updated 11 years ago
multitel-ai / urban-sound-classification-and-comparison
View on GitHub
Urban Sound Classification : striving towards a fair comparison
☆17Dec 11, 2020Updated 5 years ago
sayakpaul / Distributed-Training-in-TensorFlow-2-with-AI-Platform
View on GitHub
Contains code to demonstrate distributed training in TensorFlow 2 with AI Platform and custom Docker contains.
☆20Apr 28, 2021Updated 5 years ago
pranaysawant / Memes-Classification-Model-End-to-End-Solution
View on GitHub
Nowdays there are so many memes picture are share over internet. Over whatsApp so many people share memes image then we have to take effo…
☆14Feb 28, 2020Updated 6 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Hixie / house-of-rooves
View on GitHub
Some Flutter applications to interact with all the home automation features in my home.
☆16Jan 25, 2021Updated 5 years ago
seven1240 / VoiceEvents
View on GitHub
A combination of Ruby/Rails gem and Erlang to subscribe/route/deliver FreeSWITCH events
☆15Nov 20, 2009Updated 16 years ago
neso613 / ASR_TFLite
View on GitHub
Collection of ASR models for English TFLite models for faster inference.
☆14Feb 21, 2022Updated 4 years ago
ictnlp / DiSeg
View on GitHub
Source code for ACL 2023 paper "End-to-End Simultaneous Speech Translation with Differentiable Segmentation"
☆37Dec 6, 2023Updated 2 years ago
mathigatti / MellotronCPU
View on GitHub
Mellotron singing synthesizer using CPU
☆13Mar 24, 2023Updated 3 years ago
YLQY / WhisperMultitaskFinetuning
View on GitHub
关于Whisper语音大模型的多任务微调
☆16Oct 3, 2024Updated last year
qiuyue1993 / Notes
View on GitHub
Research Notes
☆11Sep 13, 2020Updated 5 years ago