Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downstream tasks like translation and summarisation.
☆32Mar 20, 2021Updated 5 years ago
Alternatives and similar repositories for wav2vec2_transformers
Users that are interested in wav2vec2_transformers are comparing it to the libraries listed below.
- ☆11Nov 5, 2021Updated 4 years ago
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆11May 14, 2025Updated last year
- Live speech recognition using Facebook's wav2vec 2.0 model.☆378Feb 4, 2024Updated 2 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆75Oct 11, 2021Updated 4 years ago
- ☆13Sep 8, 2020Updated 5 years ago
- This project takes the arXiv dataset and builds an automatic tag classifier from the arXiv article/paper titles☆13Aug 18, 2021Updated 4 years ago
- Tensorflow Node.js Examples☆25Mar 4, 2023Updated 3 years ago
- This is an implementation of the audio source separation model as well as the evaluation metrics proposed in the paper "Weakly Informed A…☆12Nov 26, 2019Updated 6 years ago
- ☆12Oct 9, 2025Updated 7 months ago
- Experiments with generating GPT-2 fanfiction on specified topics.☆11Jun 2, 2019Updated 6 years ago
- Code for ACL 2023 main conference paper "Back Translation for Speech-to-text Translation Without Transcripts".☆11Oct 25, 2023Updated 2 years ago
- Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena.☆44Nov 2, 2022Updated 3 years ago
- Improving Low-Resource Neural Machine Translation of Related Languages by Transfer Learning☆19Nov 3, 2022Updated 3 years ago
- Voice conversion training with 109 speakers with limited training samples☆35Dec 21, 2020Updated 5 years ago
- A complete training recipe for kaldi-based Automatic Lyrics Transcription.☆32Nov 30, 2021Updated 4 years ago
- Unsupervised Speech Decomposition via Triple Information Bottleneck☆14Apr 29, 2020Updated 6 years ago
- Converts JSON data to HTML table with collapsible details view for nested objects.☆14May 1, 2021Updated 5 years ago
- An end to end ASR Transformer model training repo☆13Dec 8, 2021Updated 4 years ago
- Pytorch implementation of Deepmind's WaveRNN model☆13Apr 5, 2020Updated 6 years ago
- ☆13Nov 26, 2019Updated 6 years ago
- A simple and humble image captioning application, based on a neural network built with Keras☆10Sep 23, 2022Updated 3 years ago
- A Tensorflow LSTM spam detector utilizing GloVe word embeddings.☆12Nov 9, 2019Updated 6 years ago
- Audio processing using deep neural networks. Speaker identification using voice embeddings.☆13Dec 8, 2022Updated 3 years ago
- ☆205Feb 22, 2022Updated 4 years ago
- ☆15Mar 31, 2022Updated 4 years ago
- ☆15Sep 26, 2022Updated 3 years ago
- A selective noise filter architecture driven by a CNN and Wiener filter☆17Nov 21, 2019Updated 6 years ago
- Code for ACL 2023 main conference paper "CMOT: Cross-modal Mixup via Optimal Transport for Speech Translation"☆17Oct 29, 2024Updated last year
- ☆16Jan 4, 2022Updated 4 years ago
- ☆14Dec 3, 2019Updated 6 years ago
- Code for ACL 2023 main conference paper "Understanding and Bridging the Modality Gap for Speech Translation".☆16Oct 25, 2023Updated 2 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆111Aug 31, 2022Updated 3 years ago
- Urban Sound Classification : striving towards a fair comparison☆17Dec 11, 2020Updated 5 years ago
- Contains code to demonstrate distributed training in TensorFlow 2 with AI Platform and custom Docker containers.☆20Apr 28, 2021Updated 5 years ago
- Source code for ACL 2023 paper "End-to-End Simultaneous Speech Translation with Differentiable Segmentation"☆37Dec 6, 2023Updated 2 years ago
- Discussion of Islamic projects and tools that should be developed (see issues).☆12Dec 1, 2019Updated 6 years ago
- Mellotron singing synthesizer using CPU☆12Mar 24, 2023Updated 3 years ago
- Some Flutter applications to interact with all the home automation features in my home.☆16Jan 25, 2021Updated 5 years ago
- Simple Telegram bot to annotate and verify automatic speech recognition datasets☆12Mar 30, 2021Updated 5 years ago