Demo and samples for universal speech translator
☆24Nov 15, 2022Updated 3 years ago
Alternatives and similar repositories for speech_translation
Users that are interested in speech_translation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- Transformer based ASR Engine.☆13Aug 23, 2021Updated 4 years ago
- Simple Telegram bot to annotate and varify automatic speech recognition datasets☆12Mar 30, 2021Updated 5 years ago
- This repository is a repository for the paper, "Irgun: Improved residue based gradual up-scaling network for single image super resolutio…☆16Aug 26, 2020Updated 5 years ago
- CMU multilingual speech repository☆30Apr 15, 2022Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆20Nov 22, 2020Updated 5 years ago
- Helping build fair, safe, ethical, and RIGHT General Artificial Intelligence and helping to introduce humanity to (RG)AI through the magi…☆11Aug 6, 2020Updated 5 years ago
- DSing ASR task: Resources and Baseline for an unaccompanied singing ASR.☆19Nov 23, 2021Updated 4 years ago
- Vu Tran, Gihan Jayatilaka, Ashwin Ashok and Archan Misra, 2021, April. Deeplight : Robust & Unobtrusive Real-time Screen-Camera Communica…☆14Feb 10, 2023Updated 3 years ago
- A wrapper for Audeering's wav2vec-based dimensional speech emotion recognition☆22Aug 9, 2023Updated 2 years ago
- CVSS: A Massively Multilingual Speech-to-Speech Translation Corpus☆222Aug 26, 2022Updated 3 years ago
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.☆13Oct 11, 2022Updated 3 years ago
- Lip and hair color editor using face parsing maps.☆11Jun 10, 2019Updated 6 years ago
- One-shot TTS with Improved Unseen Speaker and Style Transfer☆37Mar 2, 2022Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [DEPRECIATED] Symbolic MIDI Music AI implementation☆20Jun 11, 2022Updated 3 years ago
- [DEPRECEATED] A miniature replica of OpenAI's MuseNet☆17Sep 11, 2022Updated 3 years ago
- PyTorch implementation of Listen Attend and Spell Automatic Speech Recognition (ASR).☆39Jul 25, 2019Updated 6 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Mar 30, 2021Updated 5 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆75Oct 11, 2021Updated 4 years ago
- Github repository for ACL 2025 paper: VoxEval: Benchmarking the Knowledge Understanding Capabilities of End-to-End Spoken Language Models☆24Jun 16, 2025Updated 11 months ago
- An android VoIP application using native SIP API & ConnectionService (CallKit in iOS) API☆10Mar 13, 2020Updated 6 years ago
- Deep voice 3 + WORLD vocoder.☆16Jan 7, 2020Updated 6 years ago
- Transformer-based online speech recognition system with TensorFlow 2☆26Jan 22, 2021Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Text Classification Dataset for Turkish Language☆10Nov 16, 2021Updated 4 years ago
- Tacotron2 with BERT examples☆10Jul 8, 2019Updated 6 years ago
- Implements a proof-of-concept of a multi-level clustering algorithm designed to enable extremely fast approximate match search in a large…☆12Feb 24, 2013Updated 13 years ago
- Embedding Recycling for Language models☆38Jul 11, 2023Updated 2 years ago
- Pytorch implementation of GauGAN, from https://arxiv.org/abs/1903.07291 (Park et al. 2019)☆14Oct 7, 2020Updated 5 years ago
- A generative model that could generate photo-realistic face images from hand-sketch face images.☆15Jun 17, 2022Updated 3 years ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆32Apr 8, 2022Updated 4 years ago
- Directed masked autoencoders☆15Mar 25, 2026Updated 2 months ago
- ☆13Jun 18, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- https://pypi.org/project/intent-suggestions/☆10Sep 6, 2022Updated 3 years ago
- A toolkit for Spoken Language Understanding Evaluation (SLUE) benchmark. Refer paper https://arxiv.org/abs/2111.10367 for more details. O…☆66Feb 26, 2024Updated 2 years ago
- Python library for audio augmentation☆85Jul 6, 2023Updated 2 years ago
- [ICASSP'22] Integer-only Zero-shot Quantization for Efficient Speech Recognition☆34Oct 11, 2021Updated 4 years ago
- Text-to-Speech tutorial at SLTU 2016☆35May 10, 2016Updated 10 years ago
- Script to perform statistical significance test between ASR hypotheses.☆23Aug 13, 2017Updated 8 years ago
- Real time multi style transfer implementation in PyTorch☆18Sep 17, 2019Updated 6 years ago