Demo and samples for universal speech translator
☆24Nov 15, 2022Updated 3 years ago
Alternatives and similar repositories for speech_translation
Users that are interested in speech_translation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- Simple Telegram bot to annotate and varify automatic speech recognition datasets☆12Mar 30, 2021Updated 5 years ago
- BERT Baseline for the Natural Questions☆11Jan 24, 2019Updated 7 years ago
- This repository is a repository for the paper, "Irgun: Improved residue based gradual up-scaling network for single image super resolutio…☆16Aug 26, 2020Updated 5 years ago
- ☆20Nov 22, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- DSing ASR task: Resources and Baseline for an unaccompanied singing ASR.☆19Nov 23, 2021Updated 4 years ago
- Vu Tran, Gihan Jayatilaka, Ashwin Ashok and Archan Misra, 2021, April. Deeplight : Robust & Unobtrusive Real-time Screen-Camera Communica…☆14Feb 10, 2023Updated 3 years ago
- CVSS: A Massively Multilingual Speech-to-Speech Translation Corpus☆222Aug 26, 2022Updated 3 years ago
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.☆13Oct 11, 2022Updated 3 years ago
- Lip and hair color editor using face parsing maps.☆11Jun 10, 2019Updated 6 years ago
- One-shot TTS with Improved Unseen Speaker and Style Transfer☆37Mar 2, 2022Updated 4 years ago
- [DEPRECIATED] Symbolic MIDI Music AI implementation☆20Jun 11, 2022Updated 3 years ago
- Pytorch library for factorized L0-based pruning.☆45Oct 10, 2023Updated 2 years ago
- the Tensorflow version of multi-speaker TTS training with feedback constraint☆40Oct 12, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [DEPRECEATED] A miniature replica of OpenAI's MuseNet☆17Sep 11, 2022Updated 3 years ago
- PyTorch implementation of Listen Attend and Spell Automatic Speech Recognition (ASR).☆39Jul 25, 2019Updated 6 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Mar 30, 2021Updated 5 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆75Oct 11, 2021Updated 4 years ago
- Github repository for ACL 2025 paper: VoxEval: Benchmarking the Knowledge Understanding Capabilities of End-to-End Spoken Language Models☆24Jun 16, 2025Updated 10 months ago
- This project shows how to build a simple handwriting recognizer in Keras with the IAM dataset.☆13Aug 15, 2021Updated 4 years ago
- An android VoIP application using native SIP API & ConnectionService (CallKit in iOS) API☆10Mar 13, 2020Updated 6 years ago
- Deep voice 3 + WORLD vocoder.☆17Jan 7, 2020Updated 6 years ago
- Transformer-based online speech recognition system with TensorFlow 2☆26Jan 22, 2021Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Text Classification Dataset for Turkish Language☆10Nov 16, 2021Updated 4 years ago
- Tacotron2 with BERT examples☆10Jul 8, 2019Updated 6 years ago
- Embedding Recycling for Language models☆38Jul 11, 2023Updated 2 years ago
- Pytorch implementation of GauGAN, from https://arxiv.org/abs/1903.07291 (Park et al. 2019)☆14Oct 7, 2020Updated 5 years ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆32Apr 8, 2022Updated 4 years ago
- Directed masked autoencoders☆14Mar 25, 2026Updated last month
- ☆13Jun 18, 2024Updated last year
- https://pypi.org/project/intent-suggestions/☆10Sep 6, 2022Updated 3 years ago
- OCR post processing and spelling correction.☆11Nov 12, 2018Updated 7 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A toolkit for Spoken Language Understanding Evaluation (SLUE) benchmark. Refer paper https://arxiv.org/abs/2111.10367 for more details. O…☆66Feb 26, 2024Updated 2 years ago
- Python library for audio augmentation☆85Jul 6, 2023Updated 2 years ago
- [ICASSP'22] Integer-only Zero-shot Quantization for Efficient Speech Recognition☆34Oct 11, 2021Updated 4 years ago
- Text-to-Speech tutorial at SLTU 2016☆35May 10, 2016Updated 9 years ago
- Script to perform statistical significance test between ASR hypotheses.☆23Aug 13, 2017Updated 8 years ago
- neuralpy - neural network library written in python☆12Jun 25, 2023Updated 2 years ago
- Pre-trained models for Honk☆11Apr 1, 2019Updated 7 years ago