Demo and samples for universal speech translator
☆24Nov 15, 2022Updated 3 years ago
Alternatives and similar repositories for speech_translation
Users that are interested in speech_translation are comparing it to the libraries listed below.
- ☆46Aug 6, 2025Updated 7 months ago
- Repository for the paper "Irgun: Improved residue based gradual up-scaling network for single image super resolutio…☆16Aug 26, 2020Updated 5 years ago
- ☆20Nov 22, 2020Updated 5 years ago
- Helping build fair, safe, ethical, and RIGHT General Artificial Intelligence and helping to introduce humanity to (RG)AI through the magi…☆11Aug 6, 2020Updated 5 years ago
- CVSS: A Massively Multilingual Speech-to-Speech Translation Corpus☆221Aug 26, 2022Updated 3 years ago
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.☆13Oct 11, 2022Updated 3 years ago
- Lip and hair color editor using face parsing maps.☆11Jun 10, 2019Updated 6 years ago
- One-shot TTS with Improved Unseen Speaker and Style Transfer☆37Mar 2, 2022Updated 4 years ago
- [DEPRECATED] Symbolic MIDI Music AI implementation☆20Jun 11, 2022Updated 3 years ago
- Pytorch library for factorized L0-based pruning.☆45Oct 10, 2023Updated 2 years ago
- PyTorch implementation of Listen Attend and Spell Automatic Speech Recognition (ASR).☆39Jul 25, 2019Updated 6 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Mar 30, 2021Updated 4 years ago
- Github repository for ACL 2025 paper: VoxEval: Benchmarking the Knowledge Understanding Capabilities of End-to-End Spoken Language Models☆24Jun 16, 2025Updated 9 months ago
- This project shows how to build a simple handwriting recognizer in Keras with the IAM dataset.☆13Aug 15, 2021Updated 4 years ago
- Deep voice 3 + WORLD vocoder.☆17Jan 7, 2020Updated 6 years ago
- Transformer-based online speech recognition system with TensorFlow 2☆26Jan 22, 2021Updated 5 years ago
- Text Classification Dataset for Turkish Language☆10Nov 16, 2021Updated 4 years ago
- Tacotron2 with BERT examples☆10Jul 8, 2019Updated 6 years ago
- Implements a proof-of-concept of a multi-level clustering algorithm designed to enable extremely fast approximate match search in a large…☆12Feb 24, 2013Updated 13 years ago
- Pytorch implementation of GauGAN, from https://arxiv.org/abs/1903.07291 (Park et al. 2019)☆14Oct 7, 2020Updated 5 years ago
- A generative model that could generate photo-realistic face images from hand-sketch face images.☆16Jun 17, 2022Updated 3 years ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆32Apr 8, 2022Updated 3 years ago
- Directed masked autoencoders☆14Mar 17, 2026Updated last week
- https://pypi.org/project/intent-suggestions/☆10Sep 6, 2022Updated 3 years ago
- A toolkit for the Spoken Language Understanding Evaluation (SLUE) benchmark. Refer to the paper https://arxiv.org/abs/2111.10367 for more details. O…☆66Feb 26, 2024Updated 2 years ago
- Python library for audio augmentation☆85Jul 6, 2023Updated 2 years ago
- [ICASSP'22] Integer-only Zero-shot Quantization for Efficient Speech Recognition☆34Oct 11, 2021Updated 4 years ago
- Real time multi style transfer implementation in PyTorch☆18Sep 17, 2019Updated 6 years ago
- Script to perform statistical significance test between ASR hypotheses.☆23Aug 13, 2017Updated 8 years ago
- neuralpy - neural network library written in python☆12Jun 25, 2023Updated 2 years ago
- Pre-trained models for Honk☆11Apr 1, 2019Updated 6 years ago
- [ASRU 2021] Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition☆219Jun 22, 2023Updated 2 years ago
- Alphabot: a screen-less interactive spelling primer powered by computer vision☆14Sep 11, 2018Updated 7 years ago
- Backpropagable pytorch implementation of https://craffel.github.io/mir_eval/.☆35Jul 8, 2024Updated last year
- Unsupervised Anomaly Detection via Deep Metric Learning with End-to-End Optimization☆12Mar 23, 2023Updated 3 years ago
- A python package that allows you to generate text with just a few lines of code using GPT2.☆21Mar 4, 2021Updated 5 years ago
- YOLO reimplementation in Caffe, written with a Python layer.☆14Apr 11, 2017Updated 8 years ago
- Retrieval-augmented Image Captioning☆13Feb 16, 2023Updated 3 years ago
- Resources related to EMNLP 2021 paper "FAME: Feature-Based Adversarial Meta-Embeddings for Robust Input Representations"☆13Dec 14, 2021Updated 4 years ago