RedHenLab / ASR-for-Chinese-Pipeline
Google Summer of Code 2018 Project: Automatic Speech Recognition for Speech-to-Text on Chinese
☆10Updated 6 years ago
Alternatives and similar repositories for ASR-for-Chinese-Pipeline:
Users that are interested in ASR-for-Chinese-Pipeline are comparing it to the libraries listed below
- An JS web client for connecting to Pipecat bots with voice and vision☆43Updated last month
- Dalle service☆50Updated 3 years ago
- Tensorflow Implementation of Deep Voice 3☆453Updated 6 years ago
- ☆20Updated 10 months ago
- A mono-repo to house the various supported Transport options to be used with Pipecat's client-js package☆12Updated 2 weeks ago
- Nougat is a Meta AI's revolutionary OCR model designed to transcribe scientific PDFs into an easy-to-use Markdown format.☆22Updated last year
- The logo of Apples "Spring Loaded" event implemented in Remotion.☆13Updated 11 months ago
- Convert ppt to video with audio track, using text to speech synthesis☆59Updated 6 years ago
- The primary backend service for Atila apps.☆40Updated 2 months ago
- Official Deepgram resources for deploying Deepgram services in a self-hosted environment☆12Updated this week
- A service which wraps and chains video and audio Hugging Face Spaces together☆13Updated 4 months ago
- Code for Talking Face Generation by Adversarially Disentangled Audio-Visual Representation (AAAI 2019)☆818Updated 3 years ago
- This is a pytorch implementation of the paper: StarGAN-VC: Non-parallel many-to-many voice conversion with star generative adversarial ne…☆518Updated 5 years ago
- Collection of ChatGPT plugins☆103Updated last year
- Application configuration and scripts for search on https://docs.vespa.ai/☆13Updated 2 weeks ago
- nlp study tool☆30Updated last year
- AI-generated talking head video of fake people responding to your input question text.☆69Updated 3 years ago
- ObamaNet : Photo-realistic lip-sync from audio (Unofficial port)☆237Updated 6 years ago
- Turn text into video using Stable Diffusion and Google FILM☆41Updated last year
- Whisper realtime streaming for long speech-to-text transcription and translation☆110Updated last year
- TensorFlow implementation of the CVPR 2018 spotlight paper, Deep Photo Enhancer: Unpaired Learning for Image Enhancement from Photographs…☆795Updated 5 years ago
- execute and test code of various languages within a sandbox runtime that provides a virtualized container environment.☆14Updated 5 years ago
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text☆670Updated 3 years ago
- This repository is a repository for the paper, "Irgun: Improved residue based gradual up-scaling network for single image super resolutio…☆12Updated 4 years ago
- ☆12Updated last year
- PyTorch implementation of GAN-based text-to-speech synthesis and voice conversion (VC)☆516Updated 4 years ago
- ☆24Updated last month
- Sentence Embedding as a Service☆14Updated last year
- A collection of resources for all your Daily needs!☆33Updated 2 years ago